
Can LLM find the green circle? Investigation and Human-guided tool manipulation for compositional generalization (2312.07763v1)

Published 12 Dec 2023 in cs.CL

Abstract: The meaning of complex phrases in natural language is composed of their individual components. The task of compositional generalization evaluates a model's ability to understand new combinations of components. Previous studies trained smaller, task-specific models, which exhibited poor generalization. While LLMs exhibit impressive generalization abilities on many tasks through in-context learning (ICL), their potential for compositional generalization remains unexplored. In this paper, we first empirically investigate prevailing ICL methods in compositional generalization. We find that they struggle with complex compositional questions due to cumulative errors in long reasoning steps and intricate logic required for tool-making. Consequently, we propose a human-guided tool manipulation framework (HTM) that generates tools for sub-questions and integrates multiple tools. Our method enhances the effectiveness of tool creation and usage with minimal human effort. Experiments show that our method achieves state-of-the-art performance on two compositional generalization benchmarks and outperforms existing methods on the most challenging test split by 70%.
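The abstract describes the core idea of HTM: decompose a compositional question into sub-questions, generate a small tool for each sub-question, and integrate the tools into a single answer procedure. The paper's actual prompts and tool representations are not reproduced here; the following is a minimal sketch of that decompose-and-integrate pattern for the title question "find the green circle", where the grid-world representation, tool names, and helper functions are all hypothetical illustrations rather than the authors' implementation.

```python
# Minimal sketch (not the authors' code) of tools for sub-questions plus
# tool integration. The Grid format and all function names are assumptions.
from typing import Callable, List, Tuple

Grid = List[List[dict]]  # each cell: {"shape": str, "color": str}
Tool = Callable[[Grid], List[Tuple[int, int]]]

def make_color_filter(color: str) -> Tool:
    """Tool for the sub-question 'which cells are <color>?'."""
    def tool(grid: Grid) -> List[Tuple[int, int]]:
        return [(r, c) for r, row in enumerate(grid)
                for c, cell in enumerate(row) if cell["color"] == color]
    return tool

def make_shape_filter(shape: str) -> Tool:
    """Tool for the sub-question 'which cells contain a <shape>?'."""
    def tool(grid: Grid) -> List[Tuple[int, int]]:
        return [(r, c) for r, row in enumerate(grid)
                for c, cell in enumerate(row) if cell["shape"] == shape]
    return tool

def integrate(*tools: Tool) -> Tool:
    """Combine sub-question tools by intersecting their answers."""
    def combined(grid: Grid) -> List[Tuple[int, int]]:
        results = [set(t(grid)) for t in tools]
        return sorted(set.intersection(*results))
    return combined

# "Find the green circle" = intersection of two simpler tools.
grid = [
    [{"shape": "circle", "color": "green"}, {"shape": "square", "color": "red"}],
    [{"shape": "circle", "color": "red"},   {"shape": "square", "color": "green"}],
]
find_green_circle = integrate(make_color_filter("green"), make_shape_filter("circle"))
print(find_green_circle(grid))  # [(0, 0)]
```

Composing short, verifiable tools per sub-question sidesteps the long free-form reasoning chains that the abstract identifies as the main source of cumulative errors in plain in-context learning.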

Authors (6)
  1. Min Zhang (630 papers)
  2. Jianfeng He (32 papers)
  3. Shuo Lei (10 papers)
  4. Murong Yue (8 papers)
  5. Linhang Wang (1 paper)
  6. Chang-Tien Lu (54 papers)
Citations (4)