Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

How Do In-Context Examples Affect Compositional Generalization? (2305.04835v3)

Published 8 May 2023 in cs.CL and cs.AI

Abstract: Compositional generalization--understanding unseen combinations of seen primitives--is an essential reasoning capability in human intelligence. The AI community mainly studies this capability by fine-tuning neural networks on lots of training samples, while it is still unclear whether and how in-context learning--the prevailing few-shot paradigm based on LLMs--exhibits compositional generalization. In this paper, we present CoFe, a test suite to investigate in-context compositional generalization. We find that the compositional generalization performance can be easily affected by the selection of in-context examples, thus raising the research question what the key factors are to make good in-context examples for compositional generalization. We study three potential factors: similarity, diversity and complexity. Our systematic experiments indicate that in-context examples should be structurally similar to the test case, diverse from each other, and individually simple. Furthermore, two strong limitations are observed: in-context compositional generalization on fictional words is much weaker than that on commonly used ones; it is still critical that the in-context examples should cover required linguistic structures, even though the backbone model has been pre-trained on large corpus. We hope our analysis would facilitate the understanding and utilization of in-context learning paradigm.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Shengnan An (12 papers)
  2. Zeqi Lin (25 papers)
  3. Qiang Fu (159 papers)
  4. Bei Chen (56 papers)
  5. Nanning Zheng (146 papers)
  6. Jian-Guang Lou (69 papers)
  7. Dongmei Zhang (193 papers)
Citations (37)

Summary

We haven't generated a summary for this paper yet.