Compositional Exemplars for In-context Learning
The paper "Compositional Exemplars for In-context Learning" addresses the intricacies of selecting in-context examples for large pre-trained LLMs (LMs) during in-context learning (ICL). The paper introduces Compositional Exemplars for In-context Learning (CEIL), a novel approach utilizing Determinantal Point Processes (DPPs) to enhance the selection process of demonstration examples used when prompting LLMs for unseen tasks, requiring no parameter updates.
Key Contributions
- Reformulating In-context Example Selection: The paper recasts in-context example selection as a subset selection problem, arguing that interactions between examples are crucial for performance. Using DPPs, the authors define a joint probability over entire example sets (as in the sketch above), thereby capturing inter-example interactions that traditional methods, which select examples independently, ignore.
- Learning from Contrastive Objectives: The DPP kernel is refined with a contrastive objective so that it prefers contextually appropriate example subsets. The training signal comes from candidate subsets scored by the LLM itself, according to how much each subset improves output accuracy; a sketch follows after this list.
- Performance on Diverse NLP Tasks: CEIL was validated across 12 datasets spanning 7 tasks, achieving state-of-the-art performance on tasks including sentiment analysis and semantic parsing. Gains were especially notable on complex tasks such as natural language inference, where modeling the nuanced interrelationships between examples is crucial.
- Transferability and Compositionality: Beyond raw accuracy, CEIL demonstrated robustness when transferring learned preferences across different LLMs and datasets, a practical advantage that reduces the need for task-specific retraining. The approach also showed promise on compositional tasks, dynamically adapting the selected examples to cover the components needed for suitable decomposed representations.
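The sketch below illustrates what the contrastive training step might look like. All names here (subset_logdet, contrastive_subset_loss, the quality-diversity kernel decomposition) are assumptions for illustration rather than the paper's exact code, and the loss is a simple InfoNCE-style stand-in: candidate subsets are scored by the inference LM, and the learnable kernel is pushed to assign its highest log-determinant to the subset the LM found most helpful.

```python
# Hedged sketch of contrastive kernel training (illustrative names and loss).
import torch
import torch.nn.functional as F

def subset_logdet(emb: torch.Tensor, relevance: torch.Tensor,
                  subset: list) -> torch.Tensor:
    """log det(L_S) under a learnable quality-diversity kernel
    L_ij = r_i * cos(e_i, e_j) * r_j, a standard DPP decomposition."""
    e = F.normalize(emb[subset], dim=-1)
    r = relevance[subset]
    L_S = r[:, None] * (e @ e.T) * r[None, :]
    # A small ridge keeps the log-determinant numerically stable.
    return torch.logdet(L_S + 1e-4 * torch.eye(len(subset)))

def contrastive_subset_loss(emb, relevance, subsets, lm_scores):
    """Cross-entropy over candidate subsets: the subset the LM scored
    highest is the positive; all other sampled subsets are negatives."""
    logdets = torch.stack([subset_logdet(emb, relevance, s) for s in subsets])
    target = lm_scores.argmax().unsqueeze(0)
    return F.cross_entropy(logdets.unsqueeze(0), target)
```

In the paper, the utility scores come from the inference LM's likelihood of the gold output given each candidate prompt; the precise ranking loss may differ from the cross-entropy stand-in above.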
Implications and Speculations for the Future
The findings imply that modeling the interrelationships among examples is a significant factor in optimizing in-context learning for LLMs. As LMs continue to grow in scale and capability, methods like CEIL will be pivotal for maintaining efficiency and effectiveness in real-world applications, where model parameters and architectures often must remain fixed due to technical or infrastructural constraints.
Future work could further investigate the domain adaptability of CEIL and improve its efficiency for real-time applications. Exploring alternative contrastive frameworks, or scaling CEIL to the longer contexts that newer LMs support, also holds potential for uncovering broader applications and a deeper understanding of ICL dynamics.
Conclusion
The research detailed in "Compositional Exemplars for In-context Learning" marks an innovative step forward in optimizing example selection for in-context learning. CEIL provides a principled framework that balances diversity and relevance within the exemplar set, addressing limitations of previous heuristic-based approaches. Backed by its joint subset formulation and comprehensive validation, CEIL sets a strong new baseline in the ongoing effort to enhance LLMs through effective contextual demonstrations.