One size doesn't fit all: Predicting the Number of Examples for In-Context Learning (2403.06402v2)
Abstract: In-context learning (ICL) refers to adding a small number of localized examples (ones that are semantically similar to the input) from a training set of labelled data to an LLM's prompt, with the objective of conditioning the generative process and improving downstream task performance. Existing ICL approaches use the same number of examples (a pre-configured hyper-parameter) for every data instance. Our work alleviates the limitations of this 'one size fits all' approach by dynamically predicting, for each data instance, the number of examples to use in few-shot inference with LLMs. Specifically, we employ a multi-label classifier whose parameters are fitted on a training set in which the label for each instance indicates whether using a specific value of k (the number of most similar examples, from 0 up to a maximum) leads to a correct k-shot downstream prediction. Our experiments on a number of text classification benchmarks show that AICL, our adaptive approach, substantially outperforms standard ICL by up to 17%.
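To make the workflow concrete, below is a minimal, self-contained sketch of the approach the abstract describes. It is an illustrative reconstruction under stated assumptions, not the paper's implementation: the `llm_predict` stub, the TF-IDF retriever, the toy data, and the scikit-learn multi-label classifier are placeholders for the actual LLM, similarity function, benchmarks, and classifier used in the paper.

```python
# Sketch of adaptive in-context learning (AICL): a multi-label classifier
# predicts, per test instance, which values of k (number of retrieved
# examples) would yield a correct k-shot prediction.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.multioutput import MultiOutputClassifier

K_MAX = 3  # assumed maximum number of in-context examples

def llm_predict(prompt: str) -> str:
    """Placeholder for a real LLM call; returns a dummy label for illustration."""
    return "positive"

def build_prompt(test_text, examples):
    shots = "\n".join(f"Input: {x}\nLabel: {y}" for x, y in examples)
    return f"{shots}\nInput: {test_text}\nLabel:"

def top_k_examples(sim_row, texts, labels, k):
    idx = np.argsort(-sim_row)[:k]  # indices of the k most similar examples
    return [(texts[i], labels[i]) for i in idx]

# Toy labelled training data standing in for a real benchmark.
train_texts = ["great movie", "terrible plot", "loved it", "boring film"]
train_labels = ["positive", "negative", "positive", "negative"]

# TF-IDF cosine similarity as a stand-in for the paper's example retriever.
vec = TfidfVectorizer().fit(train_texts)
X = vec.transform(train_texts)
sims = (X @ X.T).toarray()
np.fill_diagonal(sims, -np.inf)  # never retrieve an instance as its own shot

# Step 1: for every training instance and every k, record whether k-shot
# inference yields the correct label -> a binary multi-label target vector.
Y = np.zeros((len(train_texts), K_MAX + 1), dtype=int)
for i, (text, gold) in enumerate(zip(train_texts, train_labels)):
    for k in range(K_MAX + 1):
        shots = top_k_examples(sims[i], train_texts, train_labels, k)
        Y[i, k] = int(llm_predict(build_prompt(text, shots)) == gold)

# Step 2: fit a multi-label classifier mapping the input text to that vector.
clf = MultiOutputClassifier(LogisticRegression(max_iter=1000)).fit(X, Y)

# Step 3: at test time, predict the label vector, pick the smallest k that is
# predicted to succeed, then run standard k-shot ICL with that k.
test_text = "what a wonderful story"
x = vec.transform([test_text])
pred_ok = clf.predict(x)[0]
k_star = int(np.argmax(pred_ok)) if pred_ok.any() else K_MAX
test_sims = (x @ X.T).toarray()[0]
shots = top_k_examples(test_sims, train_texts, train_labels, k_star)
print(f"chosen k = {k_star}:", llm_predict(build_prompt(test_text, shots)))
```

Choosing the smallest k that is predicted to succeed keeps prompts short; the paper may use a different selection rule over the predicted label vector, so treat that step as one plausible instantiation.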
Authors: Manish Chandra, Debasis Ganguly, Iadh Ounis