
Prompt-Augmented Linear Probing: Scaling beyond the Limit of Few-shot In-Context Learners (2212.10873v3)

Published 21 Dec 2022 in cs.CL and cs.LG

Abstract: Through in-context learning (ICL), large-scale LLMs are effective few-shot learners without additional model fine-tuning. However, ICL performance does not scale well with the number of available training samples, as it is limited by the inherent input length constraint of the underlying LLM. Meanwhile, many studies have revealed that LLMs are also powerful feature extractors, allowing them to be utilized in a black-box manner and enabling the linear probing paradigm, where lightweight discriminators are trained on top of the pre-extracted input representations. This paper proposes prompt-augmented linear probing (PALP), a hybrid of linear probing and ICL that leverages the best of both worlds. PALP inherits the scalability of linear probing and the capability of steering LLMs toward more meaningful representations by tailoring the input into a more conceivable form. Through in-depth investigations on various datasets, we verified that PALP significantly enhances the input representations, closing the gap between ICL in the data-hungry scenario and fine-tuning in the data-abundant scenario with little training overhead, potentially making PALP a strong alternative in a black-box scenario.
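
The abstract describes PALP as prompt augmentation plus linear probing on frozen LLM features. Below is a minimal sketch of that pipeline, assuming a HuggingFace causal LM (here GPT-2) as the black-box feature extractor and scikit-learn's LogisticRegression as the lightweight discriminator; the prompt template and last-token pooling are illustrative choices, not the authors' exact configuration.

```python
# Sketch of prompt-augmented linear probing (PALP): wrap each input in a task
# prompt, extract a representation from a frozen LM, and train a linear probe.
# Model, template, and pooling here are assumptions for illustration only.
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")
model.eval()  # the LM stays frozen; only the probe is trained

def palp_features(texts, template="Review: {x} Sentiment:"):
    """Prompt-augment each input, then use the LM's last-token hidden state
    as its representation."""
    feats = []
    with torch.no_grad():
        for x in texts:
            enc = tokenizer(template.format(x=x), return_tensors="pt")
            hidden = model(**enc).last_hidden_state   # (1, seq_len, dim)
            feats.append(hidden[0, -1].numpy())       # last-token pooling
    return feats

# Unlike ICL, the probe scales to arbitrarily many labelled examples,
# since they are never packed into the LM's context window.
train_texts = ["a gripping, beautifully shot film", "a dull, lifeless mess"]
train_labels = [1, 0]
probe = LogisticRegression(max_iter=1000).fit(palp_features(train_texts), train_labels)
print(probe.predict(palp_features(["an unexpectedly moving story"])))
```

The key design point the abstract highlights is that the prompt template does the "tailoring": the same frozen extractor yields more task-relevant features when the input is phrased as a prompt than when the raw text is embedded directly.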

Authors (7)
  1. Hyunsoo Cho (28 papers)
  2. Hyuhng Joon Kim (11 papers)
  3. Junyeob Kim (7 papers)
  4. Sang-Woo Lee (34 papers)
  5. Sang-goo Lee (40 papers)
  6. Kang Min Yoo (40 papers)
  7. Taeuk Kim (38 papers)
Citations (21)
