
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations (2212.09865v2)

Published 19 Dec 2022 in cs.CL and cs.AI

Abstract: Although LLMs can be prompted for both zero- and few-shot learning, performance drops significantly when no demonstrations are available. In this paper, we introduce Z-ICL, a new zero-shot method that closes the gap by constructing pseudo-demonstrations for a given test input using a raw text corpus. Concretely, pseudo-demonstrations are constructed by (1) finding the nearest neighbors to the test input from the corpus and pairing them with random task labels, and (2) applying a set of techniques to reduce the amount of direct copying the model does from the resulting demonstrations. Evaluation on nine classification datasets shows that Z-ICL outperforms previous zero-shot methods by a significant margin, and is on par with in-context learning with labeled training data in the few-shot setting. Overall, Z-ICL provides a significantly higher estimate of the zero-shot performance levels of a model, and supports future efforts to develop better pseudo-demonstrations that further improve zero-shot results.
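Step (1) of the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the bag-of-words cosine similarity, the function names (`bow`, `cosine`, `build_pseudo_demos`, `format_prompt`), the "Review:/Sentiment:" template, and the parameters `k` and `seed` are all assumptions made for illustration. The copy-reduction techniques of step (2) are not described in the abstract and are therefore left out.

```python
import math
import random
from collections import Counter


def bow(text):
    """Bag-of-words counts; a stand-in for whatever sentence encoder is used."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_pseudo_demos(test_input, corpus, labels, k=3, seed=0):
    """Pick the k corpus sentences nearest to test_input and pair each
    with a randomly drawn task label (hypothetical helper)."""
    rng = random.Random(seed)
    query = bow(test_input)
    ranked = sorted(corpus, key=lambda s: cosine(query, bow(s)), reverse=True)
    return [(sentence, rng.choice(labels)) for sentence in ranked[:k]]


def format_prompt(demos, test_input):
    """Concatenate pseudo-demonstrations and the test input into one prompt.
    The template is an assumed example for a sentiment task."""
    blocks = [f"Review: {s}\nSentiment: {y}\n" for s, y in demos]
    blocks.append(f"Review: {test_input}\nSentiment:")
    return "\n".join(blocks)


if __name__ == "__main__":
    corpus = [
        "the movie was great fun",
        "stock prices fell sharply today",
        "a great movie with a fine cast",
        "rain is expected tomorrow",
    ]
    demos = build_pseudo_demos("the movie was great", corpus, ["positive", "negative"], k=2)
    print(format_prompt(demos, "the movie was great"))
```

The labels attached to the retrieved sentences are drawn at random and carry no information about the sentences themselves; the sketch only shows how retrieved text and random labels are assembled into a prompt.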

Authors (5)
  1. Xinxi Lyu (5 papers)
  2. Sewon Min (45 papers)
  3. Iz Beltagy (39 papers)
  4. Luke Zettlemoyer (225 papers)
  5. Hannaneh Hajishirzi (176 papers)
Citations (55)