
Exploring Lottery Prompts for Pre-trained Language Models (2305.19500v1)

Published 31 May 2023 in cs.CL

Abstract: Consistently scaling pre-trained language models (PLMs) imposes substantial burdens on model adaptation, necessitating more efficient alternatives to conventional fine-tuning. Given the advantage of prompting in the zero-shot setting and the observed performance fluctuation among different prompts, we explore instance-level prompts and their generalizability. By searching through the prompt space, we first validate the assumption that for every instance, there is almost always a lottery prompt that induces the correct prediction from the PLM, and such a prompt can be obtained at a low cost thanks to the inherent ability of PLMs. Meanwhile, we find that some strong lottery prompts have high performance over the whole training set, and they are equipped with distinguishable linguistic features. Lastly, we attempt to generalize the searched strong lottery prompts to unseen data with a prompt ensembling method without any parameter tuning. Experiments are conducted on various types of NLP classification tasks and demonstrate that the proposed method achieves results comparable to other gradient-free and optimization-free baselines.
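To make the two ideas in the abstract concrete, here is a minimal, illustrative Python sketch (not the authors' code or prompt space): it searches a small set of templates for an instance-level lottery prompt that elicits the correct prediction from a masked PLM, and ensembles strong prompts on unseen data with a simple majority vote, without any parameter tuning. The model name (roberta-base), the templates, and the sentiment verbalizer are assumptions for illustration only.

```python
from transformers import pipeline

# Hypothetical setup: model, templates, and verbalizer are illustrative
# assumptions, not the paper's actual search space.
fill = pipeline("fill-mask", model="roberta-base")

TEMPLATES = [
    "{text} It was <mask>.",
    "{text} The sentiment is <mask>.",
    "{text} Overall it felt <mask>.",
]
VERBALIZER = {"positive": ["great", "good"], "negative": ["terrible", "bad"]}


def predict(template, text):
    """Score each label by the mask probability of its verbalizer words."""
    preds = fill(template.format(text=text), top_k=50)
    scores = {label: 0.0 for label in VERBALIZER}
    for p in preds:
        token = p["token_str"].strip()
        for label, words in VERBALIZER.items():
            if token in words:
                scores[label] += p["score"]
    return max(scores, key=scores.get)


def find_lottery_prompt(text, gold_label):
    """Return the first template that induces the correct prediction, if any."""
    for t in TEMPLATES:
        if predict(t, text) == gold_label:
            return t
    return None


def ensemble_predict(text, strong_templates):
    """Majority vote over strong lottery prompts; no parameter tuning involved."""
    votes = [predict(t, text) for t in strong_templates]
    return max(set(votes), key=votes.count)
```

In this reading, "strong" lottery prompts would be the templates that predict correctly on a large fraction of the training set; those are the ones passed to the ensemble step on unseen data.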

Authors (7)
  1. Yulin Chen (134 papers)
  2. Ning Ding (122 papers)
  3. Xiaobin Wang (39 papers)
  4. Shengding Hu (34 papers)
  5. Hai-Tao Zheng (94 papers)
  6. Zhiyuan Liu (433 papers)
  7. Pengjun Xie (85 papers)
Citations (5)