Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning (2406.10908v5)

Published 16 Jun 2024 in cs.CL

Abstract: Effective organization of in-context learning (ICL) demonstrations is key to improving the quality of LLM responses. To create better sample-label pairs that instruct LLM understanding, we introduce logit separability, a criterion to assess the clarity of both samples and class-related words at the logit level. This facilitates the optimization of sample and label selection, enhancing the precision of information provided in ICL demonstrations. Additionally, we find that incorporating multiple class-related words for each sample, rather than relying on a single class name, improves performance by offering a broader range of label information. Building on these insights, we propose LICL, a logit separability-based method that jointly organizes samples and integrates multiple class-related words into each sample-label pair. Evaluations across seven classification datasets show that this approach significantly improves ICL performance by providing clearer instructions and richer label information.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Zhu Zixiao (1 paper)
  2. Feng Zijian (1 paper)
  3. Zhou Hanzhang (1 paper)
  4. Qian Junlang (1 paper)
  5. Mao Kezhi (1 paper)