
Grimoire is All You Need for Enhancing Large Language Models (2401.03385v2)

Published 7 Jan 2024 in cs.CL

Abstract: In-context learning (ICL) is one of the key methods for enhancing the performance of LLMs on specific tasks by providing a set of few-shot examples. However, the ICL capability of different types of models varies significantly due to factors such as model architecture, the volume of training data, and parameter size. Generally, the larger the model's parameter count and the more extensive its training data, the stronger its ICL capability. In this paper, we propose SLEICL, a method in which strong LLMs learn from examples, then summarize and transfer the learned skills to weak LLMs for inference and application. This ensures the stability and effectiveness of ICL. Compared to having weak LLMs learn directly from prompt examples, SLEICL reduces the difficulty of ICL for these models. Our experiments, conducted on up to eight datasets with five LLMs, demonstrate that weak LLMs achieve consistent improvements over their own zero-shot and few-shot performance using the SLEICL method. Some weak LLMs even surpass GPT-4-1106-preview (zero-shot) with the aid of SLEICL.
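
The following is a minimal, hypothetical sketch of the two-stage pipeline the abstract describes: a strong LLM first distills few-shot examples into a reusable skill summary (the "grimoire" of the title), which a weak LLM then consumes at inference time in place of raw examples. The `call_llm` helper and both prompt templates are illustrative placeholders, not the paper's actual prompts or API.

```python
# Sketch of the SLEICL pipeline described in the abstract: a strong LLM
# distills few-shot examples into a "grimoire" (a summary of task-solving
# skills), which a weak LLM then applies at inference time.
# NOTE: `call_llm` and the prompt wording below are hypothetical stand-ins,
# not the paper's actual implementation.

def call_llm(model: str, prompt: str) -> str:
    """Placeholder for a real LLM client call (e.g., a chat-completion endpoint)."""
    raise NotImplementedError("wire up your own LLM client here")

def distill_grimoire(strong_model: str, examples: list[tuple[str, str]]) -> str:
    """Stage 1: the strong LLM studies few-shot examples and summarizes
    the skills needed to solve the task."""
    shots = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in examples)
    prompt = (
        "Study the examples below and write a concise guide (a 'grimoire') "
        "explaining how to solve this task, so that a less capable model "
        f"can follow it.\n\n{shots}"
    )
    return call_llm(strong_model, prompt)

def answer_with_grimoire(weak_model: str, grimoire: str, query: str) -> str:
    """Stage 2: the weak LLM answers using the transferred skill summary
    instead of the raw few-shot examples."""
    prompt = f"Task guide:\n{grimoire}\n\nInput: {query}\nOutput:"
    return call_llm(weak_model, prompt)
```

In practice, `strong_model` would be a GPT-4-class model and `weak_model` a smaller open model; the key design point is that only the distilled grimoire, not the examples themselves, is passed to the weak model, which is what reduces the ICL burden on it.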

Authors (7)
  1. Ding Chen
  2. Shichao Song
  3. Qingchen Yu
  4. Zhiyu Li
  5. Wenjin Wang
  6. Feiyu Xiong
  7. Bo Tang
Citations (4)