Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression (2402.16058v1)
Abstract: LLMs require lengthy prompts as the input context to produce output aligned with user intentions, a process that incurs extra costs during inference. In this paper, we propose the Gist COnditioned deCOding (Gist-COCO) model, introducing a novel method for compressing prompts that can also assist prompt interpretation and engineering. Gist-COCO employs an encoder-decoder based LLM and incorporates an additional encoder as a plugin module to compress prompts together with their inputs using gist tokens. It finetunes the compression plugin module and uses the representations of the gist tokens to emulate the raw prompts in the vanilla LLM. By verbalizing the gist token representations into gist prompts, the compression ability of Gist-COCO can be generalized to different LLMs at high compression rates. Our experiments demonstrate that Gist-COCO outperforms previous prompt compression models on both passage and instruction compression tasks. Further analysis of the gist verbalization results shows that gist prompts serve different functions in aiding LLMs: they may directly provide potential answers, generate a chain of thought, or simply repeat the inputs. All data and code are available at https://github.com/OpenMatch/Gist-COCO .
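The sketch below is a rough, inference-only illustration of the compression idea described in the abstract: gist tokens are appended to the prompt and input, the encoder's hidden states at the gist positions are kept as the compressed prompt, and the decoder is then conditioned on those states instead of the raw prompt. The backbone choice (google/flan-t5-base), the number of gist tokens, and the helper names (`compress_prompt`, `decode_with_gist`) are illustrative assumptions, not the authors' implementation, and the finetuning of the plugin encoder is omitted.

```python
# Minimal sketch of gist-style prompt compression with an encoder-decoder model.
# Assumptions: backbone, gist-token count, and helper names are illustrative only;
# the actual Gist-COCO plugin encoder is finetuned, which this sketch omits.
import torch
from transformers import AutoTokenizer, T5ForConditionalGeneration
from transformers.modeling_outputs import BaseModelOutput

MODEL_NAME = "google/flan-t5-base"   # assumed backbone for illustration
N_GIST = 16                          # assumed number of gist tokens

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = T5ForConditionalGeneration.from_pretrained(MODEL_NAME)

# Register gist tokens as special tokens and grow the embedding table accordingly.
gist_tokens = [f"<gist_{i}>" for i in range(N_GIST)]
tokenizer.add_special_tokens({"additional_special_tokens": gist_tokens})
model.resize_token_embeddings(len(tokenizer))


def compress_prompt(prompt: str, user_input: str) -> torch.Tensor:
    """Encode prompt + input + gist tokens; keep only the hidden states at the
    gist positions as the compressed representation of the prompt."""
    text = f"{prompt} {user_input} " + " ".join(gist_tokens)
    enc = tokenizer(text, return_tensors="pt")
    hidden = model.encoder(
        input_ids=enc.input_ids, attention_mask=enc.attention_mask
    ).last_hidden_state                      # (1, seq_len, d_model)
    # The tokenizer appends </s>; the gist tokens sit just before it.
    return hidden[:, -(N_GIST + 1):-1, :]    # (1, N_GIST, d_model)


def decode_with_gist(gist_states: torch.Tensor, user_input: str) -> str:
    """Condition the (frozen) decoder on gist representations in place of the raw prompt."""
    enc = tokenizer(user_input, return_tensors="pt")
    input_states = model.encoder(
        input_ids=enc.input_ids, attention_mask=enc.attention_mask
    ).last_hidden_state
    # Prepend the compressed prompt (gist states) to the encoded user input.
    fused = torch.cat([gist_states, input_states], dim=1)
    out = model.generate(
        encoder_outputs=BaseModelOutput(last_hidden_state=fused),
        max_new_tokens=64,
    )
    return tokenizer.decode(out[0], skip_special_tokens=True)


if __name__ == "__main__":
    prompt = "Answer the question using only a short phrase."
    question = "What is the capital of France?"
    gist = compress_prompt(prompt, question)
    print(decode_with_gist(gist, question))
```

In this sketch the gist states stand in for the full prompt at decoding time; the paper's verbalization step, which turns such representations back into textual gist prompts so that other LLMs can consume them, is not shown here.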
Authors: Xinze Li, Zhenghao Liu, Chenyan Xiong, Shi Yu, Yukun Yan, Shuo Wang, Ge Yu