Prompt-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression (2404.00489v2)

Published 30 Mar 2024 in cs.CL, cs.AI, and cs.LG

Abstract: LLMs have shown exceptional abilities across a wide range of natural language processing tasks. While prompting is a crucial tool for LLM inference, we observe that there is a significant cost associated with exceedingly lengthy prompts. Existing attempts to compress lengthy prompts lead to substandard results in terms of readability/interpretability of the compressed prompt, with a detrimental impact on prompt utility. To address this, we propose Prompt-SAW: Prompt compresSion via Relation AWare graphs, an effective strategy for prompt compression over task-agnostic and task-aware prompts. Prompt-SAW uses the prompt's textual information to build a graph and then extracts key information elements from the graph to produce the compressed prompt. We also propose GSM8K-aug, an extended version of the existing GSM8K benchmark for task-agnostic prompts, to provide a comprehensive evaluation platform. Experimental evaluation using benchmark datasets shows that prompts compressed by Prompt-SAW are not only better in terms of readability, but they also outperform the best-performing baseline models by up to 10.1 and 77.1, respectively, for task-agnostic and task-aware settings, while compressing the original prompt text by 34.9 and 56.7.
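
At a high level, the abstract describes a pipeline that turns the prompt into a graph of information elements, scores those elements, and keeps only the most informative ones when rendering the compressed prompt. The sketch below is a minimal, hypothetical illustration of that general idea only, not the Prompt-SAW implementation: the naive triple extractor, the overlap-based relevance score, and the `keep_ratio` budget are stand-ins chosen for brevity (a real system would use an information-extraction model and a learned or entropy-based scoring step).

```python
# Illustrative sketch only: a toy graph-style prompt compressor.
# All components here are hypothetical stand-ins, not the Prompt-SAW method.
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """One (subject, relation, object) element of the prompt graph."""
    subject: str
    relation: str
    obj: str


def extract_triples(prompt: str) -> list[Triple]:
    """Very naive extraction: one triple per sentence of the form
    '<subject> <relation> <rest>'. A real system would use an IE model."""
    triples = []
    for sentence in prompt.split("."):
        words = sentence.split()
        if len(words) >= 3:
            triples.append(Triple(words[0], words[1], " ".join(words[2:])))
    return triples


def score(triple: Triple, query: str) -> float:
    """Hypothetical relevance score: token overlap between the triple
    and the query the compressed prompt should still support."""
    triple_tokens = set(f"{triple.subject} {triple.relation} {triple.obj}".lower().split())
    query_tokens = set(query.lower().split())
    return len(triple_tokens & query_tokens) / max(len(triple_tokens), 1)


def compress(prompt: str, query: str, keep_ratio: float = 0.5) -> str:
    """Keep the highest-scoring triples within the budget and render them
    back into readable text, preserving interpretability."""
    triples = extract_triples(prompt)
    budget = max(1, int(len(triples) * keep_ratio))
    kept = sorted(triples, key=lambda t: score(t, query), reverse=True)[:budget]
    return ". ".join(f"{t.subject} {t.relation} {t.obj}" for t in kept)


if __name__ == "__main__":
    prompt = ("Alice manages the data team. The data team owns the sales warehouse. "
              "Bob enjoys hiking on weekends. The warehouse stores quarterly revenue.")
    print(compress(prompt, query="Who owns the sales warehouse", keep_ratio=0.5))
```

Because the compressed output is reassembled from whole graph elements rather than individually dropped tokens, it stays human-readable, which is the property the abstract highlights over token-level compression baselines.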

Authors (11)
  1. Muhammad Asif Ali (18 papers)
  2. Zhengping Li (3 papers)
  3. Shu Yang (178 papers)
  4. Keyuan Cheng (9 papers)
  5. Yang Cao (295 papers)
  6. Tianhao Huang (10 papers)
  7. Lijie Hu (50 papers)
  8. Lu Yu (87 papers)
  9. Di Wang (407 papers)
  10. Guimin Hu (11 papers)
  11. Weimin Lyu (19 papers)
Citations (9)