PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching (2312.05621v2)
Abstract: Instruction fine-tuning has conventionally been employed to adapt large language models (LLMs) to a variety of tasks. Nonetheless, this technique often requires substantial computational resources, making it impractical for deployment by individuals or small-scale entities. Recently, Low-Rank Adaptation (LoRA) has emerged as a promising alternative, offering capabilities on par with full fine-tuning at reduced resource overhead. However, attaining satisfactory performance through LoRA fine-tuning alone remains a non-trivial challenge. In this paper, we propose PILLOW, which aims to improve LoRA's performance via a discrimination-based prompting method that leverages LLMs' in-context learning ability. PILLOW incorporates a matching network that selects prompts from a user-defined prompt pool, concatenates the selected prompts with the user instruction as input, and performs inference with the LoRA-fine-tuned LLM. Trained with reinforcement learning, PILLOW achieves performance comparable to typical instruction fine-tuning methods on various evaluation metrics, while using only consumer-grade GPU resources and substantially reducing computational cost.
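To make the described pipeline concrete, below is a minimal sketch of the prompt-matching step, not the authors' released implementation: the names (`MatchingNetwork`, `select_prompts`), the embedding dimension, and the use of random embeddings in place of a real text encoder and a LoRA-fine-tuned model are all illustrative assumptions. A small matching network scores each prompt in a user-defined pool against the incoming instruction, and the top-scoring prompts are prepended to the instruction before inference; in the full method this network is trained with reinforcement learning rather than used untrained as here.

```python
# Hypothetical sketch of PILLOW-style prompt matching (not the authors' code).
import torch
import torch.nn as nn

class MatchingNetwork(nn.Module):
    """Scores (instruction, candidate prompt) embedding pairs."""
    def __init__(self, embed_dim: int = 768, hidden: int = 256):
        super().__init__()
        self.scorer = nn.Sequential(
            nn.Linear(2 * embed_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, instr_emb: torch.Tensor, prompt_embs: torch.Tensor) -> torch.Tensor:
        # instr_emb: (embed_dim,), prompt_embs: (pool_size, embed_dim)
        instr = instr_emb.expand(prompt_embs.size(0), -1)
        return self.scorer(torch.cat([instr, prompt_embs], dim=-1)).squeeze(-1)

def select_prompts(matcher, instr_emb, prompt_embs, prompt_pool, k=2):
    """Pick the top-k prompts from the pool for the given instruction."""
    with torch.no_grad():
        scores = matcher(instr_emb, prompt_embs)  # shape: (pool_size,)
    top_idx = scores.topk(k).indices.tolist()
    return [prompt_pool[i] for i in top_idx]

if __name__ == "__main__":
    # Toy example: random embeddings stand in for a real text encoder, and the
    # printed string stands in for the input to a LoRA-fine-tuned LLM.
    prompt_pool = [
        "Answer step by step.",
        "Respond concisely in one sentence.",
        "Explain as if to a beginner.",
    ]
    embed_dim = 768
    matcher = MatchingNetwork(embed_dim)
    instr_emb = torch.randn(embed_dim)
    prompt_embs = torch.randn(len(prompt_pool), embed_dim)

    chosen = select_prompts(matcher, instr_emb, prompt_embs, prompt_pool, k=1)
    user_instruction = "Summarize the benefits of low-rank adaptation."
    llm_input = "\n".join(chosen + [user_instruction])
    print(llm_input)  # This concatenation would be passed to the LoRA-tuned model.
```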
- Zhenting Qi
- Xiaoyu Tan
- Shaojie Shi
- Chao Qu
- Yinghui Xu
- Yuan Qi