Propulsion: Steering LLM with Tiny Fine-Tuning (2409.10927v3)

Published 17 Sep 2024 in cs.CL

Abstract: The rapid advancements in LLMs have revolutionized NLP and related fields. However, fine-tuning these models for specific tasks remains computationally expensive and risks degrading pre-learned features. To address these challenges, we propose Propulsion, a novel parameter-efficient fine-tuning (PEFT) method designed to optimize task-specific performance while drastically reducing computational overhead. Inspired by the concept of controlled adjustments in physical motion, Propulsion selectively re-scales specific dimensions of a pre-trained model, guiding output predictions toward task objectives without modifying the model's parameters. By introducing lightweight, trainable Propulsion parameters at the pre-trained layers, we minimize the number of parameters updated during fine-tuning, preventing overfitting or the overwriting of existing knowledge. Our theoretical analysis, supported by Neural Tangent Kernel (NTK) theory, shows that Propulsion approximates the performance of full fine-tuning with far fewer trainable parameters. Empirically, Propulsion reduces the trainable parameter count from 355.3 million to just 0.086 million, achieving over a 10x reduction compared to standard approaches like LoRA while maintaining competitive performance across benchmarks.
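The abstract describes Propulsion as element-wise re-scaling of a frozen pre-trained layer's output by a small set of trainable parameters. A minimal PyTorch sketch of that idea follows; the class name `PropulsionLinear`, the power hyperparameter `k`, and the initialization to ones are illustrative assumptions, not the authors' reference implementation.

```python
# Minimal sketch of the Propulsion idea from the abstract (assumptions noted above).
import torch
import torch.nn as nn

class PropulsionLinear(nn.Module):
    """Wraps a frozen pre-trained linear layer and re-scales its output
    dimensions with a tiny trainable vector (the 'propulsion' parameters)."""

    def __init__(self, base_layer: nn.Linear, k: float = 1.0):
        super().__init__()
        self.base = base_layer
        for p in self.base.parameters():  # keep pre-trained weights fixed
            p.requires_grad = False
        # One trainable scale per output dimension, initialized to 1 so the
        # wrapped layer starts out identical to the pre-trained one.
        self.z = nn.Parameter(torch.ones(base_layer.out_features))
        self.k = k  # assumed re-scaling exponent; not from the abstract

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.base(x)               # frozen pre-trained projection
        return h * self.z.pow(self.k)  # element-wise re-scaling only

# Usage: only the propulsion vector is updated during fine-tuning.
layer = PropulsionLinear(nn.Linear(768, 768))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 768 trainable vs. 768*768 + 768 frozen base parameters
```

The toy count above illustrates the scaling behind the abstract's headline number: one trainable scalar per output dimension rather than a full weight matrix, which is how the trainable parameter count can drop by orders of magnitude while the pre-trained weights stay untouched.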

Authors (3)
  1. Md Kowsher (17 papers)
  2. Nusrat Jahan Prottasha (12 papers)
  3. Prakash Bhat (6 papers)
Citations (2)