
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs (2408.15300v1)

Published 27 Aug 2024 in cs.LG and cs.AI

Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods have gained popularity and democratized the use of LLMs. Recent studies have shown that a small subset of weights significantly impacts performance. Based on this observation, we introduce a novel PEFT method called Gaussian noise Injected Fine-Tuning of Salient Weights (GIFT-SW). Our method updates only salient columns while injecting Gaussian noise into non-salient ones. To identify these columns, we developed a generalized sensitivity metric that extends and unifies metrics from previous studies. Experiments with LLaMA models demonstrate that GIFT-SW outperforms full fine-tuning and modern PEFT methods under the same computational budget. Moreover, GIFT-SW offers a practical way to recover the performance of models subjected to mixed-precision quantization in which salient weights are kept in full precision.
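The core idea in the abstract (update only salient columns, perturb the rest with Gaussian noise) can be illustrated with a minimal NumPy sketch for a single weight matrix. This is not the authors' implementation: the function names, the `noise_std` parameter, the plain SGD step, and especially the column score (a squared L2 norm standing in for the paper's generalized sensitivity metric, which the abstract does not define) are all assumptions made for illustration.

```python
import numpy as np

def select_salient_columns(W, k):
    """Return sorted indices of the k highest-scoring columns of W.

    ASSUMPTION: the score is the squared L2 norm of each column. It is a
    placeholder for the paper's sensitivity metric, not that metric.
    """
    scores = np.sum(W ** 2, axis=0)
    return np.sort(np.argsort(scores)[-k:])

def noisy_forward(W, x, salient_idx, noise_std, rng):
    """Compute x @ (W + noise).T with noise applied to non-salient columns only."""
    non_salient = np.ones(W.shape[1], dtype=bool)
    non_salient[salient_idx] = False
    # Gaussian noise is zeroed on salient columns via broadcasting over rows.
    noise = rng.normal(0.0, noise_std, size=W.shape) * non_salient
    return x @ (W + noise).T

def salient_sgd_step(W, grad_W, salient_idx, lr):
    """Apply a plain SGD update to salient columns; other columns stay frozen."""
    W_new = W.copy()
    W_new[:, salient_idx] -= lr * grad_W[:, salient_idx]
    return W_new

# Hypothetical usage on a toy 3x4 weight matrix.
rng = np.random.default_rng(0)
W = np.arange(12, dtype=float).reshape(3, 4)
salient = select_salient_columns(W, k=2)
y = noisy_forward(W, np.ones((1, 4)), salient, noise_std=0.01, rng=rng)
W_next = salient_sgd_step(W, np.ones_like(W), salient, lr=0.1)
```

In this sketch the noise is resampled on every forward pass and never written back into `W`, so the non-salient columns remain at their pretrained values while only the `k` selected columns are trained.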
