Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity (2406.02913v1)

Published 5 Jun 2024 in cs.LG and cs.AI

Abstract: Zeroth-order optimization (ZO) is a memory-efficient strategy for fine-tuning LLMs using only forward passes. However, the application of ZO fine-tuning in memory-constrained settings such as mobile phones and laptops is still challenging since full precision forward passes are infeasible. In this study, we address this limitation by integrating sparsity and quantization into ZO fine-tuning of LLMs. Specifically, we investigate the feasibility of fine-tuning an extremely small subset of LLM parameters using ZO. This approach allows the majority of un-tuned parameters to be quantized to accommodate the constraint of limited device memory. Our findings reveal that the pre-training process can identify a set of "sensitive parameters" that can guide the ZO fine-tuning of LLMs on downstream tasks. Our results demonstrate that fine-tuning 0.1% sensitive parameters in the LLM with ZO can outperform the full ZO fine-tuning performance, while offering wall-clock time speedup. Additionally, we show that ZO fine-tuning targeting these 0.1% sensitive parameters, combined with 4 bit quantization, enables efficient ZO fine-tuning of an Llama2-7B model on a GPU device with less than 8 GiB of memory and notably reduced latency.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Wentao Guo (17 papers)
  2. Jikai Long (3 papers)
  3. Yimeng Zeng (9 papers)
  4. Zirui Liu (58 papers)
  5. Xinyu Yang (109 papers)
  6. Yide Ran (5 papers)
  7. Osbert Bastani (97 papers)
  8. Christopher De Sa (77 papers)
  9. Xiaodong Yu (44 papers)
  10. Beidi Chen (61 papers)
  11. Zhaozhuo Xu (43 papers)
  12. Jacob R. Gardner (39 papers)
Citations (7)
X Twitter Logo Streamline Icon: https://streamlinehq.com