Private Fine-tuning of Large Language Models with Zeroth-order Optimization (2401.04343v2)

Published 9 Jan 2024 in cs.LG, cs.CL, and cs.CR

Abstract: Differentially private stochastic gradient descent (DP-SGD) allows models to be trained in a privacy-preserving manner, but has proven difficult to scale to the era of foundation models. We introduce DP-ZO, a private fine-tuning framework for LLMs that privatizes zeroth-order optimization methods. A key insight in the design of our method is that the direction of the gradient in the zeroth-order optimization we use is random, and the only information derived from training data is the step size, i.e., a scalar. Therefore, we only need to privatize the scalar step size, which is memory-efficient. DP-ZO provides a privacy-utility trade-off across different tasks and model sizes that is comparable to DP-SGD in $(\varepsilon,\delta)$-DP. Notably, DP-ZO has significant advantages over DP-SGD in memory efficiency, and obtains higher utility in $\varepsilon$-DP when using the Laplace mechanism.
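The abstract's key idea, that only a scalar per example needs privatizing, can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the function name, parameters (`eps_scale`, `clip`, `noise_mult`), and the use of a two-point SPSA-style estimate with Gaussian noise are assumptions based on standard zeroth-order optimization and the Gaussian mechanism.

```python
import numpy as np

def dp_zo_step(params, loss_fn, batch, lr=1e-3, eps_scale=1e-3,
               clip=1.0, noise_mult=1.0, rng=None):
    """One hypothetical DP-ZO update: perturb params along a shared random
    direction z (data-independent, hence public), estimate the directional
    derivative from two loss evaluations per example, then clip and noise
    only that scalar before stepping."""
    rng = np.random.default_rng() if rng is None else rng
    z = rng.standard_normal(params.shape)  # random direction, no private info
    scalars = []
    for x in batch:
        l_plus = loss_fn(params + eps_scale * z, x)
        l_minus = loss_fn(params - eps_scale * z, x)
        g = (l_plus - l_minus) / (2 * eps_scale)  # scalar directional derivative
        scalars.append(float(np.clip(g, -clip, clip)))  # bound per-example sensitivity
    # Privatize the summed scalar (Gaussian mechanism; Laplace is the eps-DP variant)
    noisy_sum = sum(scalars) + rng.normal(0.0, noise_mult * clip)
    step_size = noisy_sum / len(batch)
    return params - lr * step_size * z
```

Because `z` is sampled independently of the data, the only data-dependent quantity released per step is the noisy scalar, which is why the method avoids the per-parameter gradient clipping and noise of DP-SGD and its memory overhead.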

Authors (5)
  1. Xinyu Tang
  2. Ashwinee Panda
  3. Milad Nasr
  4. Saeed Mahloujifar
  5. Prateek Mittal