A Split-and-Privatize Framework for Large Language Model Fine-Tuning (2312.15603v1)
Abstract: Fine-tuning is a prominent technique for adapting a pre-trained LLM to downstream scenarios. In parameter-efficient fine-tuning, only a small subset of modules is trained on the downstream datasets, while the rest of the pre-trained model is kept frozen to save computation resources. In recent years, a popular productization form has emerged as Model-as-a-Service (MaaS), in which vendors provide abundant pre-trained LLMs, server resources and core functions, and customers can fine-tune, deploy and invoke their customized models through the one-stop MaaS platform with their own private datasets. In this paper, we identify the model and data privacy leakage risks in MaaS fine-tuning and propose a Split-and-Privatize (SAP) framework, which mitigates these privacy issues by adapting the existing split learning architecture. The proposed SAP framework is evaluated extensively through experiments, and the results indicate that it can enhance empirical privacy by 62% at the cost of a 1% degradation in model performance on the Stanford Sentiment Treebank dataset.
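In a split-learning setup of the kind the abstract describes, the model is typically partitioned into a customer-side bottom model and a vendor-side top model, with the intermediate text representations privatized before they cross the customer/vendor boundary. Below is a minimal PyTorch sketch of one such split fine-tuning step; the module names (`BottomModel`, `TopModel`), the choice of the embedding layer as the split point, and the additive-noise privatization are illustrative assumptions for this sketch, not necessarily the paper's exact SAP design.

```python
# Minimal sketch of a split fine-tuning step. Assumptions: the split point is the
# embedding layer, privatization is modeled as additive Gaussian noise on the
# transmitted representations, and only a small vendor-side head is trainable.
import torch
import torch.nn as nn


class BottomModel(nn.Module):
    """Customer-side part: token embeddings plus a privatization step."""

    def __init__(self, vocab_size: int, hidden: int, noise_std: float = 0.1):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.noise_std = noise_std

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        reps = self.embed(input_ids)
        # Privatize the intermediate representations before sending them to the vendor.
        return reps + self.noise_std * torch.randn_like(reps)


class TopModel(nn.Module):
    """Vendor-side part: a frozen encoder backbone plus a small trainable head."""

    def __init__(self, hidden: int, num_labels: int):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=hidden, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        for p in self.encoder.parameters():
            p.requires_grad = False              # frozen backbone (parameter-efficient)
        self.head = nn.Linear(hidden, num_labels)  # only the head is fine-tuned

    def forward(self, reps: torch.Tensor) -> torch.Tensor:
        return self.head(self.encoder(reps).mean(dim=1))


# One fine-tuning step: the customer computes privatized representations locally,
# the vendor runs the top model and updates only its trainable parameters.
bottom = BottomModel(vocab_size=30522, hidden=64)
top = TopModel(hidden=64, num_labels=2)
optimizer = torch.optim.AdamW(
    [p for p in top.parameters() if p.requires_grad], lr=1e-3
)

input_ids = torch.randint(0, 30522, (8, 16))  # toy batch of token ids
labels = torch.randint(0, 2, (8,))

logits = top(bottom(input_ids))               # representations cross the split boundary
loss = nn.functional.cross_entropy(logits, labels)
loss.backward()
optimizer.step()
print(f"loss: {loss.item():.4f}")
```

The design intent illustrated here is that raw text (and the vendor's full model) never has to leave its owner's side: the customer only ever transmits perturbed representations, and the vendor only ever updates a small trainable portion of the model.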
Authors: Xicong Shen, Yang Liu, Huiqi Liu, Jue Hong, Bing Duan, Zirui Huang, Yunlong Mao, Ye Wu, Di Wu