FewFedPIT: Towards Privacy-preserving and Few-shot Federated Instruction Tuning (2403.06131v2)

Published 10 Mar 2024 in cs.CR and cs.AI

Abstract: Instruction tuning has been identified as a crucial technique for optimizing the performance of LLMs in generating human-aligned responses. Nonetheless, gathering diversified and superior-quality instruction data for such tuning presents notable obstacles, especially in domains with rigid privacy provisions. Federated instruction tuning (FedIT) has emerged as a promising solution, by consolidating collaborative training across multiple data owners, thereby resulting in a privacy-preserving learning model. However, FedIT encounters limitations such as scarcity of instructional data and risk of exposure to training data extraction attacks. In this paper, we propose a novel federated algorithm, FewFedPIT, designed to simultaneously enhance privacy protection and model performance of federated few-shot learning. FewFedPIT comprises three vital components on the client side: (1) synthetic data generation, which utilizes LLMs' in-context learning capacity to generate synthetic data autonomously, thus expanding the local database; (2) parameter isolation training, which individually updates the public parameters on the synthetic data and the private parameters on the local data, consequently mitigating the noise impact of the synthetic data; (3) local aggregation sharing, which mixes public and private parameters before uploading, effectively preventing data extraction attacks. Extensive experiments on three open-source datasets demonstrate the effectiveness of FewFedPIT in enhancing privacy preservation and improving federated few-shot performance.
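The abstract describes three client-side steps. The sketch below is a minimal, illustrative rendering of how they could fit together in one client round; the function names (generate_synthetic, client_update), the placeholder training routine, and the mix_ratio hyperparameter are assumptions for illustration, not the paper's actual implementation or API.

```python
# Illustrative sketch of one FewFedPIT client round, based only on the
# abstract's description. Training and data generation are placeholders;
# real clients would fine-tune an LLM (e.g., LoRA adapters) instead.

import random
from typing import Dict, List

Params = Dict[str, float]  # stand-in for adapter/LoRA weights


def generate_synthetic(private_examples: List[str], n: int) -> List[str]:
    # Step 1 (synthetic data generation): the paper has the local LLM use
    # in-context learning to self-generate instruction data; here we just
    # emit tagged copies of the private seeds as a placeholder.
    return [random.choice(private_examples) + " [synthetic]" for _ in range(n)]


def train(params: Params, data: List[str], lr: float = 0.01) -> Params:
    # Placeholder "training": shift each parameter by a data-dependent amount.
    delta = lr * len(data)
    return {k: v + delta for k, v in params.items()}


def client_update(global_params: Params,
                  private_data: List[str],
                  mix_ratio: float = 0.5) -> Params:
    # Step 2 (parameter isolation training): public parameters are updated
    # on the synthetic data and private parameters on the local data,
    # in two separate training passes.
    synthetic_data = generate_synthetic(private_data, n=4 * len(private_data))
    public_params = train(dict(global_params), synthetic_data)
    private_params = train(dict(global_params), private_data)

    # Step 3 (local aggregation sharing): mix public and private parameters
    # before uploading, so the weights trained purely on private data never
    # leave the client. mix_ratio is an assumed hyperparameter.
    return {k: mix_ratio * public_params[k] + (1.0 - mix_ratio) * private_params[k]
            for k in global_params}


if __name__ == "__main__":
    global_params = {"w0": 0.0, "w1": 0.0}
    upload = client_update(global_params, ["example instruction"], mix_ratio=0.5)
    print(upload)  # parameters this client would send to the server
```

The privacy argument in the abstract hinges on the last step: only the mixed parameters are uploaded, which is what is claimed to blunt training-data extraction attacks against the shared model.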

Authors (8)
  1. Zhuo Zhang (42 papers)
  2. Jingyuan Zhang (50 papers)
  3. Jintao Huang (12 papers)
  4. Lizhen Qu (68 papers)
  5. Hongzhi Zhang (33 papers)
  6. Zenglin Xu (145 papers)
  7. Qifan Wang (129 papers)
  8. Xun Zhou (62 papers)
Citations (3)