FedPFT: Federated Proxy Fine-Tuning of Foundation Models (2404.11536v2)

Published 17 Apr 2024 in cs.LG and cs.AI

Abstract: Adapting Foundation Models (FMs) to downstream tasks through Federated Learning (FL) has emerged as a promising strategy for protecting both data privacy and valuable FMs. Existing methods fine-tune the FM by allocating a sub-FM to each client in FL; however, this leads to suboptimal performance due to insufficient tuning and inevitable accumulation of gradient errors. In this paper, we propose Federated Proxy Fine-Tuning (FedPFT), a novel method that enhances FM adaptation to downstream tasks through FL via two key modules. First, the sub-FM construction module employs a layer-wise compression approach that emphasizes crucial neurons, enabling comprehensive fine-tuning of the FM across all layers. Second, the sub-FM alignment module conducts two-step distillation, layer-level and neuron-level, before and during FL fine-tuning respectively, to reduce gradient error by accurately aligning the sub-FM with the FM under theoretical guarantees. Experimental results on seven commonly used datasets (i.e., four text and three vision) demonstrate the superiority of FedPFT.
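The abstract does not spell out how the layer-wise compression is implemented, but a minimal sketch of the idea can illustrate it. The sketch below assumes that "crucial neurons" are scored by some importance measure (e.g., mean activation magnitude on a small calibration batch) and that the sub-FM keeps only the top-scoring hidden units of each Transformer feed-forward block; the function name, the keep ratio, and the scoring criterion are illustrative assumptions, not the paper's exact construction.

```python
import torch
import torch.nn as nn

def compress_ffn_layer(ffn: nn.Sequential, importance: torch.Tensor,
                       keep_ratio: float = 0.25) -> nn.Sequential:
    """Build a narrower feed-forward block that keeps only the most important hidden neurons.

    Assumes `ffn` is Linear -> activation -> Linear (a standard Transformer FFN) and
    `importance` holds one score per hidden neuron, e.g. mean activation magnitude
    measured on a calibration set (an assumed criterion for illustration).
    """
    up, act, down = ffn[0], ffn[1], ffn[2]
    k = max(1, int(up.out_features * keep_ratio))
    keep = torch.topk(importance, k).indices          # indices of the "crucial" neurons

    small_up = nn.Linear(up.in_features, k)
    small_down = nn.Linear(k, down.out_features)
    with torch.no_grad():
        small_up.weight.copy_(up.weight[keep])        # keep rows of the up-projection
        small_up.bias.copy_(up.bias[keep])
        small_down.weight.copy_(down.weight[:, keep]) # keep columns of the down-projection
        small_down.bias.copy_(down.bias)
    return nn.Sequential(small_up, act, small_down)

# Example: shrink one layer's FFN of a toy model to a quarter of its hidden width.
ffn = nn.Sequential(nn.Linear(768, 3072), nn.GELU(), nn.Linear(3072, 768))
importance = torch.rand(3072)                         # placeholder scores for illustration
sub_ffn = compress_ffn_layer(ffn, importance)
```

Applying such a compression to every layer, rather than dropping layers outright, is what lets the clients' updates in FL still touch all layers of the FM, which the abstract identifies as the point of the construction module.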

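The alignment module's two distillation steps are likewise only named in the abstract. A hedged sketch of the layer-level step is given below, assuming each sub-FM layer is trained to match the hidden states of a corresponding frozen FM layer with an MSE objective; the layer pairing and the choice of loss are assumptions for illustration, not the paper's stated formulation.

```python
import torch
import torch.nn.functional as F

def layer_level_distillation_loss(fm_hiddens, sub_hiddens, layer_map):
    """Layer-level alignment: each sub-FM layer imitates one frozen FM layer.

    `fm_hiddens` / `sub_hiddens` are lists of hidden-state tensors, and
    `layer_map[i]` is the index of the FM layer that sub-FM layer `i` should
    match (the pairing scheme is an assumption for illustration).
    """
    loss = torch.zeros(())
    for i, j in enumerate(layer_map):
        loss = loss + F.mse_loss(sub_hiddens[i], fm_hiddens[j].detach())
    return loss / len(layer_map)

# Example with random tensors standing in for hidden states of a 12-layer FM
# distilled into a 4-layer sub-FM (batch 2, sequence length 8, width 768).
fm_hiddens = [torch.randn(2, 8, 768) for _ in range(12)]
sub_hiddens = [torch.randn(2, 8, 768, requires_grad=True) for _ in range(4)]
print(layer_level_distillation_loss(fm_hiddens, sub_hiddens, layer_map=[2, 5, 8, 11]))
```

The neuron-level step performed during FL fine-tuning would presumably add a finer-grained matching term on the selected neurons' activations, but the abstract does not give enough detail to reconstruct it here.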
Authors (8)
  1. Zhaopeng Peng (7 papers)
  2. Xiaoliang Fan (17 papers)
  3. Yufan Chen (34 papers)
  4. Zheng Wang (400 papers)
  5. Shirui Pan (198 papers)
  6. Chenglu Wen (30 papers)
  7. Ruisheng Zhang (5 papers)
  8. Cheng Wang (386 papers)
Citations (4)
