PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs (2506.07587v1)

Published 9 Jun 2025 in cs.LG and cs.AI

Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods have emerged as effective and promising approaches for fine-tuning pre-trained LLMs. Compared with Full-parameter Fine-Tuning (FFT), PEFT achieves comparable task performance with a substantial reduction in trainable parameters, which greatly lowers training and storage costs. However, applying PEFT requires navigating a vast design space, including the type of PEFT modules and the layers into which they are inserted; inadequate configurations can lead to sub-optimal results. Conventional solutions such as architectural search techniques, while effective, tend to introduce substantial additional overhead. In this paper, we propose a novel approach, PrunePEFT, which formulates the PEFT strategy search as a pruning problem and introduces a hybrid pruning strategy that capitalizes on the sensitivity of different pruning methods to different PEFT modules. This method extends traditional pruning techniques by iteratively removing redundant or conflicting PEFT modules, thereby optimizing the fine-tuned configuration. By efficiently identifying the most relevant modules, our approach significantly reduces the computational burden typically associated with architectural search, making it a more scalable and efficient solution for fine-tuning large pre-trained models.

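To make the abstract's idea concrete, the sketch below illustrates the general shape of an iterative hybrid pruning loop over a pool of candidate PEFT modules: each round scores the currently active modules with one of several pruning criteria and deactivates the lowest-scoring fraction. This is a minimal illustration, not the authors' implementation; the module kinds, the two scoring functions, and the pruning schedule are hypothetical placeholders.

```python
# Illustrative sketch of iterative hybrid pruning over candidate PEFT modules.
# Not the authors' code: scoring functions and schedule are placeholders.
from dataclasses import dataclass
import random


@dataclass
class PEFTModule:
    layer: int
    kind: str          # e.g. "lora" or "adapter" (hypothetical labels)
    active: bool = True


def magnitude_score(m: PEFTModule) -> float:
    # Placeholder for a weight-magnitude pruning criterion.
    return random.random()


def gradient_score(m: PEFTModule) -> float:
    # Placeholder for a gradient-sensitivity pruning criterion.
    return random.random()


def hybrid_prune(modules, rounds=3, prune_frac=0.25):
    """Iteratively drop the lowest-scoring fraction of active modules,
    alternating pruning criteria across rounds (the 'hybrid' idea)."""
    criteria = [magnitude_score, gradient_score]
    for r in range(rounds):
        score = criteria[r % len(criteria)]
        active = [m for m in modules if m.active]
        active.sort(key=score)
        n_prune = int(len(active) * prune_frac)
        for m in active[:n_prune]:
            m.active = False  # prune a redundant or conflicting module
        # In the actual method, a brief fine-tuning step would run here
        # before the next pruning round updates the scores.
    return [m for m in modules if m.active]


if __name__ == "__main__":
    pool = [PEFTModule(layer=i, kind=k)
            for i in range(12) for k in ("lora", "adapter")]
    kept = hybrid_prune(pool)
    print(f"kept {len(kept)} of {len(pool)} candidate PEFT modules")
```

The surviving modules define the final PEFT configuration that is then fine-tuned in full, which is how the search cost stays far below that of a conventional architecture search.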
Authors (6)
  1. Tongzhou Yu (1 paper)
  2. Zhuhao Zhang (3 papers)
  3. Guanghui Zhu (11 papers)
  4. Shen Jiang (3 papers)
  5. Meikang Qiu (23 papers)
  6. Yihua Huang (17 papers)