Boosting Pruned Networks with Linear Over-parameterization (2204.11444v3)
Abstract: Structured pruning compresses neural networks by removing channels (filters), enabling fast inference and a low memory footprint at run-time. Fine-tuning is usually applied afterwards to restore the accuracy lost to pruning. However, the few parameters remaining in a heavily pruned network make it genuinely difficult for fine-tuning to recover accuracy. To address this challenge, we propose a novel method that first linearly over-parameterizes the compact layers in the pruned network to enlarge the number of fine-tunable parameters, and then re-parameterizes them back to the original layers after fine-tuning. Specifically, we equivalently expand each convolution/linear layer into several consecutive convolution/linear layers that leave the current output feature maps unchanged. Furthermore, we employ similarity-preserving knowledge distillation, which encourages the over-parameterized block to mimic the data-to-data similarities of the corresponding dense layer and thereby preserve its feature-learning ability. The proposed method is evaluated comprehensively on CIFAR-10 and ImageNet and significantly outperforms the vanilla fine-tuning strategy, especially at large pruning ratios.
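To make the expand-then-merge idea concrete, here is a minimal PyTorch sketch (our own illustration, not the authors' released code) for the linear-layer case: a compact layer is expanded into two consecutive linear layers whose product equals the original weight, fine-tuned, and then merged back at no extra inference cost. A similarity-preserving distillation loss in the style of Tung and Mori is sketched alongside. The helper names `expand_linear`, `merge_linear`, and `sp_loss` are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def expand_linear(layer: nn.Linear, hidden: int) -> nn.Sequential:
    """Expand y = Wx + b into y = W2(W1 x) + b with W2 @ W1 == W at init.

    `hidden` must be >= layer.in_features so that W1 has full column rank
    and the expansion is exactly equivalent to the original layer.
    """
    assert hidden >= layer.in_features
    w1 = nn.Linear(layer.in_features, hidden, bias=False)
    w2 = nn.Linear(hidden, layer.out_features, bias=layer.bias is not None)
    with torch.no_grad():
        nn.init.orthogonal_(w1.weight)  # orthonormal columns
        # Solve W2 = W @ pinv(W1) so that W2 @ W1 = W exactly.
        w2.weight.copy_(layer.weight @ torch.linalg.pinv(w1.weight))
        if layer.bias is not None:
            w2.bias.copy_(layer.bias)
    return nn.Sequential(w1, w2)  # no nonlinearity in between: purely linear

def merge_linear(block: nn.Sequential) -> nn.Linear:
    """Re-parameterize the fine-tuned two-layer block back into one layer."""
    w1, w2 = block[0], block[1]
    merged = nn.Linear(w1.in_features, w2.out_features,
                       bias=w2.bias is not None)
    with torch.no_grad():
        merged.weight.copy_(w2.weight @ w1.weight)
        if w2.bias is not None:
            merged.bias.copy_(w2.bias)
    return merged

def sp_loss(feat_s: torch.Tensor, feat_t: torch.Tensor) -> torch.Tensor:
    """Similarity-preserving KD loss: match the row-normalized batch-wise
    similarity matrices G = normalize(A @ A^T) of student and teacher."""
    b = feat_s.size(0)
    g_s = F.normalize(feat_s.reshape(b, -1) @ feat_s.reshape(b, -1).t())
    g_t = F.normalize(feat_t.reshape(b, -1) @ feat_t.reshape(b, -1).t())
    return (g_s - g_t).pow(2).sum() / (b * b)

# Usage: expand a compact layer left after pruning, fine-tune the block
# (optionally with sp_loss against the dense teacher), then merge it back.
layer = nn.Linear(64, 32)
block = expand_linear(layer, hidden=256)
x = torch.randn(8, 64)
assert torch.allclose(block(x), layer(x), atol=1e-4)  # equivalent at init
restored = merge_linear(block)  # same shape and cost as `layer`
```

Because no nonlinearity separates the two layers, the product W2 @ W1 collapses back into a single matrix of the original shape, so the run-time model pays nothing for the extra fine-tuning capacity; the same composition argument extends to the 1x1/kxk convolution expansions described in the abstract.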