Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PAC-Net: A Model Pruning Approach to Inductive Transfer Learning (2206.05703v2)

Published 12 Jun 2022 in cs.LG, cs.AI, physics.comp-ph, stat.AP, and stat.ML

Abstract: Inductive transfer learning aims to learn from a small amount of training data for the target task by utilizing a pre-trained model from the source task. Most strategies that involve large-scale deep learning models adopt initialization with the pre-trained model and fine-tuning for the target task. However, when using over-parameterized models, we can often prune the model without sacrificing the accuracy of the source task. This motivates us to adopt model pruning for transfer learning with deep learning models. In this paper, we propose PAC-Net, a simple yet effective approach for transfer learning based on pruning. PAC-Net consists of three steps: Prune, Allocate, and Calibrate (PAC). The main idea behind these steps is to identify essential weights for the source task, fine-tune on the source task by updating the essential weights, and then calibrate on the target task by updating the remaining redundant weights. Under the various and extensive set of inductive transfer learning experiments, we show that our method achieves state-of-the-art performance by a large margin.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Sanghoon Myung (3 papers)
  2. In Huh (2 papers)
  3. Wonik Jang (2 papers)
  4. Jae Myung Choe (2 papers)
  5. Jisu Ryu (8 papers)
  6. Dae Sin Kim (5 papers)
  7. Kee-Eung Kim (24 papers)
  8. Changwook Jeong (8 papers)
Citations (12)