
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths (2103.01435v3)

Published 2 Mar 2021 in cs.CV

Abstract: Quantizing deep networks with adaptive bit-widths is a promising technique for efficient inference across many devices and resource constraints. In contrast to static methods that repeat the quantization process and train different models for different constraints, adaptive quantization enables us to flexibly adjust the bit-widths of a single deep network during inference for instant adaptation in different scenarios. While existing research shows encouraging results on common image classification benchmarks, this paper investigates how to train such adaptive networks more effectively. Specifically, we present two novel techniques for quantizing deep neural networks with adaptive bit-widths of weights and activations. First, we propose a collaborative strategy to choose a high-precision teacher for transferring knowledge to the low-precision student while jointly optimizing the model with all bit-widths. Second, to effectively transfer knowledge, we develop a dynamic block swapping method by randomly replacing the blocks in the lower-precision student network with the corresponding blocks in the higher-precision teacher network. Extensive experiments on multiple image classification datasets and, for the first time, on video classification benchmarks demonstrate the efficacy of our approach over state-of-the-art methods.
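
The block swapping idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the uniform quantizer, the scalar "blocks", the weight values, and the `swap_prob` parameter are all assumptions chosen for illustration. It only shows the mechanism of randomly routing a forward pass through higher-precision teacher blocks in place of the corresponding lower-precision student blocks, where both paths quantize the same underlying weights.

```python
import random

def quantize(x, bits):
    """Uniformly quantize x, clipped to [0, 1], onto 2**bits levels (assumed quantizer)."""
    levels = 2 ** bits - 1
    x = min(max(x, 0.0), 1.0)
    return round(x * levels) / levels

def make_block(weight, bits):
    """Toy 'block': multiply by a quantized weight, then quantize the activation."""
    w_q = quantize(weight, bits)
    def block(x):
        return quantize(w_q * x, bits)
    return block

def swapped_forward(x, student_blocks, teacher_blocks, swap_prob, rng):
    """Run the student path, replacing each block with the teacher's
    corresponding block with probability swap_prob (hypothetical parameter)."""
    used = []
    for s_blk, t_blk in zip(student_blocks, teacher_blocks):
        use_teacher = rng.random() < swap_prob
        x = (t_blk if use_teacher else s_blk)(x)
        used.append("teacher" if use_teacher else "student")
    return x, used

# Hypothetical shared full-precision weights, quantized at two bit-widths.
weights = [0.9, 0.7, 0.8]
student = [make_block(w, 2) for w in weights]  # low-precision path
teacher = [make_block(w, 8) for w in weights]  # high-precision path

out, used = swapped_forward(0.5, student, teacher, swap_prob=0.5, rng=random.Random(0))
print(used, out)
```

In a real training setup the blocks would be network stages and the mixed forward pass would feed a distillation loss; the sketch leaves that out and only shows which path each block takes.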

Authors (9)
  1. Ximeng Sun
  2. Rameswar Panda
  3. Chun-Fu Chen
  4. Naigang Wang
  5. Bowen Pan
  6. Kailash Gopalakrishnan
  7. Aude Oliva
  8. Rogerio Feris
  9. Kate Saenko
Citations (3)
