Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Differentiable Transportation Pruning (2307.08483v2)

Published 17 Jul 2023 in cs.CV

Abstract: Deep learning algorithms are increasingly employed at the edge. However, edge devices are resource constrained and thus require efficient deployment of deep neural networks. Pruning methods are a key tool for edge deployment as they can improve storage, compute, memory bandwidth, and energy usage. In this paper we propose a novel accurate pruning technique that allows precise control over the output network size. Our method uses an efficient optimal transportation scheme which we make end-to-end differentiable and which automatically tunes the exploration-exploitation behavior of the algorithm to find accurate sparse sub-networks. We show that our method achieves state-of-the-art performance compared to previous pruning methods on 3 different datasets, using 5 different models, across a wide range of pruning ratios, and with two types of sparsity budgets and pruning granularities.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yunqiang Li (7 papers)
  2. Jan C. van Gemert (49 papers)
  3. Torsten Hoefler (203 papers)
  4. Bert Moons (7 papers)
  5. Evangelos Eleftheriou (23 papers)
  6. Bram-Ernst Verhoef (2 papers)
Citations (4)