
Binary Neural Networks as a general-propose compute paradigm for on-device computer vision (2202.03716v1)

Published 8 Feb 2022 in cs.CV

Abstract: For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a better speed-vs-accuracy tradeoff than 8-bit quantization and establish a similar degree of general applicability in vision tasks. To this end, we propose a BNN framework comprising 1) a minimalistic inference scheme for hardware-friendliness, 2) an over-parameterized training scheme for high accuracy, and 3) a simple procedure to adapt to different vision tasks. The resultant framework overtakes 8-bit quantization in the speed-vs-accuracy tradeoff for classification, detection, segmentation, super-resolution and matching: our BNNs not only retain the accuracy levels of their 8-bit baselines but also showcase 1.3-2.4$\times$ faster FPS on mobile CPUs. Similar conclusions can be drawn for prototypical systolic-array-based AI accelerators, where our BNNs promise 2.8-7$\times$ fewer execution cycles than 8-bit and 2.1-2.7$\times$ fewer cycles than alternative BNN designs. These results suggest that the time for large-scale BNN adoption could be upon us.

Authors (9)
  1. Guhong Nie (2 papers)
  2. Lirui Xiao (2 papers)
  3. Menglong Zhu (18 papers)
  4. Dongliang Chu (1 paper)
  5. Yue Shen (243 papers)
  6. Peng Li (390 papers)
  7. Kang Yang (69 papers)
  8. Li Du (72 papers)
  9. Bo Chen (309 papers)
Citations (5)