A Comprehensive Look at Binary Neural Networks
What are Binary Neural Networks?
Binary Neural Networks (BNNs) are a fascinating segment of deep learning research focused on reducing the computational complexity and storage requirements of neural networks. Unlike traditional networks that use 32-bit floating-point numbers for weights and activations, BNNs restrict these values to a single bit, typically -1 or +1. This translates to significant memory savings and faster computation, making BNNs a promising solution for deploying neural networks on resource-constrained devices like mobile phones and embedded systems.
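As a back-of-the-envelope illustration of where savings of this kind come from, the sketch below compares the storage needed for one million weights stored as 32-bit floats versus 1-bit values packed eight to a byte. The layer size is arbitrary, and the real savings depend on which parts of a network are actually binarized.

```python
# Rough memory comparison for a single layer with 1M weights (illustrative only).
num_weights = 1_000_000

fp32_bytes = num_weights * 4        # 32-bit floats: 4 bytes per weight
binary_bytes = num_weights / 8      # 1-bit weights packed 8 per byte

print(f"float32: {fp32_bytes / 1e6:.1f} MB")    # 4.0 MB
print(f"binary:  {binary_bytes / 1e6:.3f} MB")  # 0.125 MB, i.e. 32x smaller
```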
The Challenges of Binarization
While BNNs offer clear efficiency advantages, they also bring significant challenges. The most pressing is the information loss incurred during binarization: constraining weights and activations to binary values creates a discrepancy between the original full-precision values and their binary approximations, which can degrade the network's overall performance. Another issue is that the binarization function is non-differentiable, which makes BNNs difficult to optimize with standard backpropagation.
Types of BNN Algorithms
The paper classifies BNN algorithms into two major categories:
- Naive Binary Neural Networks: These methods apply a fixed binarization function, often the sign function, directly to the network weights and activations (a minimal sketch follows this list). Despite their simplicity, they struggle to maintain high accuracy because of the large quantization errors this introduces.
- Optimization-Based Binary Neural Networks: These methods seek to mitigate the limitations of naive BNNs by employing various optimization techniques. These can include minimizing quantization error, improving the loss function, and reducing gradient error to better approximate full-precision performance.
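To make the naive category concrete, here is a minimal PyTorch-style sketch of deterministic sign binarization; the `binarize` helper is illustrative rather than taken from any specific paper. It simply maps every weight and activation value to -1 or +1.

```python
import torch

def binarize(x: torch.Tensor) -> torch.Tensor:
    """Naive deterministic binarization: map every value to -1 or +1."""
    # Exact zeros are mapped to +1 by convention (torch.sign would return 0).
    return torch.where(x >= 0, torch.ones_like(x), -torch.ones_like(x))

w = torch.randn(4, 4)   # a small weight matrix
a = torch.randn(4)      # an activation vector
w_bin, a_bin = binarize(w), binarize(a)
print(w_bin)            # every entry is -1.0 or +1.0
print(w_bin @ a_bin)    # each dot product reduces to a signed count of matches
```

The quantization error discussed above is simply the gap between `w` and `w_bin`.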
Naive Binary Neural Networks
One of the earliest efforts, BinaryConnect, binarizes weights during the forward and backward passes while accumulating gradient updates in a set of full-precision latent weights. Binarized Neural Networks (BNNs) extend this by binarizing both weights and activations, achieving memory savings of up to 32x and computational speedups of up to 58x on CPUs.
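The following is a minimal sketch of the BinaryConnect idea in PyTorch, under my own naming (`BinarizeSTE`, `BinaryLinear`): the forward pass uses binary weights, while the optimizer updates the latent full-precision weights through a straight-through gradient. It is a toy single-layer example, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BinarizeSTE(torch.autograd.Function):
    """Sign in the forward pass, straight-through (identity) gradient in the backward pass."""
    @staticmethod
    def forward(ctx, w):
        return torch.where(w >= 0, torch.ones_like(w), -torch.ones_like(w))

    @staticmethod
    def backward(ctx, grad_output):
        # Pass the gradient straight through to the latent full-precision weights.
        return grad_output

class BinaryLinear(nn.Module):
    """Linear layer that binarizes its latent full-precision weights on the fly."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.1)

    def forward(self, x):
        w_bin = BinarizeSTE.apply(self.weight)   # binary weights used in the forward pass
        return F.linear(x, w_bin)

# One illustrative training step.
layer = BinaryLinear(16, 4)
opt = torch.optim.SGD(layer.parameters(), lr=0.1)
x, y = torch.randn(8, 16), torch.randint(0, 4, (8,))
loss = F.cross_entropy(layer(x), y)
loss.backward()
opt.step()                              # the update changes the latent weights
layer.weight.data.clamp_(-1.0, 1.0)     # BinaryConnect also clips latent weights to [-1, 1]
```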
Optimization-Based BNNs: Methods and Techniques
Minimizing Quantization Error
Optimization-based approaches often introduce a scaling factor so that binary weights approximate the full-precision weights more accurately. XNOR-Net, for example, scales the binarized weights of each filter by the mean absolute value of the original weights, which minimizes the quantization error and yields markedly better accuracy than naive BNNs.
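The intuition fits in a few lines: for B = sign(W), the scaling factor alpha that minimizes ||W - alpha * B||^2 is the mean absolute value of W. XNOR-Net computes this per output filter; the sketch below uses a single tensor-wide alpha for brevity.

```python
import torch

def binarize_with_scale(w: torch.Tensor):
    """Approximate W by alpha * B, with B in {-1, +1} and alpha = mean(|W|)."""
    b = torch.where(w >= 0, torch.ones_like(w), -torch.ones_like(w))
    alpha = w.abs().mean()
    return alpha, b

w = torch.randn(64, 64)
alpha, b = binarize_with_scale(w)

plain_err = (w - b).pow(2).mean()           # naive binarization
scaled_err = (w - alpha * b).pow(2).mean()  # scaled binarization
print(f"quantization error without scale: {plain_err.item():.4f}")
print(f"quantization error with scale:    {scaled_err.item():.4f}")  # noticeably smaller
```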
Improving Network Loss Function
Another promising direction focuses on shaping the training objective around binarization. Loss-Aware Binarization (LAB), for example, minimizes the task loss directly with respect to the binarized weights, and related methods add penalty or regularization terms that account for the constraints binarization imposes. Both strategies help align the training of BNNs with that of their full-precision counterparts.
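As a rough sketch of how such a term might look (a generic regularizer of my own construction, not LAB's actual formulation), one can penalize the distance between the latent full-precision weights and their scaled binary approximation:

```python
import torch

def binarization_penalty(weights, strength: float = 1e-4) -> torch.Tensor:
    """Generic regularizer pulling latent weights toward their scaled binary approximation.

    Illustrative only: it shows the flavor of binarization-aware loss terms,
    not the exact objective used by Loss-Aware Binarization.
    """
    penalty = torch.zeros(())
    for w in weights:
        b = torch.where(w >= 0, torch.ones_like(w), -torch.ones_like(w))
        alpha = w.abs().mean()
        penalty = penalty + (w - alpha * b).pow(2).sum()
    return strength * penalty

# Typical usage inside a training loop:
# total_loss = task_loss + binarization_penalty(model.parameters())
```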
Reducing Gradient Error
Backpropagation in BNNs is complicated by the binarization function, whose gradient is zero almost everywhere. Methods like the Straight-Through Estimator (STE) work around this by approximating the gradient of the sign function, typically passing the incoming gradient through unchanged (often clipped to a bounded range such as [-1, 1]). More advanced techniques, such as soft quantization functions, reduce the gradient error further, bringing the performance of BNNs closer to that of full-precision networks.
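A small example makes the contrast concrete: the sign function gives no gradient signal, whereas a soft surrogate such as tanh with a temperature (a common choice, used here purely for illustration) yields usable gradients concentrated around zero, much like a clipped STE.

```python
import torch

def soft_sign(x: torch.Tensor, temperature: float = 5.0) -> torch.Tensor:
    """Differentiable surrogate for sign(x): tanh(t * x) approaches sign(x) as t grows."""
    return torch.tanh(temperature * x)

x = torch.linspace(-1.5, 1.5, steps=7, requires_grad=True)
soft_sign(x).sum().backward()

# Gradients are largest near zero and vanish for large |x|,
# mimicking the clipped straight-through estimator (gradient 1 on |x| <= 1, else 0).
print(x.grad)
```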
Real-World Applications and Hardware Implementations
One of the standout capabilities of BNNs is their suitability for real-world applications and deployment on hardware with limited computational resources. BNNs have been successfully deployed on FPGAs and even consumer-grade hardware like the Raspberry Pi for various tasks, including image classification, object detection, and semantic segmentation.
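Part of the reason BNNs map so well to such hardware is that a dot product between two {-1, +1} vectors reduces to an XNOR followed by a popcount on the packed bits. The toy function below (my own naming, with -1 encoded as bit 0 and +1 as bit 1) shows the arithmetic on 4-element vectors.

```python
def binary_dot(a_bits: int, b_bits: int, n: int) -> int:
    """Dot product of two {-1, +1} vectors of length n, packed into integers as bits."""
    xnor = ~(a_bits ^ b_bits) & ((1 << n) - 1)   # XNOR, masked to n bits
    matches = bin(xnor).count("1")               # popcount: positions where signs agree
    return 2 * matches - n                       # agreements minus disagreements

# a = [+1, -1, +1, -1] -> 0b1010, b = [+1, +1, -1, -1] -> 0b1100
print(binary_dot(0b1010, 0b1100, 4))  # 0, matching the full-precision dot product
```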
Performance Analysis
The paper evaluates various BNN methods on standard datasets such as CIFAR-10 and ImageNet. While BNNs can come close to matching full-precision networks on smaller datasets like CIFAR-10, a clear performance gap remains on larger datasets like ImageNet. Optimization-based approaches, however, show promise in narrowing this gap, particularly through techniques that reduce quantization and gradient errors.
Future Directions
The field of BNNs is ripe for further exploration. Future research may focus on:
- Network Architecture: Designing architectures inherently more suitable for binarization could yield higher performance.
- Task-Specific Optimization: Adapting BNNs for tasks other than image classification, such as object detection and natural language processing.
- Gradient Optimization: Developing better gradient approximation methods to enhance the training efficiency of BNNs.
Conclusion
Binary Neural Networks have shown tremendous potential in making deep learning models more efficient and accessible for deployment on low-resource hardware. The field is swiftly evolving, with optimization-based methods showing promising results in mitigating the typical performance losses associated with binarization. As research progresses, we can expect to see more robust and versatile BNNs that can rival their full-precision counterparts across various applications.