ReActXGB: A Hybrid Binary Convolutional Neural Network Architecture for Improved Performance and Computational Efficiency (2405.08020v1)
Abstract: Binary convolutional neural networks (BCNNs) provide a potential solution to reduce the memory requirements and computational costs associated with deep neural networks (DNNs). However, achieving a trade-off between performance and computational resources remains a significant challenge. Furthermore, the fully connected layer of BCNNs has evolved into a significant computational bottleneck. This is mainly due to the conventional practice of excluding the input layer and fully connected layer from binarization to prevent a substantial loss in accuracy. In this paper, we propose a hybrid model named ReActXGB, where we replace the fully convolutional layer of ReActNet-A with XGBoost. This modification targets to narrow the performance gap between BCNNs and real-valued networks while maintaining lower computational costs. Experimental results on the FashionMNIST benchmark demonstrate that ReActXGB outperforms ReActNet-A by 1.47% in top-1 accuracy, along with a reduction of 7.14% in floating-point operations (FLOPs) and 1.02% in model size.
- Z. Liu, Z. Shen, M. Savvides, and K. Cheng, “ReActNet: Towards precise binary neural network with generalized activation functions,” in European Conference on Computer Vision (ECCV), 2020, pp. 2980–2988.
- Y. Wang, W. Huang, Y. Dong, F. Sun and A. Yao, “Compacting binary neural networks by sparse kernel selection,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 24374-24383.
- Y. Zhang, J. Pan, X. Liu, H. Chen, D. Chen, and Z. Zhang, “FracBNN: Accurate and fpga-efficient binary neural networks with fractional activations,” in ACM/SIGDA International Symposium on FieldProgrammable Gate Arrays (FPGA), 2021, pp. 171–182.
- T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” in ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2016, pp. 785-794.
- K. He, X. Zhang, S. Ran, and J. Sun, “Deep residual learning for image recognition,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770-778.
- M. Rastegari, V. Ordonez, J. Redmon, and A. Farhadi, “XNOR-Net: ImageNet classification using binary convolutional neural networks.” arXiv preprint arXiv:1603.05279, 2016.