A Low-Cost Neural ODE with Depthwise Separable Convolution for Edge Domain Adaptation on FPGAs (2107.12824v4)
Abstract: High-performance deep neural network (DNN)-based systems are in high demand in edge environments. Due to its high computational complexity, it is challenging to deploy DNNs on edge devices with strict limitations on computational resources. In this paper, we derive a compact while highly-accurate DNN model, termed dsODENet, by combining recently-proposed parameter reduction techniques: Neural ODE (Ordinary Differential Equation) and DSC (Depthwise Separable Convolution). Neural ODE exploits a similarity between ResNet and ODE, and shares most of weight parameters among multiple layers, which greatly reduces the memory consumption. We apply dsODENet to a domain adaptation as a practical use case with image classification datasets. We also propose a resource-efficient FPGA-based design for dsODENet, where all the parameters and feature maps except for pre- and post-processing layers can be mapped onto on-chip memories. It is implemented on Xilinx ZCU104 board and evaluated in terms of domain adaptation accuracy, inference speed, FPGA resource utilization, and speedup rate compared to a software counterpart. The results demonstrate that dsODENet achieves comparable or slightly better domain adaptation accuracy compared to our baseline Neural ODE implementation, while the total parameter size without pre- and post-processing layers is reduced by 54.2% to 79.8%. Our FPGA implementation accelerates the inference speed by 23.8 times.
- Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, 2016.
- François Chollet. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1800–1807, Jul 2017.
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv:1704.04861, 2017.
- Searching for MobileNetV3. In Proceedings of the International Conference on Computer Vision (ICCV’19), pages 1314–1324, Oct 2019.
- Neural Ordinary Differential Equations. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeuroIPS’18), pages 6572–6583, Dec 2018.
- Accelerating ODE-Based Neural Networks on Low-Cost FPGAs. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS’21) Workshops, pages 88–95, Mar 2021.
- A Survey of FPGA-Based Neural Network Accelerator. arXiv:1712.08934v3, Dec 2018.
- Weight sparseness for a feature-map-split-cnn toward low-cost embedded fpgas. IEICE Transactions on Information and Systems, 104-D(12):2040–2047, 2021.
- AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 28(1):115–128, Jan 2020.
- A CNN Accelerator on FPGA Using Depthwise Separable Convolution. IEEE Transactions on Circuits and Systems II: Express Briefs, 65(10):1415–1419, Oct 2018.
- FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations. In Proceedings of the International Symposium on Field Programmable Gate Arrays, (FPGA21), pages 171–182, 2021.
- Edge intelligence: Paving the last mile of artificial intelligence with edge computing. Proceedings of the IEEE, 107(8):1738–1762, 2019.
- dsODENet: Neural ODE and Depthwise Separable Convolution for Domain Adaptation on FPGAs. In Proceedings of the Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP’22), pages 152–156, Mar 2022.
- MobileDA: Toward Edge-Domain Adaptation. IEEE Internet of Things Journal, 7(8):6909–6918, Aug 2020.
- Deep CORAL: Correlation Alignment for Deep Domain Adaptation. arXiv:1607.01719, 2016.
- Adapting Visual Category Models to New Domains. Proceedings of the European Conference in Computer Vision (ECCV’10), 6314:213–226, Sep 2010.
- Unsupervised Domain Adaptation by Backpropagation. In Proceedings of the International Conference on Machine Learning (ICML’15), pages 1180–1189, Jul 2015.
- The German Traffic Sign Recognition Benchmark: A Multi-class Classification Competition. In Proceedings of the International Joint Conference on Neural Networks (IJCNN’11), pages 1453–1460, Aug 2011.
- Reading Digits in Natural Images with Unsupervised Feature Learning. In Proceedings of the NIPS Workshop on Deep Learning and Unsupervised Feature Learning, Dec 2011.
- Gradient-based Learning Applied to Document Recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- Conditional Adversarial Domain Adaptation. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeuroIPS’18), pages 1640–1650, Dec 2018.
- CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation. In Proceedings of the Tenth International Conference on Learning Representations, (ICLR’22), Apr 2022.
- Domain Separation Networks. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeuroIPS’16), pages 343–351, Dec 2016.
- Discriminative feature alignment: Improving transferability of unsupervised domain adaptation by gaussian-guided latent alignment. Pattern Recognition, 116:107943, 2021.
- Adversarial Discriminative Domain Adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17), pages 2962–2971, Jul 2017.
- Self-ensembling for visual domain adaptation. In Proceedings of the International Conference on Learning Representations (ICLR’18), 2018.
- Hiroki Kawakami (3 papers)
- Hirohisa Watanabe (3 papers)
- Keisuke Sugiura (19 papers)
- Hiroki Matsutani (24 papers)