HybridDNN: A Framework for High-Performance Hybrid DNN Accelerator Design and Implementation (2004.03804v1)

Published 8 Apr 2020 in cs.AR and cs.CV

Abstract: To speed up Deep Neural Network (DNN) accelerator design and enable effective implementation, we propose HybridDNN, a framework for building high-performance hybrid DNN accelerators and delivering FPGA-based hardware implementations. Novel techniques include a highly flexible and scalable architecture with a hybrid Spatial/Winograd convolution (CONV) Processing Engine (PE), a comprehensive design space exploration tool, and a complete design flow to fully support accelerator design and implementation. Experimental results show that accelerators generated by HybridDNN deliver 3375.7 GOPS on a high-end FPGA (VU9P) and 83.3 GOPS on an embedded FPGA (PYNQ-Z1), a 1.8x performance improvement over state-of-the-art accelerator designs. This demonstrates that HybridDNN is flexible and scalable and can target both cloud and embedded hardware platforms with vastly different resource constraints.
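
The hybrid PE mentioned in the abstract mixes spatial (direct) and Winograd convolution dataflows. As a rough illustration of the arithmetic trade-off only, and not the paper's FPGA implementation, the NumPy sketch below computes the standard 1-D Winograd F(2,3) minimal-filtering transform, which produces two convolution outputs with 4 multiplications instead of the 6 required by direct convolution. The transform matrices are the usual Winograd constants and the function names are illustrative.

```python
import numpy as np

# Standard Winograd F(2,3) transform matrices (textbook constants,
# not taken from the HybridDNN paper).
B_T = np.array([[1,  0, -1,  0],
                [0,  1,  1,  0],
                [0, -1,  1,  0],
                [0,  1,  0, -1]], dtype=float)
G = np.array([[1.0,  0.0, 0.0],
              [0.5,  0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0,  0.0, 1.0]])
A_T = np.array([[1, 1,  1,  0],
                [0, 1, -1, -1]], dtype=float)

def winograd_f23(d, g):
    """Two outputs of a 3-tap convolution from a 4-sample tile, 4 multiplies."""
    return A_T @ ((G @ g) * (B_T @ d))

def spatial_f23(d, g):
    """Reference spatial (direct) computation of the same outputs, 6 multiplies."""
    return np.array([d[0]*g[0] + d[1]*g[1] + d[2]*g[2],
                     d[1]*g[0] + d[2]*g[1] + d[3]*g[2]])

d = np.array([1.0, 2.0, 3.0, 4.0])   # input tile
g = np.array([0.5, -1.0, 2.0])       # 3-tap filter
assert np.allclose(winograd_f23(d, g), spatial_f23(d, g))
```

In 2-D, tiling this transform (e.g., F(2x2, 3x3)) reduces multiplications per output at the cost of extra additions and transform logic, which is why a hybrid design can pick spatial or Winograd mode per layer depending on kernel size and resource constraints.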

Authors (5)
  1. Hanchen Ye (9 papers)
  2. Xiaofan Zhang (79 papers)
  3. Zhize Huang (1 paper)
  4. Gengsheng Chen (1 paper)
  5. Deming Chen (62 papers)
Citations (58)
