PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs (2406.04910v2)

Published 7 Jun 2024 in cs.LG, cs.AI, and cs.AR

Abstract: FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modeled using LUTs, help maximize this promise of offering ultra-low latency and high area efficiency on FPGAs. Unfortunately, LUT resource usage scales exponentially with the number of inputs to the LUT, restricting PolyLUT to small LUT sizes. This work introduces PolyLUT-Add, a technique that enhances neuron connectivity by combining $A$ PolyLUT sub-neurons via addition to improve accuracy. Moreover, we describe a novel architecture to improve its scalability. We evaluated our implementation over the MNIST, Jet Substructure classification, and Network Intrusion Detection benchmark and found that for similar accuracy, PolyLUT-Add achieves a LUT reduction of $2.0-13.9\times$ with a $1.2-1.6\times$ decrease in latency.

Authors (4)

Binglei Lou (3 papers)
Richard Rademacher (3 papers)
David Boland (6 papers)
Philip H. W. Leong (12 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/spatialmlnet/status/1801623034312167680

PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs (2406.04910v2)

Summary

Related Papers

Tweets