An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices (2105.13904v1)

Published 24 May 2021 in cs.AR and cs.LG

Abstract: In this paper, we develop an in-memory analog computing (IMAC) architecture that realizes both synaptic behavior and activation functions within non-volatile memory arrays. Spin-orbit torque magnetoresistive random-access memory (SOT-MRAM) devices are leveraged to realize sigmoidal neurons as well as binarized synapses. First, it is shown that the proposed IMAC architecture can be used to realize a multilayer perceptron (MLP) classifier achieving orders of magnitude performance improvement compared to previous mixed-signal and digital implementations. Next, a heterogeneous mixed-signal and mixed-precision CPU-IMAC architecture is proposed for convolutional neural network (CNN) inference on mobile processors, in which IMAC is designed as a co-processor that realizes the fully-connected (FC) layers, whereas the convolution layers are executed on the CPU. Architecture-level analytical models are developed to evaluate the performance and energy consumption of the CPU-IMAC architecture. Simulation results exhibit 6.5% and 10% energy savings for CPU-IMAC-based realizations of LeNet and VGG CNN models, for MNIST and CIFAR-10 pattern recognition tasks, respectively.
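
As a rough illustration of the computation the abstract assigns to the IMAC co-processor, the sketch below models a fully-connected layer with binarized synapses and sigmoidal neurons in plain Python. It is a purely functional software analogy, not the authors' circuit or simulation code: the function names (`binarize`, `fc_layer`, `mlp_forward`), the sign-based binarization rule, and the omission of bias terms are all assumptions made here for illustration.

```python
import math

def binarize(weights):
    """Map each real-valued weight to +1 or -1 by sign.

    Assumption: zero maps to +1; the source does not specify the rule.
    """
    return [[1 if w >= 0 else -1 for w in row] for row in weights]

def sigmoid(x):
    """Logistic sigmoid, standing in for the analog sigmoidal neuron."""
    return 1.0 / (1.0 + math.exp(-x))

def fc_layer(inputs, weights):
    """One fully-connected layer: binarized weights, then sigmoid.

    `weights` holds one row per output neuron (hypothetical layout).
    """
    bin_weights = binarize(weights)
    return [sigmoid(sum(w * x for w, x in zip(row, inputs)))
            for row in bin_weights]

def mlp_forward(inputs, layers):
    """Chain several fully-connected layers, as in an MLP classifier."""
    activations = inputs
    for weights in layers:
        activations = fc_layer(activations, weights)
    return activations
```

In the CPU-IMAC split described above, the convolution layers would run on the CPU and only their flattened output would be passed to something like `mlp_forward`; the analog array would evaluate each layer's weighted sums and activations in place rather than in a software loop.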

Authors (5)

Mohammed Elbtity (6 papers)
Abhishek Singh (71 papers)
Brendan Reidy (6 papers)
Xiaochen Guo (5 papers)
Ramtin Zand (38 papers)

Citations (15)
