A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference (2212.02872v1)
Abstract: The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly within the network weights stored on a chip to execute an inference workload. However, to achieve end-to-end improvements in latency and energy consumption, AIMC must be combined with on-chip digital operations and communication to move towards configurations in which a full inference workload is realized entirely on-chip. Moreover, it is highly desirable to achieve high MVM and inference accuracy without application-wise re-tuning of the chip. Here, we present a multi-core AIMC chip designed and fabricated in 14-nm complementary metal-oxide-semiconductor (CMOS) technology with backend-integrated phase-change memory (PCM). The fully-integrated chip features 64 256x256 AIMC cores interconnected via an on-chip communication network. It also implements the digital activation functions and processing involved in ResNet convolutional neural networks and long short-term memory (LSTM) networks. We demonstrate near software-equivalent inference accuracy with ResNet and LSTM networks while implementing all the computations associated with the weight layers and the activation functions on-chip. The chip can achieve a maximal throughput of 63.1 TOPS at an energy efficiency of 9.76 TOPS/W for 8-bit input/output matrix-vector multiplications.
- Manuel Le Gallo (33 papers)
- Riduan Khaddam-Aljameh (2 papers)
- Milos Stanisavljevic (1 paper)
- Athanasios Vasilopoulos (7 papers)
- Benedikt Kersting (7 papers)
- Martino Dazzi (9 papers)
- Geethan Karunaratne (25 papers)
- Matthias Braendli (3 papers)
- Abhairaj Singh (1 paper)
- Silvia M. Mueller (1 paper)
- Julian Buechel (2 papers)
- Xavier Timoneda (8 papers)
- Vinay Joshi (8 papers)
- Urs Egger (3 papers)
- Angelo Garofalo (33 papers)
- Anastasios Petropoulos (5 papers)
- Theodore Antonakopoulos (3 papers)
- Kevin Brew (4 papers)
- Samuel Choi (2 papers)
- Timothy Philip (2 papers)