On the Reliability of Computing-in-Memory Accelerators for Deep Neural Networks (2205.13018v1)

Published 25 May 2022 in cs.AR

Abstract: Computing-in-memory with emerging non-volatile memory (nvCiM) is shown to be a promising candidate for accelerating deep neural networks (DNNs) with high energy efficiency. However, most non-volatile memory (NVM) devices suffer from reliability issues, resulting in a difference between actual data involved in the nvCiM computation and the weight value trained in the data center. Thus, models actually deployed on nvCiM platforms achieve lower accuracy than their counterparts trained on the conventional hardware (e.g., GPUs). In this chapter, we first offer a brief introduction to the opportunities and challenges of nvCiM DNN accelerators and then show the properties of different types of NVM devices. We then introduce the general architecture of nvCiM DNN accelerators. After that, we discuss the source of unreliability and how to efficiently model their impact. Finally, we introduce representative works that mitigate the impact of device variations.

Authors (3)

Zheyu Yan (23 papers)
Xiaobo Sharon Hu (34 papers)
Yiyu Shi (136 papers)

Citations (5)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

On the Reliability of Computing-in-Memory Accelerators for Deep Neural Networks (2205.13018v1)

Summary

Related Papers