- The paper introduces mixed-precision in-memory computing by integrating digital precision with nanoscale resistive memory to overcome energy and latency challenges.
- It experimentally validates the approach by solving a system of 5,000 linear equations with nearly one million phase-change memory devices, achieving accuracy comparable to fully digital processing.
- The study demonstrates significant energy and performance improvements, paving the way for scalable, data-intensive computing applications.
Insights into Mixed-Precision In-Memory Computing
The paper "Mixed-Precision In-Memory Computing," authored by Manuel Le Gallo et al., proposes a hybrid computing approach that couples a von Neumann machine with nanoscale resistive memory devices to enhance computational efficiency without sacrificing accuracy. The method addresses a central inefficiency of traditional von Neumann architectures: the latency and energy cost of shuttling data between processing and memory units. By combining the high precision of digital computing with the efficiency of in-memory computing, the work aims at practical solutions for fields that must process large-scale data.
Theoretical Innovation: Integration of Two Computational Paradigms
The core contribution is the concept of mixed-precision in-memory computing, which melds digital precision with the areal and energy efficiency of in-memory systems. In this design, a computational memory unit performs the bulk of the arithmetic inexactly, while a von Neumann machine iteratively refines the result: it computes residuals in high precision and feeds corrections back through the memory unit. This division of labor achieves high precision without the energy cost of an entirely digital implementation, using the von Neumann machine's exactness to counterbalance the intrinsic variability and inaccuracy of nanoscale resistive memory devices.
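The interplay of the two units can be illustrated with a small numerical sketch in Python with NumPy: the outer loop plays the role of the von Neumann machine, computing exact residuals, while a deliberately noisy inner solver stands in for the computational memory unit. The noise model, the choice of Richardson iteration, and all function names here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def noisy_matvec(A, x, noise=0.02):
    # Stand-in for an analog matrix-vector product: the exact result is
    # perturbed by additive noise scaled to the output magnitude,
    # mimicking the variability of resistive memory devices.
    y = A @ x
    return y + noise * np.abs(y).max() * rng.standard_normal(y.shape)

def inexact_solve(A, r, iters=20):
    # Low-precision inner solve (Richardson iteration) built on the noisy
    # matvec; any rough solver works because the outer loop corrects it.
    omega = 1.0 / np.linalg.norm(A, 2)  # safe step size for SPD A
    z = np.zeros_like(r)
    for _ in range(iters):
        z = z + omega * (r - noisy_matvec(A, z))
    return z

def mixed_precision_solve(A, b, outer_iters=50, tol=1e-10):
    # Outer loop in full precision: compute the residual r = b - A x
    # exactly, obtain an approximate correction z with A z ≈ r from the
    # low-precision unit, and update the solution.
    x = np.zeros_like(b)
    for _ in range(outer_iters):
        r = b - A @ x  # high-precision residual
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            break
        x = x + inexact_solve(A, r)
    return x
```

Even though each inner solve is accurate only to a few percent, the residual shrinks geometrically across outer iterations, so the final solution reaches full-precision accuracy; this is the standard iterative-refinement argument on which the mixed-precision scheme rests.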
Experimental Validation: Solving Linear Systems
A significant portion of the empirical work involves solving systems of linear equations, a computational kernel at the heart of data-centric applications such as cognitive computing and data analytics. The authors employed phase-change memory (PCM) devices to realize scalar multiplications and matrix-vector multiplications experimentally. They solved a system of 5,000 linear equations using nearly one million PCM devices, demonstrating that the mixed-precision system can handle substantial computational tasks while matching the accuracy of high-precision processing alone.
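As a toy illustration of how a matrix-vector product could be carried out in memory, the sketch below encodes each matrix element as a differential pair of quantized device conductances and reads the product back as a difference of accumulated column currents. This is a simplified exposition model; the conductance range, the number of levels, and the programming scheme are assumptions for the example, not the paper's experimental setup.

```python
import numpy as np

def program_conductances(A, g_max=1.0, levels=256):
    # Map each matrix element to a differential pair of conductances
    # (G+ for positive entries, G- for negative ones), quantized to a
    # limited number of programmable levels -- a toy stand-in for
    # programming a resistive crossbar array.
    scale = g_max / np.abs(A).max()
    g_pos = np.clip(A, 0, None) * scale
    g_neg = np.clip(-A, 0, None) * scale
    step = g_max / (levels - 1)
    quantize = lambda g: np.round(g / step) * step
    return quantize(g_pos), quantize(g_neg), scale

def crossbar_matvec(g_pos, g_neg, scale, x):
    # Analog-style multiply: currents through the G+ and G- devices of
    # each row are accumulated, the differential is read out, and the
    # result is rescaled back to matrix units.
    return (g_pos @ x - g_neg @ x) / scale
```

The quantization error of each differential pair is bounded by half a conductance step, which is what limits the raw precision of such a multiply and motivates the digital refinement loop described above.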
System Implementation and Evaluation
In terms of system performance, the paper argues that the mixed-precision approach can outperform high-precision-only computing in both speed and energy efficiency. By performing the bulk of the operations within the computational memory unit, the proposed system minimizes data movement, a primary contributor to energy inefficiency in conventional architectures. Experimental results corroborate this, showing substantial energy gains over CPU and GPU executions.
Implications and Future Directions
The implementation of mixed-precision in-memory computing marks a key step toward sustainable and efficient computing architectures, especially for applications involving large datasets. The paper suggests extending the framework to other computationally demanding domains such as automatic control, optimization, and machine learning. Moreover, improvements in the precision of memristive devices, or the addition of error-correction mechanisms, could broaden the approach's scope and scalability to tackle ill-conditioned problems and larger datasets.
In summary, the paper by Le Gallo et al. makes a substantial contribution to the field of computer architecture, presenting a feasible path toward combining the strengths of traditional and emerging computing paradigms. Through methodical experimentation and solid theoretical grounding, it lays the groundwork for the practical deployment of energy-efficient, large-scale computational systems.