Low-power scalable multilayer optoelectronic neural networks enabled with incoherent light (2402.01988v1)
Abstract: Optical approaches have made great strides towards the goal of high-speed, energy-efficient computing necessary for modern deep learning and AI applications. Read-in and read-out of data, however, limit the overall performance of existing approaches. This study introduces a multilayer optoelectronic computing framework that alternates between optical and optoelectronic layers to implement matrix-vector multiplications and rectified linear functions, respectively. Our framework is designed for real-time, parallelized operations, leveraging 2D arrays of LEDs and photodetectors connected via independent analog electronics. We experimentally demonstrate this approach using a system with a three-layer network with two hidden layers and operate it to recognize images from the MNIST database with a recognition accuracy of 92% and classify classes from a nonlinear spiral data with 86% accuracy. By implementing multiple layers of a deep neural network simultaneously, our approach significantly reduces the number of read-ins and read-outs required and paves the way for scalable optical accelerators requiring ultra low energy.
- Deep learning. Nature 521, 436–444 (2015).
- AI and compute, 2018. https://openai.com/blog/ai-and-compute 4 (2018).
- Sevilla, J. et al. Compute trends across three eras of machine learning. arXiv preprint arXiv:2202.05924 (2022).
- Compute and energy consumption trends in deep learning inference. arXiv preprint arXiv:2109.05472 (2021).
- Horowitz, M. 1.1 computing’s energy problem (and what we can do about it). In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 10–14 (IEEE, 2014).
- Christensen, D. V. et al. 2022 roadmap on neuromorphic computing and engineering. Neuromorphic Computing and Engineering (2022).
- Why future supercomputing requires optics. Nature Photonics 4, 261–263 (2010).
- Fully parallel, high-speed incoherent optical method for performing discrete fourier transforms. Optics Letters 2, 1–3 (1978).
- Optical implementation of the hopfield model. Applied Optics 24, 1469–1475 (1985).
- Optical neurochip based on a three-layered feed-forward model. Optics Letters 15, 1362–1364 (1990).
- Adaptive optical networks using photorefractive crystals. Applied Optics 27, 1752–1759 (1988).
- Denz, C. Optical neural networks (Springer Science & Business Media, 2013).
- Miller, D. A. Attojoule optoelectronics for low-energy information processing and communications. Journal of Lightwave Technology 35, 346–396 (2017).
- Wetzstein, G. et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39–47 (2020).
- Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nature Photonics 15, 102–114 (2021).
- Large-scale optical neural networks based on photoelectric multiplication. Physical Review X 9, 021032 (2019).
- Bogaerts, W. et al. Programmable photonic circuits. Nature 586, 207–216 (2020).
- Tait, A. N. et al. Neuromorphic photonic networks using silicon photonic weight banks. Scientific Reports 7, 1–10 (2017).
- Tait, A. N. et al. Silicon photonic modulator neuron. Physical Review Applied 11, 064043 (2019).
- Nahmias, M. A. et al. Photonic multiply-accumulate operations for neural networks. IEEE Journal of Selected Topics in Quantum Electronics 26, 1–18 (2019).
- Feldmann, J. et al. Parallel convolutional processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
- Xu, X. et al. 11 tops photonic convolutional accelerator for optical neural networks. Nature 589, 44–51 (2021).
- An on-chip photonic deep neural network for image classification. Nature 1–6 (2022).
- Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).
- Miscuglio, M. et al. Massively parallel amplitude-only fourier neural network. Optica 7, 1812–1819 (2020).
- Fully reconfigurable coherent optical vector–matrix multiplication. Optics Letters 45, 5752–5755 (2020).
- Bernstein, L. et al. Freely scalable and reconfigurable optical hardware for deep learning. Scientific reports 11, 1–12 (2021).
- Zhou, T. et al. Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit. Nature Photonics 15, 367–373 (2021).
- Bernstein, L. et al. Single-shot optical neural network. arXiv preprint arXiv:2205.09103 (2022).
- Wang, T. et al. An optical neural network using less than 1 photon per multiplication. Nature Communications 13, 1–8 (2022).
- Zhang, H. et al. An optical neural chip for implementing complex-valued neural network. Nature Communications 12, 1–11 (2021).
- Zuo, Y. et al. All-optical neural network with nonlinear activation functions. Optica 6, 1132–1137 (2019).
- Li, G. H. et al. All-optical ultrafast relu function for energy-efficient nanophotonic deep learning. arXiv preprint arXiv:2201.03787 (2022).
- Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Scientific reports 8, 1–10 (2018).
- Wang, T. et al. Image sensing with multilayer nonlinear optical neural networks. Nature Photonics 17, 408–415 (2023).
- Williamson, I. A. et al. Reprogrammable electro-optic nonlinear activation functions for optical neural networks. IEEE Journal of Selected Topics in Quantum Electronics 26, 1–12 (2019).
- Photonic extreme learning machine by free-space optical propagation. Photonics Research 9, 1446–1454 (2021).
- Shi, W. et al. Loen: Lensless opto-electronic neural network empowered machine vision. Light: Science & Applications 11, 1–12 (2022).
- Miller, D. A. All linear optical devices are mode converters. Optics Express 20, 23985–23993 (2012).
- Optical complex media as universal reconfigurable linear operators. Optica 6, 465–472 (2019).
- All-optical synthesis of an arbitrary linear transformation using diffractive surfaces. Light: Science & Applications 10, 1–21 (2021).
- Arbitrary linear transformations for photons in the frequency synthetic dimension. Nature Communications 12, 1–9 (2021).
- Patterson, D. Good news about the carbon footprint of machine learning training. https://blog.research.google/2022/02/good-news-about-carbon-footprint-of.html (2022). URL https://blog.research.google/2022/02/good-news-about-carbon-footprint-of.html.
- Viswanathula, R. Global AI and data science. https://community.ibm.com/community/user/ai-datascience/blogs/rachana-vishwanathula/2023/05/04/estimating-chatgpts-carbon-footprint (2023).
- Mini-led, micro-led and oled displays: Present status and future perspectives. Light: Science & Applications 9, 1–16 (2020).
- Deng, L. The mnist database of handwritten digit images for machine learning research [best of the web]. IEEE Signal Processing Magazine 29, 141–142 (2012).