Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 133 tok/s
Gemini 2.5 Pro 54 tok/s Pro
GPT-5 Medium 30 tok/s Pro
GPT-5 High 34 tok/s Pro
GPT-4o 61 tok/s Pro
Kimi K2 194 tok/s Pro
GPT OSS 120B 430 tok/s Pro
Claude Sonnet 4.5 39 tok/s Pro
2000 character limit reached

Domain wall and Magnetic Tunnel Junction Hybrid for on-chip Learning in UNet architecture (2403.02863v2)

Published 5 Mar 2024 in cs.ET, eess.IV, and physics.app-ph

Abstract: We present spintronic devices based hardware implementation of UNet for segmentation tasks. Our approach involves designing hardware for convolution, deconvolution, rectified activation function (ReLU), and max pooling layers of the UNet architecture. We designed the convolution and deconvolution layers of the network using the synaptic behavior of the domain wall MTJ. We also construct the ReLU and max pooling functions of the network utilizing the spin hall driven orthogonal current injected MTJ. To incorporate the diverse physics of spin-transport, magnetization dynamics, and CMOS elements in our UNet design, we employ a hybrid simulation setup that couples micromagnetic simulation, non-equilibrium Green's function, SPICE simulation along with network implementation. We evaluate our UNet design on the CamVid dataset and achieve segmentation accuracies of 83.71$\%$ on test data, on par with the software implementation with 821mJ of energy consumption for on-chip training over 150 epochs. We further demonstrate nearly one order $(10\times)$ improvement in the energy requirement of the network using unstable ferromagnet ($\Delta$=4.58) over the stable ferromagnet ($\Delta$=45) based ReLU and max pooling functions while maintaining the similar accuracy. The hybrid architecture comprising domain wall MTJ and unstable FM-based MTJ leads to an on-chip energy consumption of 85.79mJ during training, with a testing energy cost of 1.55 $\mu J$.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. Semantic image segmentation and object labeling. IEEE transactions on circuits and systems for video technology, 17(3):298–312, 2007.
  2. Recent progress in semantic image segmentation. Artificial Intelligence Review, 52:1089–1106, 2019.
  3. Martin Thoma. A survey of semantic segmentation. arXiv preprint arXiv:1602.06541, 2016.
  4. Deep semantic segmentation of natural and medical images: a review. Artificial Intelligence Review, 54:137–178, 2021.
  5. Deep learning for image and point cloud fusion in autonomous driving: A review. IEEE Transactions on Intelligent Transportation Systems, 23(2):722–739, 2021.
  6. An assessment of support vector machines for land cover classification. International Journal of remote sensing, 23(4):725–749, 2002.
  7. Neuromorphic spintronics. Nature electronics, 3(7):360–370, 2020.
  8. High speed vlsi architecture for improved region based active contour segmentation technique. Integration, 77:25–37, 2021.
  9. Optimizing cnn-based segmentation with deeply customized convolutional and deconvolutional architectures on fpga. ACM Transactions on Reconfigurable Technology and Systems (TRETS), 11(3):1–22, 2018.
  10. 4gbit density stt-mram using perpendicular mtj realized with compact cell structure. In 2016 IEEE International Electron Devices Meeting (IEDM), pages 27–1. IEEE, 2016.
  11. Orthogonal spin current injected magnetic tunnel junction for convolutional neural networks. IEEE Transactions on Electron Devices, 70(7):3943–3950, 2023.
  12. Resonant spin-transfer-torque nano-oscillators. Physical Review Applied, 8(6):064014, 2017.
  13. Review on spintronics: Principles and device applications. Journal of Magnetism and Magnetic Materials, 509:166711, 2020.
  14. Implementing p-bits with embedded mtj. IEEE Electron Device Letters, 38(12):1767–1770, 2017.
  15. Enhancing image segmentation performance with mram based processing-in-memory architecture. In 2023 IEEE Nanotechnology Materials and Devices Conference (NMDC), pages 836–841. IEEE, 2023.
  16. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015.
  17. Memristive crossbar arrays for brain-inspired computing. Nature materials, 18(4):309–323, 2019.
  18. A crossbar array of magnetoresistive memory devices for in-memory computing. Nature, 601(7892):211–216, 2022.
  19. On-chip learning of a domain-wall-synapse-crossbar-array-based convolutional neural network. Neuromorphic Computing and Engineering, 2(2):024006, 2022.
  20. Red: A reram-based efficient accelerator for deconvolutional computation. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(12):4736–4747, 2020.
  21. Regan: A pipelined reram-based accelerator for generative adversarial networks. In 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), pages 178–183. IEEE, 2018.
  22. Deep learning. MIT press, 2016.
  23. She-mtj based relu-max pooling functions for on-chip training of neural networks. AIP Advances, 14(2):025130, 2024.
  24. Deep sparse rectifier neural networks. In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pages 315–323. JMLR Workshop and Conference Proceedings, 2011.
  25. Improvement of learning for cnn with relu activation by sparse regularization. In 2017 international joint conference on neural networks (IJCNN), pages 2684–2691. IEEE, 2017.
  26. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), pages 807–814, 2010.
  27. The design and verification of mumax3. AIP advances, 4(10), 2014.
  28. Effects of spatially engineered dzyaloshinskii-moriya interaction in ferromagnetic films. Physical Review B, 95(14):144401, 2017.
  29. Proposal for an all-spin artificial neural network: Emulating neural and synaptic functionalities through domain wall motion in ferromagnets. IEEE transactions on biomedical circuits and systems, 10(6):1152–1160, 2016.
  30. On-chip learning for domain wall synapse based fully connected neural network. Journal of Magnetism and Magnetic Materials, 489:165434, 2019.
  31. Predictive technology model (ptm). https://ptm.asu.edu/.
  32. The non-equilibrium green function (negf) method. arXiv preprint arXiv:2008.01275, 2020.
  33. Voltage asymmetry of spin-transfer torques. IEEE Transactions on Nanotechnology, 11(2):261–272, 2011.
  34. John C Slonczewski. Current-driven excitation of magnetic multilayers. Journal of Magnetism and Magnetic Materials, 159(1-2):L1–L7, 1996.
  35. Physics-based spice-compatible compact model for simulating hybrid mtj/cmos circuits. IEEE Transactions on Electron Devices, 60(9):2808–2814, 2013.
  36. Spin angular momentum transfer in a current-perpendicular spin-valve nanomagnet. In Quantum Sensing and Nanophotonic Devices, volume 5359, pages 445–455. SPIE, 2004.
  37. Spin-orbit torques: Materials, mechanisms, performances, and potential applications. Progress in Materials Science, 118:100761, 2021.
  38. Current-induced switching of perpendicularly magnetized magnetic layers using spin torque from the spin hall effect. Physical review letters, 109(9):096602, 2012.
  39. Spin current, spin accumulation and spin hall effect. Science and Technology of Advanced Materials, 9(1):014105, 2008.
  40. Current-driven dynamics of chiral ferromagnetic domain walls. Nature materials, 12(7):611–616, 2013.
  41. Current-driven dynamics of dzyaloshinskii domain walls in the presence of in-plane fields: Full micromagnetic and one-dimensional analysis. Journal of Applied Physics, 115(21), 2014.
  42. Magnetic properties and field-driven dynamics of chiral domain walls in epitaxial pt/co/au x pt 1- x trilayers. Physical Review B, 98(21):214413, 2018.
  43. Highly efficient spin-current generation by the spin hall effect in au 1- x pt x. Physical Review Applied, 10(3):031001, 2018.
  44. Power efficient relu design for neuromorphic computing using spin hall effect. Journal of Physics D: Applied Physics, 56(41):415001, 2023.
  45. Semantic object classes in video: A high-definition ground truth database. Pattern Recognition Letters, 30(2):88–97, 2009.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.