Efficient and Scalable Architecture for Multiple-chip Implementation of Simulated Bifurcation Machines
Abstract: Ising machines are specialized computers for finding the lowest energy states of Ising spin models, onto which many practical combinatorial optimization problems can be mapped. Simulated bifurcation (SB) is a quantum-inspired parallelizable algorithm for Ising problems that enables scalable multi-chip implementations of Ising machines. However, the computational performance of a previously proposed multi-chip architecture tends to saturate as the number of chips increases for a given problem size because both computation and communication are exclusive in the time domain. In this paper, we propose a streaming architecture for multi-chip implementations of SB-based Ising machines with full spin-to-spin connectivity. The data flow in in-chip computation is harmonized with the data flow in inter-chip communication, enabling the computation and communication to overlap and the communication time to be hidden. Systematic experiments demonstrate linear strong scaling of performance up to the vicinity of the ideal communication limit determined only by the latency of chip-to-chip communication. Our eight-FPGA (field-programmable gate array) cluster can compute a 32,768-spin problem with a high pipeline efficiency of 97.9%. The performance of a 79-FPGA cluster for a 100,000-spin problem, projected using a theoretical performance model validated on smaller experimental clusters, is comparable to that of a state-of-the-art 100,000-spin optical Ising machine.
- A currency arbitrage machine based on the simulated bifurcation algorithm for ultrafast detection of optimal opportunity. In 2020 IEEE International Symposium on Circuits and Systems (ISCAS), pages 1–5, Oct 2020.
- Correlation-diversified portfolio construction by finding maximum independent set in large-scale market graph. arXiv preprint arXiv:2308.04769, 2023.
- Pilot allocation optimization using digital annealer for multi-cell massive mimo. In 2022 IEEE Wireless Communications and Networking Conference (WCNC), pages 2304–2309, April 2022.
- High-speed resource allocation algorithm using a coherent ising machine for NOMA systems. arXiv preprint arXiv:2212.01578, 2022.
- Efficient migration to the next generation of networks based on digital annealing. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4740–4744, June 2021.
- Thomas Morstyn. Annealing-based quantum computing for combinatorial optimal power flow. IEEE Transactions on Smart Grid, pages 1–1, 2022.
- Andrew Lucas. Ising formulations of many np problems. Frontiers in Physics, 2, 2014.
- FÂ Barahona. On the computational complexity of ising spin glass models. Journal of Physics A: Mathematical and General, 15(10):3241, oct 1982.
- Quantum annealing with manufactured spins. Nature, 473(7346):194–198, 2011.
- Digital annealer for high-speed solving of combinatorial optimization problems and its applications. In 2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC), pages 667–672, Jan 2020.
- 100,000-spin coherent ising machine. Science Advances, 7(40):eabh0952, 2021.
- Statica: A 512-spin 0.25m-weight annealing processor with an all-spin-updates-at-once architecture for combinatorial optimization with complete spin–spin interactions. IEEE Journal of Solid-State Circuits, 56(1):165–178, Jan 2021.
- Binary optimization by momentum annealing. Phys. Rev. E, 100:012111, Jul 2019.
- Increasing ising machine capacity with multi-chip architectures. In Proceedings of the 49th Annual International Symposium on Computer Architecture, ISCA ’22, page 508–521, New York, NY, USA, 2022. Association for Computing Machinery.
- Massively parallel probabilistic computing with sparse ising machines. Nature Electronics, 5(7):460–468, 2022.
- Large-scale photonic ising machine by spatial light modulation. Phys. Rev. Lett., 122:213902, May 2019.
- Photonic spatial-euler ising machine for solving 20000-node max-cut problem. arXiv preprint arXiv:2301.04651, 2023.
- Ising machines as hardware solvers of combinatorial optimization problems. Nature Reviews Physics, 4(6):363–379, 2022.
- Combinatorial optimization by simulating adiabatic bifurcations in nonlinear hamiltonian systems. Science Advances, 5(4):eaav2372, 2019.
- Hayato Goto. Bifurcation-based adiabatic quantum computation with a nonlinear oscillator network. Scientific reports, 6(1):1–8, 2016.
- Hayato Goto. Quantum computation based on quantum adiabatic bifurcations of kerr-nonlinear parametric oscillators. Journal of the Physical Society of Japan, 88(6):061015, 2019.
- Fpga-based simulated bifurcation machine. In 2019 29th International Conference on Field Programmable Logic and Applications (FPL), pages 59–66, 2019.
- Pairs-trading system using quantum-inspired combinatorial optimization accelerator for optimal path search in market graphs. IEEE Access, 11:104406 – 104416, 2023.
- Real-time trading system based on selections of potentially profitable, uncorrelated, and balanced stocks by np-hard combinatorial optimization. IEEE Access, 11:120023 – 120033, 2023.
- Distance-based clustering using qubo formulations. Scientific reports, 12(1):2669, 2022.
- A reconfigurable fabric for accelerating large-scale datacenter services. In 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA), pages 13–24, June 2014.
- A cloud-scale acceleration architecture. In 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 1–13, 2016.
- Scaling performance for n-body stream computation with a ring of fpgas. In Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019, New York, NY, USA, 2019. Association for Computing Machinery.
- Hierarchical tree algorithm for collisional N-body simulations on GRAPE. Publications of the Astronomical Society of Japan, 68(3):30, 03 2016.
- Opencl-enabled parallel raytracing for astrophysical application on multiple fpgas with optical links. In 2020 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC), pages 48–55, Nov 2020.
- Dfx: A low-latency multi-fpga appliance for accelerating transformer-based text generation. In 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 616–630. IEEE, 2022.
- Aigean: An open framework for deploying machine learning on heterogeneous clusters. ACM Trans. Reconfigurable Technol. Syst., 15(3), dec 2022.
- A scalable emulator for quantum fourier transform using multiple-fpgas with high-bandwidth-memory. IEEE Access, 10:65103–65117, 2022.
- Streaming message interface: High-performance distributed memory programming on reconfigurable hardware. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC ’19, New York, NY, USA, 2019. Association for Computing Machinery.
- A 1.3-mbit annealing system composed of fully-synchronized 9-board x 9-chip x 16-kbit annealing processor chips for large-scale combinatorial optimization problems. In 2021 IEEE Asian Solid-State Circuits Conferenc e (A-SSCC), pages 1–3, Nov 2021.
- Highly-parallel fpga accelerator for simulated quantum annealing. IEEE Transactions on Emerging Topics in Computing, 9(4):2019–2029, 2021.
- Scalable fully coupled annealing processing system and multi-chip fpga implementation. Microprocessors and Microsystems, 95:104674, 2022.
- Amorphica: 4-replica 512 fully connected spin 336mhz metamorphic annealer with programmable optimization strategy and compressed-spin-transfer multi-chip extension. In 2023 IEEE International Solid- State Circuits Conference (ISSCC), pages 42–44, 2023.
- Scaling out ising machines using a multi-chip architecture for simulated bifurcation. Nature Electronics, 4(3):208–217, Mar 2021.
- Roofline: An insightful visual performance model for multicore architectures. Commun. ACM, 52(4):65–76, apr 2009.
- High-performance combinatorial optimization based on classical mechanics. Science Advances, 7(6):eabe7953, 2021.
- Simulated bifurcation assisted by thermal fluctuation. Communications Physics, 5(1):153, 2022.
- Intel Corporation. Intel fpga programmable acceleration card d5005 product brief.
- Intel Corporation. Serial lite iii streaming intel fpga ip user guide.
- Intel Corporation. Intel fpga sdk for opencl pro edition: Programming guide.
- Aval Global Co., Ltd. Apx-aa10l1 – aval global. http://aval-global.com/?page_id=1817.
- Intel Corporation. Intel arria 10 device overview. https://www.intel.com/content/www/us/en/docs/programmable/683332/current/device-overview.html.
- Intel Corporation. Intel fpga sdk for opencl pro edition: Programming guide. https://www.intel.com/content/www/us/en/docs/programmable/683846/17-1/introduction.html.
- Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Transactions on Modeling and Computer Simulation (TOMACS), 8(1):3–30, 1998.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.