Theoretical Analysis of the Efficient-Memory Matrix Storage Method for Quantum Emulation Accelerators with Gate Fusion on FPGAs (2410.11146v1)
Abstract: Quantum emulators play an important role in the development and testing of quantum algorithms, especially given the limitations of the current FTQC era. Developing high-speed, memory-optimized quantum emulators is a growing research trend, and gate fusion is a promising technique for this purpose. However, existing gate fusion implementations often struggle to support large-scale quantum systems with many qubits efficiently, because they are not optimized for the exponential growth in memory requirements. This study therefore proposes the EMMS (Efficient-Memory Matrix Storage) method for storing quantum operators and states, along with an EMMS-based Quantum Emulator Accelerator (QEA) architecture that incorporates multiple processing elements (PEs) to accelerate the tensor product and matrix multiplication computations in quantum emulation with gate fusion. A theoretical analysis of the QEA on the Xilinx ZCU102 FPGA, using varying numbers of PEs and different depths of unitary and local data memory, reveals a linear increase in memory depth with the number of qubits. This scaling highlights the potential of the EMMS-based QEA to accommodate larger quantum circuits and provides insight into selecting appropriate memory sizes and FPGA devices. Furthermore, the estimated performance of the QEA with PE counts ranging from $2^2$ to $2^5$ on the Xilinx ZCU102 FPGA shows that increasing the number of PEs substantially reduces the computation cycle count for circuits with fewer than 18 qubits, making the QEA significantly faster than previous works.
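The two operations the abstract names, tensor product and matrix multiplication, can be illustrated with a minimal gate fusion sketch in plain Python. This is not the EMMS method or the QEA datapath, whose details are not given in the abstract; it uses ordinary dense matrices, and the helper names (`kron`, `matmul`, `matvec`, `dense_unitary_entries`) are hypothetical names chosen for this illustration. It fuses a Hadamard layer and a CNOT into one unitary, then applies that unitary once to the state.

```python
import math

def kron(a, b):
    """Kronecker (tensor) product of two dense matrices stored as lists of rows."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]

def matmul(a, b):
    """Dense matrix-matrix product."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b]
            for row in a]

def matvec(m, v):
    """Dense matrix-vector product."""
    return [sum(x * y for x, y in zip(row, v)) for row in m]

def dense_unitary_entries(n_qubits):
    """Entries in a dense 2^n x 2^n operator: the growth that motivates compact storage."""
    return (2 ** n_qubits) ** 2

S = 1 / math.sqrt(2)
H = [[S, S], [S, -S]]                      # Hadamard gate
I2 = [[1, 0], [0, 1]]                      # single-qubit identity
CNOT = [[1, 0, 0, 0],                      # control is qubit 0 (most significant bit)
        [0, 1, 0, 0],
        [0, 0, 0, 1],
        [0, 0, 1, 0]]

# Layer 1 acts with H on qubit 0 only, so it is expanded with a tensor product.
layer1 = kron(H, I2)

# Gate fusion: multiply the layers into one unitary (later gates go on the left).
fused = matmul(CNOT, layer1)

# Apply the fused unitary once to |00>; the result is a Bell state.
state = [1, 0, 0, 0]
bell = matvec(fused, state)

# Same result as applying the two layers one after the other.
sequential = matvec(CNOT, matvec(layer1, state))
```

Fusing trades several state updates for one, at the cost of building a larger operator: `dense_unitary_entries(n)` grows as 4^n, which is the exponential memory growth the abstract says existing implementations do not handle well.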