Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware (2411.00230v2)
Abstract: Designing quantum circuits for specific tasks is challenging due to the exponential growth of the state space. We introduce gadget reinforcement learning (GRL), which integrates reinforcement learning with program synthesis to automatically generate and incorporate composite gates (gadgets) into the action space. This enhances the exploration of parameterized quantum circuits (PQCs) for complex tasks like approximating ground states of quantum Hamiltonians, an NP-hard problem. We evaluate GRL using the transverse field Ising model under typical computational budgets (e.g., 2- 3 days of GPU runtime). Our results show improved accuracy, hardware compatibility and scalability. GRL exhibits robust performance as the size and complexity of the problem increases, even with constrained computational resources. By integrating gadget extraction, GRL facilitates the discovery of reusable circuit components tailored for specific hardware, bridging the gap between algorithmic design and practical implementation. This makes GRL a versatile framework for optimizing quantum circuits with applications in hardware-specific optimizations and variational quantum algorithms. The code is available at: https://github.com/Aqasch/Gadget_RL
- Tetris-adapt-vqe: An adaptive algorithm that yields shallower, denser circuit ansätze. Physical Review Research, 6(1):013254, 2024.
- The option-critic architecture. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, AAAI’17, pp. 1726–1734. AAAI Press, 2017.
- A strategy for quantum algorithm design assisted by machine learning. New Journal of Physics, 16(7):073017, 2014.
- Quantum approximate optimization via learning-based adaptive optimization. Communications Physics, 7(1):83, 2024.
- Bootstrap learning via modular concept discovery. In Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, IJCAI ’13, pp. 1302–1309. AAAI Press, 2013. ISBN 9781577356332.
- Evolutionary quantum architecture search for parametrized quantum circuits. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, pp. 2190–2195, 2022.
- Dreamcoder: growing generalizable, interpretable knowledge with wake–sleep bayesian program learning. Philosophical Transactions of the Royal Society A, 381, 2020. URL https://api.semanticscholar.org/CorpusID:219687434.
- Diversity is all you need: Learning skills without a reward function. In International Conference on Learning Representations, 2019. URL https://openreview.net/forum?id=SJx63jRqFm.
- Overlap-adapt-vqe: practical quantum chemistry on quantum computers via overlap-guided compact ansätze. Communications Physics, 6(1):192, 2023.
- Stochastic neural networks for hierarchical reinforcement learning. In International Conference on Learning Representations, 2017. URL https://openreview.net/forum?id=B1oK8aoxe.
- Reinforcement learning for variational quantum circuits design. preprint arXiv:2409.05475, 2024.
- Quantum circuit optimization with deep reinforcement learning. arXiv preprint arXiv:2103.07585, 2021.
- META LEARNING SHARED HIERARCHIES. In International Conference on Learning Representations, 2018. URL https://openreview.net/forum?id=SyX0IeWAW.
- Variational intrinsic control, 2016. URL https://arxiv.org/abs/1611.07507.
- An adaptive variational algorithm for exact molecular simulations on a quantum computer. Nature communications, 10(1):3007, 2019.
- Grover, L. K. A fast quantum mechanical algorithm for database search. In Proceedings of the twenty-eighth annual ACM symposium on Theory of computing, pp. 212–219, 1996.
- Gsqas: graph self-supervised quantum architecture search. Physica A: Statistical Mechanics and its Applications, 630:129286, 2023a.
- A gnn-based predictor for quantum architecture search. Quantum Information Processing, 22(2):128, 2023b.
- A meta-trained generator for quantum architecture search. EPJ Quantum Technology, 11(1):44, 2024.
- IBM Quantum Documentation. transpiler — docs.quantum.ibm.com. https://docs.quantum.ibm.com/api/qiskit/transpiler, 2024. [Accessed 30-10-2024].
- Kingma, D. P. Adam: A method for stochastic optimization. preprint arXiv:1412.6980, 2014.
- Artificial intelligence and machine learning for quantum technologies. Phys. Rev. A, 107:010101, Jan 2023. doi: 10.1103/PhysRevA.107.010101. URL https://link.aps.org/doi/10.1103/PhysRevA.107.010101.
- Ddco: Discovery of deep continuous options for robot learning from demonstrations. preprint arXiv:1710.05421, 2017. URL https://api.semanticscholar.org/CorpusID:11787854.
- Kundu, A. Reinforcement learning-assisted quantum architecture search for variational quantum algorithms. preprint arXiv:2402.13754, 2024.
- Hamiltonian-oriented homotopy quantum approximate optimization algorithm. Physical Review A, 109(2):022611, 2024a.
- Kanqas: Kolmogorov arnold network for quantum architecture search. preprint arXiv:2406.17630, 2024b.
- Quantum architecture search via deep reinforcement learning. preprint arXiv:2104.07715, 2021.
- Continuous evolution for efficient quantum architecture search. EPJ Quantum Technology, 11(1):54, 2024.
- Temporal abstraction in reinforcement learning with the successor representation. J. Mach. Learn. Res., 24(1), March 2024. ISSN 1532-4435.
- Implementing grover’s algorithm on the ibm quantum computers. In 2018 IEEE international conference on big data (big data), pp. 2531–2537. IEEE, 2018.
- Melo, F. S. Convergence of q-learning: A simple proof. Institute Of Systems and Robotics, Tech. Rep, pp. 1–4, 2001.
- Cost explosion for efficient reinforcement learning optimisation of quantum circuits. In 2023 IEEE International Conference on Rebooting Computing (ICRC), pp. 1–5. IEEE, 2023.
- Realization of a scalable shor algorithm. Science, 351(6277):1068–1070, 2016.
- Data-efficient hierarchical reinforcement learning. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18, pp. 3307–3317, Red Hook, NY, USA, 2018. Curran Associates Inc.
- Perturbed ferromagnetic chain: Tunable test of hardness in the transverse-field ising model. Phys. Rev. A, 105:022410, Feb 2022. doi: 10.1103/PhysRevA.105.022410. URL https://link.aps.org/doi/10.1103/PhysRevA.105.022410.
- Reinforcement learning for optimization of variational quantum circuit architectures. Advances in Neural Information Processing Systems, 34:18182–18194, 2021.
- Reinforcement learning assisted recursive qaoa. EPJ Quantum Technology, 11(1):6, 2024a.
- Curriculum reinforcement learning for quantum architecture search under hardware errors. preprint arXiv:2402.03500, 2024b.
- Pierce, B. C. Types and programming languages. MIT Press, Cambridge, Mass, 2002. ISBN 978-0-262-16209-8. URL 10.5555/509043.
- Powell, M. J. A direct search optimization method that models the objective and constraint functions by linear interpolation. Springer, 1994.
- Quantum Circuit Optimization with AlphaTensor. preprint arXiv:2402.14396, 2 2024.
- A quantum information theoretic analysis of reinforcement learning-assisted quantum architecture search. preprint arXiv:2404.06174, 2024.
- Discovering quantum circuit components with program synthesis. Machine Learning: Science and Technology, 5(2):025029, may 2024. doi: 10.1088/2632-2153/ad4252. URL https://dx.doi.org/10.1088/2632-2153/ad4252.
- Shor, P. W. Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM review, 41(2):303–332, 1999.
- Hybrid quantum-classical algorithm for the transverse-field Ising model in the thermodynamic limit. preprint arXiv:2310.07600, 10 2023.
- Quantum architecture search with unsupervised representation learning. preprint arXiv:2401.11576, 2024.
- qubit-adapt-vqe: An adaptive algorithm for constructing hardware-efficient ansätze on a quantum processor. PRX Quantum, 2(2):020310, 2021.
- Alpharouter: Quantum circuit routing with reinforcement learning and tree search. preprint arXiv:2410.05115, 2024.
- Automated gadget discovery in the quantum domain. Machine Learning: Science and Technology, 4(3):035043, sep 2023. doi: 10.1088/2632-2153/acf098. URL https://dx.doi.org/10.1088/2632-2153/acf098.
- Scipy 1.0: fundamental algorithms for scientific computing in python. Nature methods, 17(3):261–272, 2020.
- Quantum architecture search via continual reinforcement learning. preprint arXiv:2112.05779, 2021.
- Neural predictor based quantum architecture search. Machine Learning: Science and Technology, 2(4):045027, 2021.
- Quantum approximate optimization algorithm: Performance, mechanism, and implementation on near-term devices. Physical Review X, 10(2):021067, 2020.
- Adaptive quantum approximate optimization algorithm for solving combinatorial problems on a quantum computer. Physical Review Research, 4(3):033029, 2022.