Variational Quantum Circuits for Deep Reinforcement Learning (1907.00397v3)

Published 30 Jun 2019 in cs.LG, cs.AI, quant-ph, and stat.ML

Abstract: The state-of-the-art machine learning approaches are based on classical von Neumann computing architectures and have been widely used in many industrial and academic domains. With the recent development of quantum computing, researchers and tech-giants have attempted new quantum circuits for machine learning tasks. However, the existing quantum computing platforms are hard to simulate classical deep learning models or problems because of the intractability of deep quantum circuits. Thus, it is necessary to design feasible quantum algorithms for quantum machine learning for noisy intermediate scale quantum (NISQ) devices. This work explores variational quantum circuits for deep reinforcement learning. Specifically, we reshape classical deep reinforcement learning algorithms like experience replay and target network into a representation of variational quantum circuits. Moreover, we use a quantum information encoding scheme to reduce the number of model parameters compared to classical neural networks. To the best of our knowledge, this work is the first proof-of-principle demonstration of variational quantum circuits to approximate the deep $Q$-value function for decision-making and policy-selection reinforcement learning with experience replay and target network. Besides, our variational quantum circuits can be deployed in many near-term NISQ machines.

Citations (260)

View on Semantic Scholar

Summary

The paper demonstrates integrating variational quantum circuits with DRL, effectively mapping experience replay and target networks onto quantum circuits.
Experiments in frozen-lake and cognitive-radio environments reveal reduced parameters and fast convergence with robust performance against quantum noise.
The methodology paves the way for scalable quantum DRL, offering resource-efficient models specifically designed for noisy intermediate-scale quantum devices.

An Overview of "Variational Quantum Circuits for Deep Reinforcement Learning"

The paper "Variational Quantum Circuits for Deep Reinforcement Learning" investigates the intersection of quantum computing with deep reinforcement learning (DRL) paradigms, proposing a method which reshapes classical DRL algorithms using variational quantum circuits (VQC). The aim is to adapt these complex algorithms to Noisy Intermediate-Scale Quantum (NISQ) devices, acknowledging the potential of quantum computing to address challenges that classical approaches may struggle with, such as the intractability of certain deep neural networks.

Core Contributions and Methodology

The authors propose a novel approach by integrating variational quantum circuits into the framework of DRL, particularly in the context of experience replay and target networks—elements crucial to stabilizing the learning process in deep Q-learning (DQL). They demonstrate a proof-of-concept application of VQCs to approximate deep Q-value functions, facilitating decision-making and policy selection.

Key technical accomplishments include:

Encoding DRL Components in VQCs: The work effectively maps important components of DRL algorithms onto quantum circuits, fostering an efficient iterative optimization process. Experience replay is simulated using quantum models that store and replay historical action samples.
Integration with NISQ Devices: The paper focuses on designing quantum algorithms sensitive to the noise restrictions of present-day quantum computers, greatly enhancing their feasibility for real-world tasks.

Experimental Evaluation

The authors conduct experiments within the frozen-lake and cognitive-radio environments to validate their theoretical propositions. These environments, chosen for their simplicity in state and action spaces, allow for feasible demonstration on currently available quantum simulators and small quantum processors. Key results include:

Implementation Feasibility: The VQCs were successfully implemented with a reduced number of parameters compared to classical DQL models, demonstrating a substantial advantage in memory efficiency.
Robust Noise Performance: Experiments conducted with noise models derived from real NISQ devices underscored the robustness of VQCs against quantum computing noise.
Fast Convergence in Simple Environments: The VQCs showed effective convergence to optimal solutions in simple environments within fewer iterations than traditional DRL models.

Implications and Future Directions

The implications of this work are manifold:

Advancement in Quantum DRL: The research highlights the potential of quantum circuits in pushing the boundaries of reinforcement learning, suggesting that VQCs might find use in larger-scale, complex environments—especially as quantum technology continues to advance.
Memory and Parameter Efficiency: By demonstrating the reduction in parameters needed for learning representations, the paper suggests that quantum circuits could provide a more resource-efficient path forward for models that may burgeon under the classical computational paradigm.

For future work, the authors propose extending their framework to more complex environments and incorporating advanced encoding techniques, such as amplitude encoding, which could further reduce parameter space complexity. Additionally, experimentation on more extensive quantum hardware could shed light on practical scalability aspects.

In conclusion, this paper is an important step toward leveraging quantum computing in enhancing and potentially transforming the landscape of deep reinforcement learning, focusing on making complex, parameter-heavy and traditionally resource-intensive AI tasks more tractable using innovative quantum algorithms.

PDF Markdown