Generative quantum machine learning via denoising diffusion probabilistic models

Published 9 Oct 2023 in quant-ph, cs.AI, and cs.LG | arXiv:2310.05866v4

Abstract: Deep generative models are a key enabling technology for computer vision, text generation, and LLMs. Denoising diffusion probabilistic models (DDPMs) have recently gained much attention due to their ability to generate diverse, high-quality samples in many computer vision tasks, as well as their flexible model architectures and relatively simple training scheme. Quantum generative models, empowered by entanglement and superposition, have brought new insight to learning classical and quantum data. Inspired by the classical counterpart, we propose the quantum denoising diffusion probabilistic model (QuDDPM) to enable efficiently trainable generative learning of quantum data. QuDDPM adopts sufficient layers of circuits to guarantee expressivity, while it introduces multiple intermediate training tasks as interpolation between the target distribution and noise to avoid barren plateaus and guarantee efficient training. We provide bounds on the learning error and demonstrate QuDDPM's capability in learning a correlated quantum noise model, quantum many-body phases, and the topological structure of quantum data. The results provide a paradigm for versatile and efficient quantum generative learning.


Summary

  • The paper presents a novel QuDDPM framework that extends classical diffusion models to generate quantum state ensembles via forward scrambling and backward measurement-based denoising.
  • The study employs incremental training with intermediate loss functions to mitigate barren plateaus, optimizing parameterized quantum circuits for each diffusion step.
  • The paper demonstrates improved performance over benchmark quantum models in tasks such as learning correlated quantum noise and many-body phase transitions.

Quantum Denoising Diffusion Probabilistic Models (QuDDPMs) extend the principles of classical denoising diffusion probabilistic models to the quantum domain, providing a framework for learning distributions of quantum states, referred to as quantum data ensembles. Unlike classical DDPMs, which operate on probability distributions over classical data, QuDDPMs aim to generate samples from a target distribution $D_0$ consisting of quantum states $\{\ket{\psi_k}\}$. This approach leverages the structure of diffusion models to enable trainable generative learning for complex quantum data.

Methodology of QuDDPM

The QuDDPM framework consists of two main processes: a forward quantum scrambling process and a backward measurement-based denoising process.

Forward Quantum Scrambling Process

The forward process transforms samples from the target quantum data ensemble $D_0$ into an ensemble $D_T$ of random noise states (whose ensemble average is close to the maximally mixed state) over a sequence of $T$ steps.

  1. Start with a set of quantum states $\{\ket{\psi_k^{(0)}}\}$ drawn from the target distribution $D_0$.
  2. For each state $\ket{\psi_k^{(0)}}$, apply a sequence of $T$ random unitary operations $U_t^{(k)}$ drawn from a suitable distribution (e.g., Haar random or unitary $t$-designs). This defines a sequence of intermediate ensembles $D_t = \{\ket{\psi_k^{(t)}} = U_t^{(k)} \dots U_1^{(k)} \ket{\psi_k^{(0)}}\}_k$ for $t = 1, \dots, T$.
  3. Each step $t$ adds noise, gradually scrambling the initial structure. The ensemble $D_T$ approximates a distribution of random quantum states (e.g., states drawn uniformly from the Hilbert space, approaching the maximally mixed state in the ensemble average). The sequence $D_0, D_1, \dots, D_T$ serves as an interpolation between the target data distribution and the noise distribution; a minimal numerical sketch of this forward scrambling follows the list.
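
The sketch below illustrates the forward process, assuming each step applies a weak global random unitary $e^{-i\epsilon H}$ with a Gaussian random Hermitian generator; the paper instead uses shallow local random circuits per step, so this is an illustration of the idea rather than the paper's construction.

```python
# Minimal sketch of forward scrambling: T steps of weak global random unitaries
# (an illustrative stand-in for the paper's shallow local random circuits).
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(0)

def random_step_unitary(dim, eps=0.1):
    """Weak random unitary exp(-i*eps*H), with H a random Hermitian matrix."""
    a = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    h = (a + a.conj().T) / 2
    return expm(-1j * eps * h)

def forward_scramble(psi0, T=20, eps=0.1):
    """Return the trajectory [psi^(0), ..., psi^(T)] for one data sample."""
    trajectory = [psi0]
    psi = psi0
    for _ in range(T):
        psi = random_step_unitary(psi0.shape[0], eps) @ psi
        trajectory.append(psi)
    return trajectory

# Example: scramble |0...0> on 3 qubits and check how far it has diffused
n = 3
psi0 = np.zeros(2**n, dtype=complex); psi0[0] = 1.0
traj = forward_scramble(psi0)
print(abs(np.vdot(psi0, traj[-1]))**2)  # fidelity with the initial state after T steps
```

Running each data sample through such a routine yields the interpolating ensembles $D_1, \dots, D_T$ that serve as intermediate training targets.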

Backward Measurement-Based Denoising Process

The backward process aims to reverse the forward scrambling, starting from the noise ensemble and progressively denoising it to approximate the target ensemble $D_0$. This involves trainable parameterized quantum circuits (PQCs).

  1. Initialize the process by sampling states $\ket{\tilde{\psi}^{(T)}}$ from a noise distribution $\tilde{S}_T$ that approximates $D_T$.
  2. For steps $t = T, T-1, \dots, 1$, apply a trainable unitary $\tilde{U}_t(\theta_t)$ to the current system state $\ket{\tilde{\psi}^{(t)}}$ and ancillary qubits initialized to $\ket{0}^{\otimes n_a}$. The PQC $\tilde{U}_t(\theta_t)$ acts on the joint system-ancilla space.
  3. Perform a projective measurement on the ancilla qubits. The post-measurement state of the system qubits, conditioned on a specific outcome (typically $\ket{0}^{\otimes n_a}$), becomes the input for the next step, $\ket{\tilde{\psi}^{(t-1)}}$. Measurements are crucial for the contractive nature of the denoising map while preserving state purity.
  4. The sequence of operations generates ensembles $\tilde{D}_t$ intended to approximate the forward ensembles $D_t$. The final output ensemble $\tilde{D}_0 = \{\ket{\tilde{\psi}^{(0)}}\}$ approximates the target distribution $D_0$; a sketch of a single denoising step follows this list.
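
Below is a minimal sketch of one backward denoising step, assuming a toy exponential-of-generators ansatz and post-selection of the ancilla on $\ket{0\dots0}$; the ansatz, function names, and parameter values are illustrative, not taken from the paper.

```python
# Minimal sketch of one backward denoising step: apply a parameterized unitary to
# system ⊗ ancilla, then keep the system state conditioned on ancilla = |0...0>.
# The ansatz below is a toy choice, not the circuit used in the paper.
import numpy as np
from scipy.linalg import expm

def parameterized_unitary(theta, dim, seed=42):
    """Toy ansatz U(theta) = exp(-i * sum_k theta_k H_k) with fixed random Hermitian H_k."""
    rng = np.random.default_rng(seed)  # fixed generators so U depends only on theta
    gen = np.zeros((dim, dim), dtype=complex)
    for th in theta:
        a = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
        gen += th * (a + a.conj().T) / 2
    return expm(-1j * gen)

def denoise_step(psi_sys, theta, n_anc=1):
    """One measurement-based denoising step; returns the post-selected system state."""
    d_sys, d_anc = psi_sys.shape[0], 2**n_anc
    anc0 = np.zeros(d_anc); anc0[0] = 1.0
    joint = np.kron(psi_sys, anc0)                      # |psi>_sys ⊗ |0...0>_anc
    joint = parameterized_unitary(theta, d_sys * d_anc) @ joint
    branch = joint.reshape(d_sys, d_anc)[:, 0]          # project ancilla onto |0...0>
    p0 = np.vdot(branch, branch).real                   # probability of that outcome
    return branch / np.sqrt(p0), p0

# Example: denoise a single-qubit state with one ancilla qubit
psi, p0 = denoise_step(np.array([1.0, 0.0], dtype=complex), theta=np.array([0.2, -0.1]))
print(psi, p0)
```

Post-selecting on a single outcome keeps the system state pure, and the acceptance probability p0 determines the sampling overhead of that post-selection.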

Training Strategy and Mitigation of Barren Plateaus

Training QuDDPMs involves optimizing the parameters $\{\theta_t\}$ of the backward PQCs $\tilde{U}_t$. A key challenge in training deep PQCs is the barren plateau phenomenon, where gradients vanish exponentially with the number of qubits or circuit depth. QuDDPM addresses this using a divide-and-conquer strategy inherent in the diffusion model structure.

  1. Incremental Training: Training proceeds backward from $t = T$ down to $t = 1$. In cycle $(T+1-t)$, the parameters $\theta_t$ of the PQC $\tilde{U}_t$ are optimized.
  2. Intermediate Loss Functions: The objective at step $t$ is to minimize a distance metric between the generated ensemble $\tilde{D}_{t-1}$ (obtained by applying $\tilde{U}_T, \dots, \tilde{U}_t$ starting from noise $\tilde{S}_T$) and the corresponding forward-diffused ensemble $D_{t-1}$.
  3. Shallow Depth per Step: A single PQC transforming noise into the target data may require significant depth, potentially $\Omega(n)$ layers for $n$ qubits, leading to barren plateaus if trained end-to-end. By breaking the process into $T$ steps, where $T$ can be chosen strategically (e.g., $T \sim n/\log n$), each individual PQC $\tilde{U}_t$ can have relatively shallow depth (e.g., $O(\log n)$ layers). Training these shallower circuits individually mitigates the barren plateau problem and enables efficient optimization; the theoretical analysis suggests that the loss landscape for each step is better behaved than for a single deep circuit. A schematic of the incremental training loop is sketched after this list.
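
The schematic below sketches the incremental training loop. It assumes user-supplied callables `generate_backward(thetas, t)` (producing samples of the generated ensemble $\tilde{D}_{t-1}$ by running the backward steps from noise) and `ensemble_loss(gen, target)` (e.g., the MMD of the next section); both names, and the finite-difference gradient, are illustrative conveniences rather than the paper's implementation.

```python
# Schematic incremental training: optimize one diffusion step at a time, from t = T
# down to t = 1, against the corresponding forward-diffused ensemble D_{t-1}.
import numpy as np

def train_quddpm(forward_ensembles, generate_backward, ensemble_loss,
                 T, n_params, lr=0.1, epochs=200, shift=1e-3, seed=0):
    """forward_ensembles[t] holds samples of D_t; returns the trained parameters."""
    rng = np.random.default_rng(seed)
    thetas = {t: rng.normal(scale=0.1, size=n_params) for t in range(1, T + 1)}
    for t in range(T, 0, -1):                        # cycle (T + 1 - t) trains step t
        target = forward_ensembles[t - 1]            # forward ensemble D_{t-1}
        for _ in range(epochs):
            base = ensemble_loss(generate_backward(thetas, t), target)
            grad = np.zeros(n_params)
            for k in range(n_params):                # toy finite-difference gradient
                thetas[t][k] += shift
                grad[k] = (ensemble_loss(generate_backward(thetas, t), target) - base) / shift
                thetas[t][k] -= shift
            thetas[t] -= lr * grad
    return thetas
```

In a hardware setting the finite-difference loop would be replaced by parameter-shift or gradient-free optimization, but the step-by-step structure, with each step trained against its own intermediate target ensemble, is what keeps each optimized circuit shallow.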

Loss Functions for Quantum State Ensembles

Since analytical likelihood calculations used in classical DDPMs are generally intractable for quantum states, QuDDPM relies on distance metrics between quantum state ensembles, estimated via quantum measurements.

  1. Maximum Mean Discrepancy (MMD): MMD compares two distributions via the mean embeddings of their samples in a reproducing kernel Hilbert space. For QuDDPM, the kernel is often chosen based on fidelity, $k(\rho, \sigma) = \mathrm{Tr}(\rho\sigma) = |\langle\psi_\rho|\psi_\sigma\rangle|^2$ for pure states $\ket{\psi_\rho}, \ket{\psi_\sigma}$. The MMD loss requires estimating pairwise fidelities between states sampled from the target ensemble $D_{t-1}$ and the generated ensemble $\tilde{D}_{t-1}$.
  2. Wasserstein Distance (Quantum EMD): The Wasserstein-1 distance (or Earth Mover's Distance, EMD) measures the minimum cost of transporting one distribution into another. In the quantum context, using the infidelity $c(\ket{\phi}, \ket{\psi}) = 1 - |\langle\phi|\psi\rangle|^2$ as the cost function captures geometric structure in the distribution of quantum states.
  3. Estimation: Both the MMD and the Wasserstein distance require estimating pairwise fidelities $|\langle\psi_k|\tilde{\psi}_j\rangle|^2$ between states from the two ensembles. This can be done with the SWAP test or related quantum algorithms on a quantum computer, requiring multiple copies of the states or controlled operations. Both losses are sketched numerically after this list.
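
Both losses can be sketched classically for ensembles stored as state-vector arrays, with fidelities computed exactly rather than estimated from measurements; the helper names below are illustrative. The Wasserstein-1 distance between two equal-size, uniformly weighted ensembles reduces to an optimal assignment problem, which `scipy.optimize.linear_sum_assignment` solves.

```python
# Minimal sketches of the two ensemble losses, assuming ensembles are given as
# rows of complex state-vector arrays and fidelities are computed exactly
# (on hardware they would instead be estimated, e.g., with SWAP tests).
import numpy as np
from scipy.optimize import linear_sum_assignment

def fidelity_matrix(A, B):
    """F[i, j] = |<A_i|B_j>|^2 for state vectors stored as rows of A and B."""
    return np.abs(A.conj() @ B.T) ** 2

def mmd_squared(gen, target):
    """MMD^2 with the fidelity kernel k(psi, phi) = |<psi|phi>|^2 (simple biased estimator)."""
    return (fidelity_matrix(gen, gen).mean()
            + fidelity_matrix(target, target).mean()
            - 2.0 * fidelity_matrix(gen, target).mean())

def wasserstein_1(gen, target):
    """W1 between equal-size uniform ensembles with infidelity cost 1 - |<phi|psi>|^2,
    computed as an optimal assignment (valid because both measures are uniform)."""
    cost = 1.0 - fidelity_matrix(gen, target)
    rows, cols = linear_sum_assignment(cost)
    return cost[rows, cols].mean()

# Example: two batches of Haar-random single-qubit states
rng = np.random.default_rng(1)
def random_states(m, dim=2):
    v = rng.normal(size=(m, dim)) + 1j * rng.normal(size=(m, dim))
    return v / np.linalg.norm(v, axis=1, keepdims=True)
print(mmd_squared(random_states(40), random_states(40)),
      wasserstein_1(random_states(40), random_states(40)))
```

For a fixed number of samples per ensemble, both losses need all pairwise fidelities, which is what drives the measurement cost discussed under Implementation Considerations.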

Applications and Performance

The paper demonstrates QuDDPM's capabilities on several quantum learning tasks:

  1. Learning Correlated Quantum Noise: QuDDPM effectively learned the distribution of states produced by applying probabilistic coherent errors (e.g., $XX$ and $ZZ$ rotations with certain probabilities) to an initial state such as $\ket{0}^{\otimes n}$. This is relevant for characterizing noise in quantum devices.
  2. Learning Quantum Many-Body Phases: The model learned to generate ground states of the 1D transverse-field Ising model (TFIM) within its ferromagnetic phase, showcasing its ability to capture the correlations typical of many-body systems.
  3. Learning Topological Structures: Using the Wasserstein distance as the loss function, QuDDPM learned to generate an ensemble of single-qubit states forming a specific topological structure (a ring in the $X$-$Z$ plane of the Bloch sphere; a toy construction of such a target ensemble is sketched below). This highlights its sensitivity to the geometric arrangement of states in Hilbert space.
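
As an illustration of the topological-structure task, the snippet below builds a toy target ensemble of single-qubit states whose Bloch vectors lie on a ring in the $X$-$Z$ plane; the exact ring used in the paper may be parameterized differently.

```python
# Toy construction of a target ensemble for the topological-structure task:
# single-qubit pure states whose Bloch vectors form a ring in the X-Z plane.
import numpy as np

def ring_states(m, seed=0):
    """Sample m states cos(a/2)|0> + sin(a/2)|1> with random angle a; real amplitudes
    give Bloch vectors (sin a, 0, cos a), i.e., points on the X-Z great circle."""
    rng = np.random.default_rng(seed)
    angles = rng.uniform(0, 2 * np.pi, size=m)
    return np.stack([np.cos(angles / 2), np.sin(angles / 2)], axis=1).astype(complex)

states = ring_states(100)
# Bloch-vector components: x = 2*Re(c0* c1), z = |c0|^2 - |c1|^2
x = 2 * np.real(states[:, 0].conj() * states[:, 1])
z = np.abs(states[:, 0])**2 - np.abs(states[:, 1])**2
assert np.allclose(x**2 + z**2, 1.0)   # every point lies on the unit circle in the X-Z plane
```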

Numerical simulations indicated that QuDDPM outperformed benchmark quantum generative models, such as generalized versions of QuGAN and quantum direct transport (QuDT), in generating target quantum state ensembles, in particular demonstrating better sample quality and training stability. Theoretical bounds on the learning error were also provided, connecting the performance to the number of diffusion steps $T$, the circuit depth per step, and the number of training samples.

Implementation Considerations

Implementing QuDDPM involves several practical choices:

  • Number of Diffusion Steps ($T$): A larger $T$ allows for shallower circuits per step, potentially improving trainability, but increases the total number of training cycles and circuit executions. The choice often involves a trade-off, with $T \sim n/\log n$ suggested as a heuristic.
  • PQC Architecture: The expressivity and entanglement capability of the PQCs $\tilde{U}_t$ are crucial. Hardware-efficient ansätze or problem-specific designs might be employed.
  • Ancilla Qubits: The backward denoising process requires ancilla qubits for measurements. The number of ancillas affects the complexity and the nature of the effective quantum channel implemented by each step.
  • Measurement Cost: Estimating the loss function (MMD or Wasserstein) requires numerous fidelity estimations (e.g., via SWAP tests), which can be resource-intensive on current quantum hardware; a sketch of the shot cost of a single SWAP-test estimate follows this list.
  • Computational Resources: Training involves repeated execution of quantum circuits (forward process simulation or execution, backward PQC execution, and measurement for loss estimation) coupled with classical optimization of PQC parameters. This requires significant classical and quantum computational resources.
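
To make the measurement cost concrete, the following sketch simulates a SWAP-test estimate of a single fidelity $|\langle\psi|\phi\rangle|^2$, using the standard fact that the test's ancilla reads 0 with probability $(1+F)/2$. This is a classical simulation for illustration, not a hardware recipe; the shot count makes the per-fidelity overhead explicit.

```python
# Simulated SWAP-test fidelity estimate: sample the ancilla outcome with
# probability p(0) = (1 + F)/2 and invert to recover F = |<psi|phi>|^2.
import numpy as np

def swap_test_fidelity(psi, phi, shots=1000, seed=0):
    """Monte-Carlo estimate of F from simulated SWAP-test outcomes."""
    rng = np.random.default_rng(seed)
    f_true = np.abs(np.vdot(psi, phi)) ** 2
    p0 = (1.0 + f_true) / 2.0                      # Pr(ancilla = 0) in the SWAP test
    zeros = rng.binomial(shots, p0)
    return 2.0 * zeros / shots - 1.0               # invert p0 = (1 + F)/2

psi = np.array([1.0, 0.0], dtype=complex)
phi = np.array([np.cos(0.3), np.sin(0.3)], dtype=complex)
print(swap_test_fidelity(psi, phi), np.abs(np.vdot(psi, phi))**2)  # estimate vs. exact
```

Each loss evaluation needs on the order of (samples per ensemble)$^2$ such estimates, each with its own shot budget, which is the overhead flagged under Measurement Cost.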

In conclusion, QuDDPM offers a structured and potentially more trainable approach to quantum generative learning compared to monolithic PQC-based models like QuGANs, particularly for complex distributions of quantum states. Its effectiveness stems from the divide-and-conquer strategy inherent in diffusion models, which helps mitigate barren plateaus by breaking the generation task into smaller, manageable denoising steps. The framework has shown promise in learning physically relevant quantum data distributions, although practical implementation faces challenges related to measurement overhead and resource requirements.
