Quantum Latent Compression with Adaptive Observables
- The paper presents a novel hybrid quantum-classical diffusion model that integrates amplitude encoding for latent compression with adaptive observables for global feature extraction.
- It employs a parameterized quantum circuit and trainable non-local Hermitian observables to capture multiscale correlations and mitigate mode collapse in generative tasks.
- Empirical results on the MNIST dataset demonstrate improved stability and expressibility in multi-class image synthesis under near-term quantum hardware constraints.
Quantum Latent Compression with Adaptive Non-local Observables (ANO) is a methodology developed to address the scalability and expressibility limitations of quantum generative models in high-dimensional and multi-modal data domains. By leveraging amplitude encoding for quantum latent-space compression and extracting trainable, non-local features via adaptive non-local observables within a hybrid quantum-classical U-Net diffusion framework, this approach facilitates the representation of complex structures in generative modeling tasks, such as image synthesis with the full MNIST dataset. Distinctive to this method are the integration of parameterized quantum circuits for latent evolution and the measurement of K trainable non-local Hermitian observables whose expectations inform the final output via a classical decoder and skip-connections, thereby enhancing multi-class generation stability and mitigating mode collapse under near-term quantum hardware restrictions (Jo et al., 3 Feb 2026).
1. Quantum Encoding and Latent-Space Compression
This framework begins with a classical-to-quantum encoder , where the original 256-dimensional image vector is mapped into an -dimensional normalized complex latent vector , with . The encoding process utilizes amplitude encoding, implemented by a unitary such that is mapped to . The state preparation is achieved using a tree of multi-controlled and phase gates, or an equivalent state-preparation circuit, with depth scaling as .
The reduction from 256 to 16 dimensions yields a compression ratio , providing significant latent-space compression that preserves global informational content through quantum superposition.
2. Adaptive Non-Local Observables: Definition and Measurement
Following evolution of the quantum state by a parameterized quantum circuit , the extraction of features is performed by measuring a set of trainable Hermitian observables , each defined as for arbitrary complex matrices . Alternatively, can be expanded in the -qubit Pauli basis as , capturing non-local multi-qubit correlations through higher-weight Pauli strings.
For a quantum state , the measurement protocol computes , for , where . The resulting vectors, , embody the trainable non-local features.
3. Integration into the Hybrid Quantum-Classical U-Net Diffusion Model
At each reverse diffusion step , the process involves several stages:
- A noised image serves as input.
- A classical encoder produces .
- Quantum state preparation and dynamic evolution yield .
- Measurement of adaptive non-local observables gives feature vector .
- Intermediate classical features are produced via a skip-connection: .
- The combined feature is passed into the classical decoder.
- The decoder outputs the predicted noise , or equivalently, the mean of .
This mechanism embeds the quantum circuit (with ANO readout) as the network’s bottleneck, introducing global quantum correlations to the denominator of the generative model.
4. Training, Loss Function, and Gradient Estimation
The primary training objective is the simplified Denoising Diffusion Probabilistic Model (DDPM) loss:
where and . The U-Net, equipped with the quantum bottleneck, outputs the estimate .
Optimization is conducted in a hybrid fashion:
- Classical parameters are updated via backpropagation through the concatenated classical-quantum feature space.
- Quantum parameters are updated using the parameter-shift rule for unbiased gradient estimation:
with analogous treatment for .
5. Forward and Reverse Diffusion, Feature Roles
The forward diffusion process is purely classical and Gaussian:
The reverse process integrates the quantum bottleneck:
The quantum latent, with its 256→16 dimensionality reduction, enables the PQC to efficiently learn a compact kernel for reverse diffusion, while the ANO measurements yield non-local features unavailable to purely classical or local-measurement hybrid models. This richer feature set improves generative fidelity and supports stable multi-class output.
6. Empirical Performance and Limitations
On the full MNIST dataset (digits 0–9), structurally coherent and class-recognizable images were generated for all classes using this hybrid architecture. Hardware constraints limited the image resolution, but evidence supports the method’s potential for advancing generative capabilities on current and near-term quantum devices ("NISQ era"). The architecture’s adaptive measurement layer mitigated mode collapse and enhanced the expressibility of learned distributions, outperforming purely classical or local-measurement hybrids in multi-modal generative settings (Jo et al., 3 Feb 2026).
7. Pipeline Summary and Significance
The central innovation is the full hybrid quantum-classical diffusion pipeline, structured as:
- Classical encoder amplitude-encoded quantum latent parameterized dynamics
- adaptive non-local observables measurement classical decoder with skip-connection
- reverse-step prediction training via hybrid loss, leveraging parameter-shift gradient estimation for quantum parameters.
This configuration demonstrates a practical route to incorporating quantum latent-space compression and globally correlated feature extraction within large-scale generative models, particularly in constrained NISQ hardware regimes. The explicit combination of skip connections and adaptive trainable quantum measurements enables information preservation and expressive generation not accessible to prior architectures lacking these elements (Jo et al., 3 Feb 2026).