CVBM: Continuous-Variable Quantum Model
- CVBM is a quantum generative model that employs continuous-variable photonic circuits and both Gaussian and non-Gaussian gates to directly model real-valued distributions.
- It trains by minimizing statistical distances like the squared Maximum Mean Discrepancy between generated and target distributions using parameter-shift gradients and stochastic gradient descent.
- Its design overcomes exponential resource overheads of discrete models, enabling scalable simulation and efficient learning of multimode continuous data.
A Continuous-Variable Born Machine (CVBM) is a quantum generative model that parameterizes continuous-valued probability distributions using multimode photonic quantum circuits. By exploiting the native infinite-dimensional Hilbert space and measurement protocols of continuous-variable (CV) quantum systems—such as modes of light—CVBMs model, generate, and learn continuous probability densities over directly, without recourse to discretization. This architecture stands in contrast to discrete-variable Born machines, which are inherently limited to modeling distributions over bitstrings and suffer exponential resource overheads when approximating real-valued targets. CVBMs employ parameterized quantum circuits composed of Gaussian and, where necessary, non-Gaussian gates and are trained by minimizing a statistical distance (commonly the squared Maximum Mean Discrepancy, MMD) between the generated and target distributions, typically via stochastic gradient descent that leverages parameter-shift rules or automatic differentiation of Gaussian moments (Čepaitė et al., 2020, Kolarovszki et al., 11 Mar 2026, Kolarovszki et al., 2024).
1. Mathematical Structure and Probability Model
The core element of a CVBM is a family of multimode photonic quantum states
where is a parameterized CV quantum circuit and is the mode (feature) count. Sampling is performed via homodyne detection in the quadrature basis, typically at phase , yielding continuous real vectors according to the Born rule:
with and the eigenstate of (Čepaitė et al., 2020, Kolarovszki et al., 2024).
The position-basis wavefunction's overlap with Fock states is given by
where are the physicists' Hermite polynomials, facilitating explicit evaluation and classical simulation in truncated Fock spaces (Kolarovszki et al., 2024).
2. Quantum Circuit Architecture
CVBMs are realized with architectures comprising alternating layers of Gaussian and non-Gaussian gates. The parameterized circuit can be captured by:
where:
- includes Gaussian gates:
- Displacement:
- Squeezing:
- Phase rotation:
- Beam splitter:
- incorporates non-Gaussian elements:
- Cubic-phase:
- Cross-Kerr:
- Kerr:
The full variational parameter set collects all gate arguments across layers and modes (Čepaitė et al., 2020, Kolarovszki et al., 2024).
For circuits restricted to Gaussian gates, the resulting states are pure Gaussian and completely specified by their mean vector and covariance matrix , with successive layers updating and via real-symplectic affine maps (Kolarovszki et al., 11 Mar 2026).
3. Training Procedures: Loss Functions and Gradient Estimation
Training aims to match the generated distribution to a target using the squared Maximum Mean Discrepancy (MMD):
where is a positive-definite kernel, typically a Gaussian (Čepaitė et al., 2020, Kolarovszki et al., 2024, Kolarovszki et al., 11 Mar 2026).
Gradients of the loss with respect to circuit parameters are computed via the quantum parameter-shift rule for those parameters (notably Gaussian gates):
where , with gate-dependent shifts and scaling. For Gaussian gates, analytic shift rules are available; for non-Gaussian gates, finite-difference methods are used—an approach that is markedly sample-inefficient (Čepaitė et al., 2020, Kolarovszki et al., 2024).
Stochastic gradient descent (SGD) is employed in practice, with gradients estimated from finite samples (“shots”) drawn from parameter-shifted circuits. Empirically, shots per gradient estimate suffice for reliable optimization using Adam (Kolarovszki et al., 2024).
4. Classical Simulation and Resource Scaling
Classical simulation of CVBM sampling proceeds by representing in a truncated multimode Fock basis (cutoff ), enabling joint position probabilities to be constructed as
Sampling is performed sequentially by conditioning modes; marginal densities for each mode are polynomials times , enabling inverse-transform sampling (Kolarovszki et al., 2024).
The computational complexity of this procedure grows exponentially with the number of modes , with the observed empirical scaling (Kolarovszki et al., 2024). On physical quantum hardware, this cost is absent, but practical quantum computation remains limited by decoherence, shot noise, and currently available non-Gaussian gate fidelities.
For circuit resources, a single CV mode encodes an infinite-dimensional variable; as a result, CVBMs avoid the exponential scaling associated with discretizing continuous variables in discrete-variable circuits. For -mode, -layer circuits, the total gate and parameter counts scale as , with gradient estimation requiring (number of parameters) measurement shots per training step (Čepaitė et al., 2020).
In the special case of purely Gaussian CVBMs realized as Gaussian Boson Sampling (GBS) circuits, the state evolution and loss/gradient evaluation reduce to linear algebra on , allowing scaling and enabling classical training for hundreds of modes and parameters (Kolarovszki et al., 11 Mar 2026).
5. Empirical Results and Training Performance
Numerical experiments demonstrate the capacity of CVBMs to learn both classical and quantum continuous distributions (Čepaitė et al., 2020, Kolarovszki et al., 2024). Key findings include:
- Learning a 1D classical Gaussian: a single-mode CVBM with displacement and squeezing converges within iterations, MMD , closely matching the target. Discrete-variable Born machines with six qubits plateau in accuracy due to coarse discretization (Čepaitė et al., 2020).
- Learning multimode targets: circuits with alternations of non-Gaussian and Gaussian blocks were trained using up to four modes and $1500$ shots per gradient. Training converged in $100$–$600$ iterations, final joint distributions matching target densities (Kolarovszki et al., 2024).
- GBS-based CVBMs (“parity-GBBM”): demonstrated classical trainability with JAX on up to $805$-mode circuits, parameters, and datasets including genomic and image data. Model performance exceeded classic RBMs and Chow-Liu trees in matching reference statistics (Kolarovszki et al., 11 Mar 2026).
SGD with stochastic parameter-shift gradients was shown to be robust to gradient noise, converging reliably even with estimator fluctuations (Kolarovszki et al., 2024).
6. Noise and Robustness
The dominant noise mechanism on CV hardware is modeled as photon loss, represented as a beamsplitter of transmissivity . Numerical analyses show:
- Gaussian targets are robust to –$0.9$; learning remains effective for moderate loss.
- Non-Gaussian targets are more sensitive but show partial learnability under realistic noise (Čepaitė et al., 2020).
- In GBS-based architectures, classical training is unaffected by hardware noise, but quantum sampling for inference retains hardware-induced imperfections (Kolarovszki et al., 11 Mar 2026).
In practice, transmissivity may be calibrated directly by fitting the model to experimental data (Čepaitė et al., 2020).
7. Limitations, Variants, and Outlook
Several limitations and open questions are evident:
- Parameter-shift rules for non-Gaussian gates are absent, necessitating inefficient finite-difference gradients for these directions (Kolarovszki et al., 2024).
- The convergence of CVBM training depends critically on the chosen circuit ansatz, kernel, learning rate, and initialization. Optimal architectures for given data distributions are undetermined (Kolarovszki et al., 2024).
- For large mode count, classical simulation becomes infeasible; quantum-gradient-based training is expected to scale favorably with access to photonic hardware (Kolarovszki et al., 2024).
- The regime of shot budgets that ensure convergence is not yet quantitatively characterized; rigorous sample complexity analysis remains open (Kolarovszki et al., 2024).
Empirical evidence suggests CVBMs can be trained with practical shot counts using quantum stochastic gradient descent and can leverage hybrid approaches for model classes—such as GBS circuits with classical differentiability—to enable classically scalable training while sampling remains quantum (Kolarovszki et al., 11 Mar 2026).