Papers
Topics
Authors
Recent
Search
2000 character limit reached

QMaxCal Framework: Path-Space KL Regularization

Updated 23 June 2026
  • QMaxCal is a quantum control framework that regularizes open-system dynamics using path-space KL divergence, leveraging Girsanov’s theorem.
  • It introduces Wiener-KL and drift-variance regularizers to penalize controls that enhance environmental decoherence, enabling differentiable optimization.
  • Empirical tests on quantum benchmarks demonstrate up to 50% infidelity reduction and improved robustness under noise-model mismatches.

QMaxCal (“Quantum Maximum Caliber”) is a path-space Kullback–Leibler (KL) regularization framework for open quantum system control problems under decoherence. Its principled regularizers leverage Girsanov’s theorem to penalize controls that lead to trajectories with enhanced observable effects of the environment, thus driving the system toward states or subspaces with minimal decoherence. QMaxCal introduces two KL-based regularization terms—the Wiener-KL and drift-variance regularizers—complementing standard control fluence penalties, and produces closed-form, differentiable estimators for use in gradient-based and reinforcement learning optimization of time-dependent controls (Moody et al., 18 Jun 2026).

1. Open Quantum Control in Path Space

When a quantum system interacts with its environment, continuous monitoring of decoherence channels induces stochastic pure state trajectories governed by the stochastic Schrödinger equation (SSE). Each monitored decoherence channel kk yields a classical trajectory

dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)

where the drift αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle encodes the effect of the decoherence operator LkL_k and dWk(t)dW_k(t) is the Wiener increment. Different control protocols u(θ)(t)u^{(\theta)}(t) produce distinct drifts αk(t)\alpha_k(t) but are subject to the same environmental noise realizations.

Girsanov’s theorem provides a mechanism to relate the path probability distributions generated by different control protocols acting on the same open quantum system, by expressing a closed-form Radon–Nikodym derivative and KL divergence between their associated ensembles of measurement records. The QMaxCal framework exploits this result to regularize open-system control strategies, explicitly penalizing the projected impact of the system’s evolution onto the decoherence channels.

2. Girsanov-Based Path-Space KL Divergence

The classical version of Girsanov’s theorem addresses diffusions of the form

dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1

within a shared Wiener-noise probability space. The key result is that the KL divergence between trajectories generated by two drifts b(1)b^{(1)} and b(0)b^{(0)} is

dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)0

where dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)1. In the quantum-trajectory case, measurement records for each decoherence channel inherit this structure: under controls dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)2, the pathwise records are diffusions with drifts dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)3. The relative entropy reads

dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)4

Selecting appropriate reference measures produces motivated regularizers for quantum control objectives.

3. Regularizers: Wiener-KL and Drift-Variance

QMaxCal defines two primary path-space regularizers for a control protocol parameterized by dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)5:

Regularizer Reference Process Penalty Formulation
Wiener-KL (dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)6) Brownian motion (dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)7) dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)8
Drift-variance (dIk(t)=αk(t)dt+dWk(t)dI_k(t) = \alpha_k(t)\,dt + dW_k(t)9) Constant-drift process (αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle0) αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle1
  • Wiener-KL (αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle2): The reference is pure Brownian motion (zero drift). This regularizer penalizes the mean-square drift, incentivizing trajectories to approach the joint kernel αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle3—the “dark” or decoherence-free states under all channels.
  • Drift-variance (αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle4): The reference is the best-fit constant-drift process, minimizing over αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle5. For each channel, the optimal αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle6. The penalty quantifies the temporal variance of each drift about its mean, vanishing exactly for decoherence-free subspaces (DFS) with constant αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle7.

These KL-derived penalties differ qualitatively from standard fluence or pulse-smoothness regularization by acting directly on time-resolved observables of the system-environment interaction rather than on control differentiability or bandwidth.

4. Augmented Control Objective and Derivatives

QMaxCal’s objective for a state-transfer task from αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle8 to αk(t)=ψ(t)(Lk+Lk)ψ(t)\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle9 at fixed LkL_k0 is

LkL_k1

where LkL_k2, LkL_k3 is an optional fluence constraint, and expectations are over sampled SSE trajectories.

The gradients of the objective are computed by backpropagation through the sampled trajectories. For the regularizers:

  • LkL_k4
  • LkL_k5

The derivatives LkL_k6 are computed via automatic differentiation through the SSE numerical integrator, which also yields gradients for the fluence term.

5. Optimization Protocol

The gradient-based QMaxCal algorithm proceeds as follows:

  1. Parameter initialization: E.g., Fourier coefficients for each control channel.
  2. Trajectory sampling: Integrate the SSE for LkL_k7 trajectories LkL_k8 under LkL_k9.
  3. Observable accumulation: For each trajectory, record final-state fidelity and drifts dWk(t)dW_k(t)0.
  4. Estimator calculation: Compute sample means for the objective, dWk(t)dW_k(t)1, dWk(t)dW_k(t)2, and fluence.
  5. Objective and gradient computation: Use automatic differentiation to obtain dWk(t)dW_k(t)3 and dWk(t)dW_k(t)4.
  6. Parameter update: Apply a gradient descent step (e.g., Adam).

A reinforcement learning adaptation uses, e.g., PPO with the negative of the regularized fidelity as the reward.

6. Empirical Performance Across Quantum Benchmarks

QMaxCal was evaluated on five representative open quantum system benchmarks, with consistent comparison to unregularized gradient-based trajectory optimizers and RL-based PPO baselines:

  • Single-Qubit Amplitude Damping: dWk(t)dW_k(t)5 (with dWk(t)dW_k(t)6) contracted SSE-trajectory population variance by dWk(t)dW_k(t)7 at dWk(t)dW_k(t)8 (from dWk(t)dW_k(t)9 to u(θ)(t)u^{(\theta)}(t)0) and achieved up to u(θ)(t)u^{(\theta)}(t)1–u(θ)(t)u^{(\theta)}(t)2 percentage point fidelity improvement (u(θ)(t)u^{(\theta)}(t)3 infidelity reduction). Drift-variance was less effective here.
  • STIRAP (Λ system): At u(θ)(t)u^{(\theta)}(t)4, u(θ)(t)u^{(\theta)}(t)5 reduced peak u(θ)(t)u^{(\theta)}(t)6-state population by u(θ)(t)u^{(\theta)}(t)7 (from u(θ)(t)u^{(\theta)}(t)8 to u(θ)(t)u^{(\theta)}(t)9) and time-integrated exposure by αk(t)\alpha_k(t)0, maintaining fidelity near αk(t)\alpha_k(t)1. PPO baseline degraded to αk(t)\alpha_k(t)2 fidelity.
  • Diamond Four-Level System: Baseline fidelity of αk(t)\alpha_k(t)3 (with αk(t)\alpha_k(t)4 leakage) was improved to αk(t)\alpha_k(t)5 by αk(t)\alpha_k(t)6 (αk(t)\alpha_k(t)7 pp). Under αk(t)\alpha_k(t)8 noise-model mismatch, αk(t)\alpha_k(t)9 maintained dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,10 (dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,11 pp over baseline).
  • Four-Qubit Chain: With asymmetric dephasing (dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,12), dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,13 shifted final-state fidelity from dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,14 (baseline) to dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,15 (dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,16 pp, dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,17 infidelity reduction).
  • IBM Kingston Six-Qubit Chain: Baseline fidelity dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,18 (with dXt(i)=b(i)(t,X[0,t])dt+dWt,i=0,1dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,19); drift-variance (b(1)b^{(1)}0) reached b(1)b^{(1)}1 (b(1)b^{(1)}2 infidelity reduction); b(1)b^{(1)}3 slightly trailing; PPO baseline b(1)b^{(1)}4.

These results demonstrate that QMaxCal regularization efficiently steers trajectories into decoherence-avoiding subspaces and enhances both final-state fidelity and robustness to noise-model mismatch. Gains of up to b(1)b^{(1)}5 infidelity reduction and b(1)b^{(1)}6–b(1)b^{(1)}7 percentage point fidelity boost under noise-model mismatch are reported.

7. Distinguishing Features and Theoretical Context

QMaxCal’s principal innovation is the construction of differentiable path-space KL regularizers for open quantum dynamics, grounding the control penalty in observable statistics of the decoherence channels. Unlike conventional penalties on control fluence or smoothness, which limit total pulse energy or bandwidth, QMaxCal’s terms directly penalize the cumulative environmental “visibility” of noise-induced drift, providing physical interpretability and task relevance. The Wiener-KL regularizer drives evolution into the joint kernel of the Lindblad terms, while the drift-variance identifies any decoherence-free subspace, and is effective even when no joint kernel exists.

A plausible implication is that QMaxCal can substantially improve outcome fidelity in realistic quantum hardware scenarios, especially where noise model mismatch or complex open-system structure obviates reward shaping and prior-based regularization.

For further derivations and technical details, see (Moody et al., 18 Jun 2026) (Appendices B–F).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to QMaxCal Framework.