QMaxCal Framework: Path-Space KL Regularization

Updated 23 June 2026

QMaxCal is a quantum control framework that regularizes open-system dynamics using path-space KL divergence, leveraging Girsanov’s theorem.
It introduces Wiener-KL and drift-variance regularizers to penalize controls that enhance environmental decoherence, enabling differentiable optimization.
Empirical tests on quantum benchmarks demonstrate up to 50% infidelity reduction and improved robustness under noise-model mismatches.

QMaxCal (“Quantum Maximum Caliber”) is a path-space Kullback–Leibler (KL) regularization framework for open quantum system control problems under decoherence. Its principled regularizers leverage Girsanov’s theorem to penalize controls that lead to trajectories with enhanced observable effects of the environment, thus driving the system toward states or subspaces with minimal decoherence. QMaxCal introduces two KL-based regularization terms—the Wiener-KL and drift-variance regularizers—complementing standard control fluence penalties, and produces closed-form, differentiable estimators for use in gradient-based and reinforcement learning optimization of time-dependent controls (Moody et al., 18 Jun 2026).

1. Open Quantum Control in Path Space

When a quantum system interacts with its environment, continuous monitoring of decoherence channels induces stochastic pure state trajectories governed by the stochastic Schrödinger equation (SSE). Each monitored decoherence channel $k$ yields a classical trajectory

$dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$

where the drift $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ encodes the effect of the decoherence operator $L_k$ and $dW_k(t)$ is the Wiener increment. Different control protocols $u^{(\theta)}(t)$ produce distinct drifts $\alpha_k(t)$ but are subject to the same environmental noise realizations.

Girsanov’s theorem provides a mechanism to relate the path probability distributions generated by different control protocols acting on the same open quantum system, by expressing a closed-form Radon–Nikodym derivative and KL divergence between their associated ensembles of measurement records. The QMaxCal framework exploits this result to regularize open-system control strategies, explicitly penalizing the projected impact of the system’s evolution onto the decoherence channels.

2. Girsanov-Based Path-Space KL Divergence

The classical version of Girsanov’s theorem addresses diffusions of the form

$dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$

within a shared Wiener-noise probability space. The key result is that the KL divergence between trajectories generated by two drifts $b^{(1)}$ and $b^{(0)}$ is

$dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 0

where $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 1. In the quantum-trajectory case, measurement records for each decoherence channel inherit this structure: under controls $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 2, the pathwise records are diffusions with drifts $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 3. The relative entropy reads

$dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 4

Selecting appropriate reference measures produces motivated regularizers for quantum control objectives.

3. Regularizers: Wiener-KL and Drift-Variance

QMaxCal defines two primary path-space regularizers for a control protocol parameterized by $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 5:

Regularizer	Reference Process	Penalty Formulation
Wiener-KL ( $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 6)	Brownian motion ( $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 7)	$dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 8
Drift-variance ( $dI_k(t) = \alpha_k(t)\,dt + dW_k(t)$ 9)	Constant-drift process ( $\alpha_k(t) = \langle\psi(t)\|(L_k + L_k^\dagger)\|\psi(t)\rangle$ 0)	$\alpha_k(t) = \langle\psi(t)\|(L_k + L_k^\dagger)\|\psi(t)\rangle$ 1

Wiener-KL ( $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 2): The reference is pure Brownian motion (zero drift). This regularizer penalizes the mean-square drift, incentivizing trajectories to approach the joint kernel $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 3—the “dark” or decoherence-free states under all channels.
Drift-variance ( $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 4): The reference is the best-fit constant-drift process, minimizing over $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 5. For each channel, the optimal $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 6. The penalty quantifies the temporal variance of each drift about its mean, vanishing exactly for decoherence-free subspaces (DFS) with constant $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 7.

These KL-derived penalties differ qualitatively from standard fluence or pulse-smoothness regularization by acting directly on time-resolved observables of the system-environment interaction rather than on control differentiability or bandwidth.

4. Augmented Control Objective and Derivatives

QMaxCal’s objective for a state-transfer task from $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 8 to $\alpha_k(t) = \langle\psi(t)|(L_k + L_k^\dagger)|\psi(t)\rangle$ 9 at fixed $L_k$ 0 is

$L_k$ 1

where $L_k$ 2, $L_k$ 3 is an optional fluence constraint, and expectations are over sampled SSE trajectories.

The gradients of the objective are computed by backpropagation through the sampled trajectories. For the regularizers:

$L_k$ 4
$L_k$ 5

The derivatives $L_k$ 6 are computed via automatic differentiation through the SSE numerical integrator, which also yields gradients for the fluence term.

5. Optimization Protocol

The gradient-based QMaxCal algorithm proceeds as follows:

Parameter initialization: E.g., Fourier coefficients for each control channel.
Trajectory sampling: Integrate the SSE for $L_k$ 7 trajectories $L_k$ 8 under $L_k$ 9.
Observable accumulation: For each trajectory, record final-state fidelity and drifts $dW_k(t)$ 0.
Estimator calculation: Compute sample means for the objective, $dW_k(t)$ 1, $dW_k(t)$ 2, and fluence.
Objective and gradient computation: Use automatic differentiation to obtain $dW_k(t)$ 3 and $dW_k(t)$ 4.
Parameter update: Apply a gradient descent step (e.g., Adam).

A reinforcement learning adaptation uses, e.g., PPO with the negative of the regularized fidelity as the reward.

6. Empirical Performance Across Quantum Benchmarks

QMaxCal was evaluated on five representative open quantum system benchmarks, with consistent comparison to unregularized gradient-based trajectory optimizers and RL-based PPO baselines:

Single-Qubit Amplitude Damping: $dW_k(t)$ 5 (with $dW_k(t)$ 6) contracted SSE-trajectory population variance by $dW_k(t)$ 7 at $dW_k(t)$ 8 (from $dW_k(t)$ 9 to $u^{(\theta)}(t)$ 0) and achieved up to $u^{(\theta)}(t)$ 1– $u^{(\theta)}(t)$ 2 percentage point fidelity improvement ( $u^{(\theta)}(t)$ 3 infidelity reduction). Drift-variance was less effective here.
STIRAP (Λ system): At $u^{(\theta)}(t)$ 4, $u^{(\theta)}(t)$ 5 reduced peak $u^{(\theta)}(t)$ 6-state population by $u^{(\theta)}(t)$ 7 (from $u^{(\theta)}(t)$ 8 to $u^{(\theta)}(t)$ 9) and time-integrated exposure by $\alpha_k(t)$ 0, maintaining fidelity near $\alpha_k(t)$ 1. PPO baseline degraded to $\alpha_k(t)$ 2 fidelity.
Diamond Four-Level System: Baseline fidelity of $\alpha_k(t)$ 3 (with $\alpha_k(t)$ 4 leakage) was improved to $\alpha_k(t)$ 5 by $\alpha_k(t)$ 6 ( $\alpha_k(t)$ 7 pp). Under $\alpha_k(t)$ 8 noise-model mismatch, $\alpha_k(t)$ 9 maintained $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 0 ( $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 1 pp over baseline).
Four-Qubit Chain: With asymmetric dephasing ( $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 2), $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 3 shifted final-state fidelity from $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 4 (baseline) to $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 5 ( $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 6 pp, $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 7 infidelity reduction).
IBM Kingston Six-Qubit Chain: Baseline fidelity $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 8 (with $dX_t^{(i)} = b^{(i)}(t, X_{[0,t]})\,dt + dW_t, \quad i=0,1$ 9); drift-variance ( $b^{(1)}$ 0) reached $b^{(1)}$ 1 ( $b^{(1)}$ 2 infidelity reduction); $b^{(1)}$ 3 slightly trailing; PPO baseline $b^{(1)}$ 4.

These results demonstrate that QMaxCal regularization efficiently steers trajectories into decoherence-avoiding subspaces and enhances both final-state fidelity and robustness to noise-model mismatch. Gains of up to $b^{(1)}$ 5 infidelity reduction and $b^{(1)}$ 6– $b^{(1)}$ 7 percentage point fidelity boost under noise-model mismatch are reported.

7. Distinguishing Features and Theoretical Context

QMaxCal’s principal innovation is the construction of differentiable path-space KL regularizers for open quantum dynamics, grounding the control penalty in observable statistics of the decoherence channels. Unlike conventional penalties on control fluence or smoothness, which limit total pulse energy or bandwidth, QMaxCal’s terms directly penalize the cumulative environmental “visibility” of noise-induced drift, providing physical interpretability and task relevance. The Wiener-KL regularizer drives evolution into the joint kernel of the Lindblad terms, while the drift-variance identifies any decoherence-free subspace, and is effective even when no joint kernel exists.

A plausible implication is that QMaxCal can substantially improve outcome fidelity in realistic quantum hardware scenarios, especially where noise model mismatch or complex open-system structure obviates reward shaping and prior-based regularization.

For further derivations and technical details, see (Moody et al., 18 Jun 2026) (Appendices B–F).

Markdown Report Issue Upgrade to Chat

References (1)

QMaxCal: Path-Space Regularization for Open Quantum Control via Girsanov's Theorem (2026)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to QMaxCal Framework.

QMaxCal Framework: Path-Space KL Regularization

1. Open Quantum Control in Path Space

2. Girsanov-Based Path-Space KL Divergence

3. Regularizers: Wiener-KL and Drift-Variance

4. Augmented Control Objective and Derivatives

5. Optimization Protocol

6. Empirical Performance Across Quantum Benchmarks

7. Distinguishing Features and Theoretical Context

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

QMaxCal Framework: Path-Space KL Regularization

1. Open Quantum Control in Path Space

2. Girsanov-Based Path-Space KL Divergence

3. Regularizers: Wiener-KL and Drift-Variance

4. Augmented Control Objective and Derivatives

5. Optimization Protocol

6. Empirical Performance Across Quantum Benchmarks

7. Distinguishing Features and Theoretical Context

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research