Papers
Topics
Authors
Recent
Search
2000 character limit reached

Wiener KL Divergence in Quantum Control

Updated 4 July 2026
  • The paper defines Wiener KL as a path-space Kullback–Leibler divergence that penalizes measurement-record drifts relative to a zero-drift Wiener process in quantum control.
  • It derives the divergence via Girsanov’s theorem, establishing a closed-form expectation of quadratic drift deviations akin to Cameron–Martin energy in Gaussian measures.
  • The approach demonstrates practical improvements in quantum fidelity and robustness by steering trajectories toward decoherence-free regions under open quantum system dynamics.

to=arxiv_search.search 天天中彩票中大奖 彩神争霸官网 天天中彩票买json {"query":"KL_W Wiener KL QMaxCal path-space regularization open quantum control arXiv (Moody et al., 18 Jun 2026)", "max_results": 5} to=arxiv_search.search ахәыҷി 北京赛车女json {"query":"Information Projection on Banach spaces with Applications to State Independent KL-Weighted Optimal Control Wiener space arXiv (Selk et al., 2020)", "max_results": 5} to=arxiv_search.search 福利彩票天天彩պեսjson {"query":"A representation for the Kantorovich--Rubinstein distance on the abstract Wiener space arXiv (Riabov, 2016)", "max_results": 5} Wiener KL, usually written KLWKL_W, denotes in the cited arXiv literature a Kullback–Leibler divergence defined on path space with a Wiener reference measure. In the exact notation KLWKL_W used by QMaxCal, it is the KL divergence between the measurement-record distribution induced by a controlled stochastic Schrödinger evolution and standard Wiener measure, so it penalizes the observable trajectory-level consequences of control on decoherence channels rather than the control waveform itself (Moody et al., 18 Jun 2026). In a broader Wiener-space context, this usage belongs to the standard relative-entropy framework for Gaussian and path measures; in particular, for Gaussian shift measures on classical Wiener space, the KL divergence reduces to a quadratic Cameron–Martin energy (Selk et al., 2020).

1. Definition in open quantum control

In QMaxCal, an open quantum system is continuously monitored under the diffusive or homodyne unravelling. Each decoherence channel kk produces a measurement record

dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,

where the dWkdW_k are independent Wiener increments. The drift αk(t)\alpha_k(t) is the observable signature of the state’s exposure to channel kk (Moody et al., 18 Jun 2026).

With PθP_\theta denoting the path measure induced by the controlled stochastic Schrödinger equation and PWP_W denoting standard Wiener measure, the Wiener KL is defined by

KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].

Here KLWKL_W0 is the zero-drift reference process, KLWKL_W1. Accordingly, KLWKL_W2 is literally the KL divergence between the control-induced record distribution and pure Brownian noise (Moody et al., 18 Jun 2026).

This definition makes the regularizer state- and trajectory-sensitive. It does not act on KLWKL_W3 directly, but on the path-space distribution generated after the controls, Hamiltonian, Lindblad operators, and monitoring scheme have been fixed. The paper’s stated interpretation is that minimizing KLWKL_W4 encourages the controlled trajectory to move into regions where decoherence has little or no effect, most importantly into a joint kernel or decoherence-free region when such a region is reachable (Moody et al., 18 Jun 2026).

2. Derivation from Girsanov’s theorem

The central derivation uses Girsanov’s theorem for diffusions with identical noise structure and different drifts. If two path measures KLWKL_W5 and KLWKL_W6 have drift difference KLWKL_W7, then the paper states the scalar change-of-measure formula as

KLWKL_W8

and the KLWKL_W9-channel quantum measurement-record analogue as

kk0

Choosing the reference drift to be zero immediately yields the Wiener KL formula above (Moody et al., 18 Jun 2026).

The derivation depends on a specific structural condition: the measurement record must be an Itô diffusion with unit diffusion coefficient independent of control. In the QMaxCal construction, two quantum evolutions share the same Lindblad operators kk1, so the path measures differ only in the drift of the measurement record. Under exactly that hypothesis, the KL functional becomes a closed-form, differentiable expectation of a quadratic drift functional, and the paper emphasizes that it is straightforward to estimate by Monte Carlo over stochastic Schrödinger trajectories (Moody et al., 18 Jun 2026).

A plausible implication is that kk2 inherits the computational advantages typical of quadratic-energy path-space penalties while remaining tied to an explicitly observable signal, namely the monitored record itself rather than an auxiliary latent quantity.

3. Relation to Wiener-space KL on classical Wiener space

The QMaxCal construction is not an isolated use of KL on Wiener path space. On classical Wiener space kk3, the Cameron–Martin theorem gives explicit Radon–Nikodym derivatives for Gaussian shift measures kk4, and the associated KL divergence is

kk5

If kk6 and kk7 with kk8, then

kk9

The same paper formulates an information projection problem over shift measures and shows that, in this Wiener-space setting, KL projection, KL-weighted optimal control, and minimization of an Onsager–Machlup function are equivalent formulations (Selk et al., 2020).

This places dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,0 in a larger measure-theoretic lineage. The object is not a new divergence axiomatically; it is the ordinary KL divergence specialized to a Wiener reference structure. In the QMaxCal case the reference is standard Wiener measure on the monitored record; in the Banach- and Wiener-space control formulation, the reference is the law of standard Brownian motion and admissible measures are Cameron–Martin shifts (Selk et al., 2020).

A nearby but distinct result is the representation of the Kantorovich–Rubinstein distance on abstract Wiener space. There the relevant functional is not KL divergence but the dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,1-Wasserstein distance dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,2, represented via the divergence or extended stochastic integral operator: dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,3 This is conceptually adjacent because it also turns a path-space measure discrepancy into a variational norm minimization, but it is not a Wiener KL (Riabov, 2016).

4. Role inside QMaxCal and comparison with dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,4

QMaxCal incorporates the Wiener KL into a fidelity-plus-regularization objective

dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,5

where

dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,6

The paper notes that experiments often use either dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,7 or dIk(t)=αk(t)dt+dWk(t),αk(t)=ψ(t)(Lk+Lk)ψ(t),dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,8, not both simultaneously, depending on the benchmark (Moody et al., 18 Jun 2026).

The drift-variance regularizer $dI_k(t)=\alpha_k(t)\,dt+dW_k(t), \qquad \alpha_k(t)=\langle \psi(t)\mid (L_k+L_k^\dagger)\mid \psi(t)\rangle,$9 uses a different reference class. Instead of zero-drift Wiener measure, it chooses the closest constant-drift process: $dW_k$0 and obtains

$dW_k$1

with

$dW_k$2

The paper’s distinction is explicit: dWkdW_k3 penalizes drift magnitude relative to zero drift, whereas $dW_k$4 penalizes fluctuations around a constant drift regardless of whether that constant is zero (Moody et al., 18 Jun 2026).

This leads to different inductive biases. $dW_k$5 has a stronger bias toward the joint kernel $dW_k$6, where drift vanishes. By contrast, $dW_k$7 vanishes on any decoherence-free subspace where the drift is constant in time and across realizations, even if that constant is nonzero. The paper therefore treats dWkdW_k8 as the stronger but more specialized regularizer, and $dW_k$9 as the more generally applicable one (Moody et al., 18 Jun 2026).

5. Benchmark behavior and reported performance

Across the reported single- and multi-qubit benchmarks, as well as a multi-qubit chain calibrated to a published snapshot of the IBM Kingston processor, the regularizers outperform unregularized gradient-based and reinforcement-learning baselines along final-state fidelity, robustness to mismatch in the assumed noise model, and occupation of forbidden states. The abstract reports gains growing from $\alpha_k(t)$0 percentage points at training noise to $\alpha_k(t)$1 percentage points under $\alpha_k(t)$2 noise mismatch, reduction of infidelity by up to $\alpha_k(t)$3, and approximately $\alpha_k(t)$4 gains on the calibrated IBM Kingston chain (Moody et al., 18 Jun 2026).

Benchmark Reported $\alpha_k(t)$5 outcome Structural interpretation
Single-qubit amplitude damping At $\alpha_k(t)$6, baseline fidelity $\alpha_k(t)$7, Wiener KL $\alpha_k(t)$8, $\alpha_k(t)$9 $k$0, PPO kk1 Ground state is the kernel
STIRAP At kk2, fidelity stays essentially unchanged (kk3 baseline, kk4 Wiener KL), but peak $k$5 population drops from kk6 to kk7 Suppresses occupancy of the lossy intermediate state
Diamond system At kk8, baseline kk9, Wiener KL with PθP_\theta0 gives $P_\theta$1, with $P_\theta$2 gives $P_\theta$3, PPO gives PθP_\theta4 Routes population through a safe kernel state

The amplitude-damping example is the clearest demonstration of the mechanism. The Lindblad operator is PθP_\theta5, so the kernel is the ground state PθP_\theta6. The paper reports that the trajectory-variance collapse is large: the time-integrated population variance falls from PθP_\theta7 to PθP_\theta8, approximately a PθP_\theta9 reduction, and the drift-squared integral PWP_W0 drops from PWP_W1 to PWP_W2 at PWP_W3 (Moody et al., 18 Jun 2026).

The STIRAP benchmark illustrates a different regime. Here Wiener KL does not materially improve the final fidelity, but it reduces occupation of the lossy excited state. The paper gives a PWP_W4 reduction in peak PWP_W5 population and a PWP_W6 reduction in time-integrated PWP_W7 exposure at PWP_W8 (Moody et al., 18 Jun 2026). This suggests that PWP_W9 can improve protocol quality even when the terminal fidelity is already near saturation.

The diamond-system robustness test shows the strongest mismatch effect. Trained at KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].0 and evaluated at KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].1, the reported fidelities are KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].2 for the baseline, KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].3 for Wiener KL, and KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].4 for PPO, with the paper noting that the gain grows monotonically with test-noise strength and reaches KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].5 percentage points over the baseline at the hardest mismatch setting (Moody et al., 18 Jun 2026).

6. Terminological scope and ambiguity of the notation

The notation KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].6 is not uniform across arXiv literatures. In the QMaxCal usage, it denotes the Wiener KL regularizer just described (Moody et al., 18 Jun 2026). In work on classical Wiener space, the operative object is still the ordinary KL divergence KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].7 on Gaussian shift measures rather than a separately named KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].8 invariant (Selk et al., 2020). In abstract Wiener-space optimal transport, the relevant discrepancy is KLW:=KL(PθPW)=12k=1KEPθ ⁣[0Tαk(θ)(t)2dt].\mathrm{KL}_W:=\mathrm{KL}(P_\theta\|P_W) = \frac12\sum_{k=1}^K \mathbb{E}_{P_\theta}\!\left[\int_0^T \alpha_k^{(\theta)}(t)^2\,dt\right].9, not KL divergence (Riabov, 2016).

The term “Wiener” is also heavily overloaded outside probability and control. In graph theory it usually denotes the Wiener index

KLWKL_W00

or related invariants such as the terminal Wiener index or Wiener complexity KLWKL_W01. Those papers explicitly do not define a KL-type quantity KLWKL_W02 [(Chen et al., 2013); (Dobrynin et al., 2019); (Dolžan, 2023)]. Separately, the shorthand KLWKL_W03 may also refer to the KL divergence between two Weibull distributions,

KLWKL_W04

which is unrelated to Wiener measure despite the similar subscript notation (Bauckhage, 2013).

Accordingly, the expression “Wiener KL” is context-dependent. As an exact named object, it refers most directly to the QMaxCal path-space regularizer

KLWKL_W05

while its broader mathematical background is the standard KL divergence on Gaussian and Wiener path spaces rather than a separate divergence family (Moody et al., 18 Jun 2026).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Wiener KL (KL_W).