Papers
Topics
Authors
Recent
Search
2000 character limit reached

Sampling-based CVaR Variational Quantum Algorithm

Updated 4 July 2026
  • Sampling-based CVaR VQA is a hybrid quantum-classical method that replaces the mean energy objective with a lower-tail risk functional to emphasize elite state sampling.
  • The approach uses order statistics by sorting measurement outcomes and averaging the lowest α-fraction, which improves convergence to optimal quantum states.
  • Extensions like Ascending-CVaR and smooth tilted-loss surrogates address non-smooth optimization and estimator variance, enhancing algorithm performance under hardware constraints.

Sampling-based Conditional Value-at-Risk variational quantum algorithms are hybrid quantum-classical procedures for combinatorial optimization in which the objective is computed from the lower tail of the energy or loss distribution induced by a parametrized quantum circuit, rather than from its full expectation value. In the standard setting, a circuit U(θ)U(\theta) prepares ψ(θ)|\psi(\theta)\rangle, measurements in the computational basis produce bit strings zz, and a diagonal cost Hamiltonian or directly evaluated classical loss assigns an energy E(z)E(z) to each sample. The resulting objective is the lower-tail CVaRα\mathrm{CVaR}_\alpha, estimated from finite shots by sorting measured energies and averaging the smallest αK\lceil \alpha K\rceil outcomes. This formulation was introduced as an alternative to expectation-based VQE/QAOA for diagonal Hamiltonians (Barkoutsos et al., 2019), and was subsequently extended with evolving risk schedules, direct bitstring-loss evaluation for constrained problems, smooth tilted-loss surrogates, and hardware-scale implementations (Kolotouros et al., 2021).

1. Formal setting and objective

In the diagonal-Hamiltonian formulation, one considers a problem Hamiltonian HH acting on nn qubits, diagonal in the computational basis with eigenstates z|z\rangle for z{0,1}nz\in\{0,1\}^n and eigenvalues ψ(θ)|\psi(\theta)\rangle0. A parametrized state is prepared as

ψ(θ)|\psi(\theta)\rangle1

and measuring in the computational basis yields ψ(θ)|\psi(\theta)\rangle2 with probability

ψ(θ)|\psi(\theta)\rangle3

The induced random variable ψ(θ)|\psi(\theta)\rangle4 takes values ψ(θ)|\psi(\theta)\rangle5 with probabilities ψ(θ)|\psi(\theta)\rangle6, and the standard VQA objective is the expectation

ψ(θ)|\psi(\theta)\rangle7

Sampling-based CVaR-VQA replaces this mean by a lower-tail risk functional (Barkoutsos et al., 2019).

For minimization, the lower-tail Value-at-Risk and Conditional Value-at-Risk at level ψ(θ)|\psi(\theta)\rangle8 are

ψ(θ)|\psi(\theta)\rangle9

and

zz0

Equivalent integral expressions also appear in the literature, including

zz1

and, for the energy-distribution setting,

zz2

At zz3, CVaR reduces to the expectation; as zz4, it approaches the minimum supported by the sampled distribution (Barkoutsos et al., 2019).

A second formulation dispenses with an explicit diagonal Hamiltonian and instead evaluates a classical loss directly on each sampled bit string. This pattern is used in constrained portfolio construction and in custom-penalty formulations for constrained binary optimization, where each shot yields a bit string zz5 and a classically computed value zz6 or zz7; the VQA objective is then the empirical zz8 of those sampled losses (Agliardi et al., 19 Aug 2025). In portfolio applications this point matters conceptually: the VQA objective is CVaR over measured energies from an Ising or directly sampled loss, and is distinct from financial CVaR on portfolio losses (Kolotouros et al., 2021).

2. Sampling estimator and hybrid optimization loop

The defining feature of the sampling-based variant is that the objective is built from order statistics of measurement outcomes. At parameters zz9, one runs the circuit for E(z)E(z)0 shots, computes energies or losses E(z)E(z)1, sorts them as

E(z)E(z)2

and defines E(z)E(z)3. The empirical estimator used in the foundational CVaR-VQA work is

E(z)E(z)4

The same sort-and-average estimator is used in later sampling-based portfolio and constrained-optimization implementations (Barkoutsos et al., 2019).

The hybrid loop is correspondingly simple. One prepares E(z)E(z)5, samples bit strings, computes a classical energy or loss per shot, sorts the resulting values, feeds the empirical tail average to a classical optimizer, and updates E(z)E(z)6. The workflow typically also records the best bit string observed so far across all iterations. In the diagonal setting, this best sample is the lowest-energy measured computational basis state; in constrained formulations, it is the lowest classical loss after penalty evaluation (Lee et al., 22 Apr 2026).

Because the objective depends on sorting and on a quantile threshold, the sampled CVaR landscape is non-smooth. The original experiments therefore used gradient-free COBYLA rather than analytic gradients (Barkoutsos et al., 2019). Subsequent studies retained derivative-free optimizers, including COBYLA for VQE/QAOA comparisons (Kolotouros et al., 2021), Powell for step-penalty constrained optimization (Lee et al., 22 Apr 2026), the Nakanishi–Fujii–Todo optimizer for 109-qubit ETF construction (Agliardi et al., 19 Aug 2025), and a two-stage PSOE(z)E(z)7NFT schedule for 150-qubit dynamic portfolio optimization (Haghighi, 8 Jun 2026).

Sampling overhead is intrinsic to the method. Since only the lowest E(z)E(z)8-fraction of samples contributes to the objective, the reported analyses note that decreasing E(z)E(z)9 increases estimator variance and typically requires more shots to match the statistical precision of the mean estimator. One common practical rule is to scale shots by approximately CVaRα\mathrm{CVaR}_\alpha0; in Ascending-CVaR experiments, a baseline CVaRα\mathrm{CVaR}_\alpha1 was scaled as CVaRα\mathrm{CVaR}_\alpha2, and in dynamic portfolio optimization the schedule

CVaRα\mathrm{CVaR}_\alpha3

was used to stabilize tail estimates (Kolotouros et al., 2021). Sorting contributes only an CVaRα\mathrm{CVaR}_\alpha4 classical overhead per objective evaluation (Kolotouros et al., 2021).

3. Landscape reshaping and theoretical properties

The principal theoretical motivation for CVaR-VQA is that, for diagonal Hamiltonians, the practical goal is often not to minimize average measured energy but to increase the probability of sampling low-energy bit strings. CVaR enforces this by discarding or downweighting high-energy outcomes and concentrating the objective on the lower tail. The foundational analysis explicitly states that, for classical optimization problems mapped to diagonal Hamiltonians, aggregating measurement samples by CVaR is more natural than using the expected value (Barkoutsos et al., 2019).

This reshaping can alter the local minima structure. A central proposition in the original work states that a local minimum of CVaRα\mathrm{CVaR}_\alpha5 does not necessarily correspond to a local minimum of CVaRα\mathrm{CVaR}_\alpha6, and vice versa (Barkoutsos et al., 2019). The paper’s two-qubit example makes the point starkly: for

CVaRα\mathrm{CVaR}_\alpha7

the expectation objective is constant in CVaRα\mathrm{CVaR}_\alpha8, so every parameter value is a global minimum and no optimization is possible, whereas at CVaRα\mathrm{CVaR}_\alpha9 the CVaR objective becomes

αK\lceil \alpha K\rceil0

which has nontrivial local structure and drives the state toward increased ground-state sampling probability (Barkoutsos et al., 2019).

The global minima story is more nuanced. If the variational form can reach the ground state and αK\lceil \alpha K\rceil1 is diagonal, then a global optimum of the expectation objective also minimizes αK\lceil \alpha K\rceil2 for any αK\lceil \alpha K\rceil3. Conversely, states with overlap αK\lceil \alpha K\rceil4 on the ground state are global minima of αK\lceil \alpha K\rceil5 for αK\lceil \alpha K\rceil6, even if they are not minima of the expectation objective (Barkoutsos et al., 2019). This is one source of the method’s practical utility: the objective can already be minimized once the circuit has moved sufficient probability mass onto optimal samples, without requiring the full distribution to be sharply concentrated.

The same papers also emphasize two caveats that are often overlooked. First, small αK\lceil \alpha K\rceil7 produces a “soft cap”: once the overlap with the optimum exceeds αK\lceil \alpha K\rceil8, further increasing that overlap need not improve the CVaR objective (Barkoutsos et al., 2019). Second, CVaR does not remove ansatz expressivity bottlenecks. In the QAOA analysis of the original study, small-depth circuits can remain “flat,” keeping amplitudes exponentially small on individual bit strings; CVaR can still guide optimization by focusing on the low-energy tail, but shallow-depth reachability remains a bottleneck (Barkoutsos et al., 2019). Later experiments found the same qualitative pattern: Ascending-CVaR improved QAOA overlap relative to fixed CVaR, yet absolute overlaps remained low at shallow depth for Max-Cut, Number Partitioning, and Portfolio Optimization (Kolotouros et al., 2021).

4. Scheduled CVaR objectives and smooth surrogates

A major extension of the fixed-αK\lceil \alpha K\rceil9 formulation is the evolving-risk objective known as Ascending-CVaR. In that approach the objective at optimization stage HH0 is

HH1

with a risk level HH2 that increases during training. The reported schedules include a linear update

HH3

with HH4 in 15–20 qubit VQE experiments, and a sigmoid schedule with HH5 that spends more early iterations at small HH6 and was beneficial for harder Number Partitioning instances (Kolotouros et al., 2021).

The theoretical justification is continuation across objective landscapes. The paper reports that if the ansatz can achieve maximum overlap HH7 with the ground state, then for any HH8, minimizing HH9 preserves the same global optimum as minimizing the energy. It also reports that if nn0 is a global minimizer for nn1 with nn2, then it is also a global minimizer for any nn3 with nn4, while local minima can change as nn5 changes (Kolotouros et al., 2021). The intended effect is to use small-nn6 phases to identify promising basins and larger-nn7 phases to erase suboptimal minima without perturbing the true optimum.

Not all schedules widen the tail. In dynamic portfolio optimization, a descending adaptive schedule was introduced:

nn8

with nn9, z|z\rangle0, and z|z\rangle1, decreasing every 48 NFT parameter updates (Haghighi, 8 Jun 2026). There the purpose is explicitly different: start with a broad, stable signal to steer amplitude toward feasible or near-feasible regions, then tighten the tail to refine high-quality samples. The literature therefore contains both widening and tightening schedules. This suggests that schedule direction is task-dependent: Ascending-CVaR addresses spurious local minima in unconstrained or lightly constrained landscapes, whereas descending schedules can prioritize feasibility discovery before tail refinement.

A separate line of development places CVaR inside a smooth tilted-loss framework. The Quantum Tilted Loss is defined by

z|z\rangle2

It interpolates continuously between expectation minimization and tail-sensitive objectives, preserves the true global minima, and admits exact parameter-shift gradients for suitable gate generators (Qiu et al., 4 May 2026). For z|z\rangle3, the paper proves the entropic lower bound

z|z\rangle4

and defines the corresponding EVaR surrogate by optimizing the right-hand side over z|z\rangle5 (Qiu et al., 4 May 2026). The same work formalizes the trade-off that sharper tilting can improve trainability while increasing finite-shot variance, shifting the bottleneck from vanishing gradients to measurement sampling variance.

5. Applications and empirical record

The empirical record spans simulation, small-device demonstrations, utility-scale sampling workflows, and constrained optimization with nonlinear penalties. Across these settings, the common pattern is that tail-focused objectives improve the probability of sampling good bit strings, but the effective choice of z|z\rangle6, optimizer, and shot budget remains problem-dependent.

The original CVaR-VQA study evaluated maximum stable set, Max3Sat, Number Partitioning, MaxCut, Market Split, and Portfolio Optimization across 340 random instances and 14,280 test cases. In simulation, CVaR-VQE improved rapidly as z|z\rangle7 decreased and circuit depth increased; with VQE depth z|z\rangle8 and z|z\rangle9, almost all instances reached at least z{0,1}nz\in\{0,1\}^n0 overlap within 50 normalized iterations, whereas at z{0,1}nz\in\{0,1\}^n1 only about z{0,1}nz\in\{0,1\}^n2 did so. On IBM Q Poughkeepsie, a 6-qubit portfolio optimization experiment with 8192 shots per evaluation found that z{0,1}nz\in\{0,1\}^n3 and z{0,1}nz\in\{0,1\}^n4 quickly increased ground-state overlap to the z{0,1}nz\in\{0,1\}^n5-level in each of five runs, while z{0,1}nz\in\{0,1\}^n6 remained very small and largely flat (Barkoutsos et al., 2019).

Ascending-CVaR extended this line in noiseless emulation for Max-Cut, Number Partitioning, and Portfolio Optimization using VQE and QAOA. In VQE, Ascending-CVaR reportedly outperformed both the expectation objective and constant CVaR on all tested cases, achieving up to ten times greater overlap in Portfolio Optimization and Number Partitioning and an z{0,1}nz\in\{0,1\}^n7 average improvement in Max-Cut. In hard Number Partitioning instances, standard objectives failed in almost all cases, constant CVaR found the correct solution in about z{0,1}nz\in\{0,1\}^n8 of cases, and Ascending-CVaR did so in about z{0,1}nz\in\{0,1\}^n9 (Kolotouros et al., 2021).

Later work generalized the sampling-based pattern to constrained losses evaluated directly on bit strings. In multi-dimensional knapsack, a slack-free step-penalty formulation combined with CVaR at ψ(θ)|\psi(\theta)\rangle00 and 4000 shots per evaluation outperformed slack-based QUBO on all 12 reported instances in mean optimality gap; for the largest reported instance, pet7 with ψ(θ)|\psi(\theta)\rangle01, the median gap was about ψ(θ)|\psi(\theta)\rangle02 for the finite-sampling mean objective and about ψ(θ)|\psi(\theta)\rangle03 for CVaR (Lee et al., 22 Apr 2026). In ETF construction, a 109-qubit hardware experiment on IBM Heron processors, using ψ(θ)|\psi(\theta)\rangle04 shots, ψ(θ)|\psi(\theta)\rangle05, and local-search post-processing, reached a best relative solution error of ψ(θ)|\psi(\theta)\rangle06 on a simplified 109-bond instance (Agliardi et al., 19 Aug 2025). In 150-qubit dynamic portfolio optimization, simulator studies selected an adaptive CVaR schedule, PSOψ(θ)|\psi(\theta)\rangle07NFT optimizer scheduling, and hardware-aware layouts; subsequent ibm_quebec experiments found that the heavy-hex-native deep-chain layout achieved the best final CVaR-tail value and best sampled objective among the tested layouts, although no quantum advantage over SCIP was observed (Haghighi, 8 Jun 2026).

Study Setting Reported outcome
(Barkoutsos et al., 2019) 340 random instances; VQE/QAOA; also 6-qubit IBM hardware Lower ψ(θ)|\psi(\theta)\rangle08 improved overlap and convergence; hardware runs at ψ(θ)|\psi(\theta)\rangle09 outperformed ψ(θ)|\psi(\theta)\rangle10
(Kolotouros et al., 2021) 15–20 qubits; Max-Cut, Number Partitioning, Portfolio; VQE and QAOA Ascending-CVaR exceeded constant CVaR and expectation; hard Number Partitioning reached about ψ(θ)|\psi(\theta)\rangle11 success
(Lee et al., 22 Apr 2026) MDKP with slack-free custom penalty; 4000 shots Custom penalty + CVaR beat slack-based QUBO on all 12 instances
(Agliardi et al., 19 Aug 2025) 109-qubit ETF construction on IBM Heron Best post-processed relative gap ψ(θ)|\psi(\theta)\rangle12
(Haghighi, 8 Jun 2026) 150-qubit dynamic portfolio optimization Adaptive CVaR and hardware-native layouts improved practical performance; no quantum advantage over SCIP

6. Limitations, misconceptions, and evaluation methodology

A recurring misconception is that smaller ψ(θ)|\psi(\theta)\rangle13 is unconditionally superior. The literature does not support that conclusion. Small ψ(θ)|\psi(\theta)\rangle14 sharpens focus on elite samples, but it also raises estimator variance, introduces a soft cap on rewarded overlap, and can destabilize derivative-free optimization if shot counts are not increased accordingly (Barkoutsos et al., 2019). A later evaluation framework, applied to a 16-qubit QUBO using RealAmplitudes and COBYLA, found that ψ(θ)|\psi(\theta)\rangle15 dominated lower ψ(θ)|\psi(\theta)\rangle16 values in feasibility and quality at moderate-to-high shot counts, with diminishing gains beyond about 20,000 shots for most ψ(θ)|\psi(\theta)\rangle17 (Mamedaliev et al., 21 Jan 2026). Another study likewise reported that fixed ψ(θ)|\psi(\theta)\rangle18 without shot scaling performs poorly in dynamic portfolio optimization (Haghighi, 8 Jun 2026). This suggests that the effective risk level depends on the interaction among problem structure, ansatz, optimizer, and sampling budget.

A second misconception is that CVaR-VQA is defined only for Ising/QUBO expectation estimation. Several newer formulations use the quantum circuit purely as a sampler and compute a nonlinear classical loss on each observed bit string. This is how hinge-penalty ETF construction, step-penalty knapsack optimization, and dynamic multi-period portfolio objectives avoid explicit slack-variable Hamiltonians or costly exact expectation computation for nonlinear penalties (Agliardi et al., 19 Aug 2025). The trade-off is that qubit savings and modeling flexibility come at the price of heavier shot budgets and explicit classical post-processing.

Because sampling-based VQAs are stochastic at multiple levels, evaluation has itself become a research topic. A dedicated framework introduced three complementary metrics—feasibility, quality, and reproducibility—and a quality diagram that visualizes the trade-off between success probability and resource consumption. Reproducibility is quantified by Shannon entropy over binned outcome distributions, and algorithm selection is framed as a thresholded decision rule under resource constraints (Mamedaliev et al., 21 Jan 2026). This perspective treats CVaR-VQAs not as single trajectories but as distributions over performance outcomes.

The main technical limitations remain measurement complexity, non-smoothness, hardware noise, and ansatz reachability. The empirical CVaR objective depends on order statistics and can be discontinuous under finite sampling; exact parameter-shift does not apply directly to the hard-truncated estimator, which is one reason smooth surrogates such as QTL and EVaR have been proposed (Qiu et al., 4 May 2026). Device noise and readout error distort energy histograms and quantiles, making careful calibration important (Kolotouros et al., 2021). More fundamentally, if the ansatz cannot reach the ground-state subspace, CVaR preserves the true optimum as an objective target but cannot make it attainable (Qiu et al., 4 May 2026).

Sampling-based CVaR-VQA is therefore best understood not as a single algorithmic recipe but as a family of tail-sensitive hybrid optimization procedures. Its defining idea is stable across variants: use finite-shot samples from a parametrized quantum circuit, evaluate a classical optimization loss on those samples, and train on the lower tail rather than the full mean. The subsequent literature shows that this family encompasses fixed-risk CVaR, evolving risk schedules, smooth entropic surrogates, custom penalties for constrained optimization, and hardware-aware large-qubit workflows, with performance gains that are substantial in many reported settings but are tightly conditioned by shot allocation, optimizer design, and circuit expressivity (Barkoutsos et al., 2019).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Sampling-based CVaR Variational Quantum Algorithm (VQA).