Sampling-based CVaR Variational Quantum Algorithm
- Sampling-based CVaR VQA is a hybrid quantum-classical method that replaces the mean energy objective with a lower-tail risk functional to emphasize elite state sampling.
- The approach uses order statistics by sorting measurement outcomes and averaging the lowest α-fraction, which improves convergence to optimal quantum states.
- Extensions like Ascending-CVaR and smooth tilted-loss surrogates address non-smooth optimization and estimator variance, enhancing algorithm performance under hardware constraints.
Sampling-based Conditional Value-at-Risk variational quantum algorithms are hybrid quantum-classical procedures for combinatorial optimization in which the objective is computed from the lower tail of the energy or loss distribution induced by a parametrized quantum circuit, rather than from its full expectation value. In the standard setting, a circuit prepares , measurements in the computational basis produce bit strings , and a diagonal cost Hamiltonian or directly evaluated classical loss assigns an energy to each sample. The resulting objective is the lower-tail , estimated from finite shots by sorting measured energies and averaging the smallest outcomes. This formulation was introduced as an alternative to expectation-based VQE/QAOA for diagonal Hamiltonians (Barkoutsos et al., 2019), and was subsequently extended with evolving risk schedules, direct bitstring-loss evaluation for constrained problems, smooth tilted-loss surrogates, and hardware-scale implementations (Kolotouros et al., 2021).
1. Formal setting and objective
In the diagonal-Hamiltonian formulation, one considers a problem Hamiltonian acting on qubits, diagonal in the computational basis with eigenstates for and eigenvalues 0. A parametrized state is prepared as
1
and measuring in the computational basis yields 2 with probability
3
The induced random variable 4 takes values 5 with probabilities 6, and the standard VQA objective is the expectation
7
Sampling-based CVaR-VQA replaces this mean by a lower-tail risk functional (Barkoutsos et al., 2019).
For minimization, the lower-tail Value-at-Risk and Conditional Value-at-Risk at level 8 are
9
and
0
Equivalent integral expressions also appear in the literature, including
1
and, for the energy-distribution setting,
2
At 3, CVaR reduces to the expectation; as 4, it approaches the minimum supported by the sampled distribution (Barkoutsos et al., 2019).
A second formulation dispenses with an explicit diagonal Hamiltonian and instead evaluates a classical loss directly on each sampled bit string. This pattern is used in constrained portfolio construction and in custom-penalty formulations for constrained binary optimization, where each shot yields a bit string 5 and a classically computed value 6 or 7; the VQA objective is then the empirical 8 of those sampled losses (Agliardi et al., 19 Aug 2025). In portfolio applications this point matters conceptually: the VQA objective is CVaR over measured energies from an Ising or directly sampled loss, and is distinct from financial CVaR on portfolio losses (Kolotouros et al., 2021).
2. Sampling estimator and hybrid optimization loop
The defining feature of the sampling-based variant is that the objective is built from order statistics of measurement outcomes. At parameters 9, one runs the circuit for 0 shots, computes energies or losses 1, sorts them as
2
and defines 3. The empirical estimator used in the foundational CVaR-VQA work is
4
The same sort-and-average estimator is used in later sampling-based portfolio and constrained-optimization implementations (Barkoutsos et al., 2019).
The hybrid loop is correspondingly simple. One prepares 5, samples bit strings, computes a classical energy or loss per shot, sorts the resulting values, feeds the empirical tail average to a classical optimizer, and updates 6. The workflow typically also records the best bit string observed so far across all iterations. In the diagonal setting, this best sample is the lowest-energy measured computational basis state; in constrained formulations, it is the lowest classical loss after penalty evaluation (Lee et al., 22 Apr 2026).
Because the objective depends on sorting and on a quantile threshold, the sampled CVaR landscape is non-smooth. The original experiments therefore used gradient-free COBYLA rather than analytic gradients (Barkoutsos et al., 2019). Subsequent studies retained derivative-free optimizers, including COBYLA for VQE/QAOA comparisons (Kolotouros et al., 2021), Powell for step-penalty constrained optimization (Lee et al., 22 Apr 2026), the Nakanishi–Fujii–Todo optimizer for 109-qubit ETF construction (Agliardi et al., 19 Aug 2025), and a two-stage PSO7NFT schedule for 150-qubit dynamic portfolio optimization (Haghighi, 8 Jun 2026).
Sampling overhead is intrinsic to the method. Since only the lowest 8-fraction of samples contributes to the objective, the reported analyses note that decreasing 9 increases estimator variance and typically requires more shots to match the statistical precision of the mean estimator. One common practical rule is to scale shots by approximately 0; in Ascending-CVaR experiments, a baseline 1 was scaled as 2, and in dynamic portfolio optimization the schedule
3
was used to stabilize tail estimates (Kolotouros et al., 2021). Sorting contributes only an 4 classical overhead per objective evaluation (Kolotouros et al., 2021).
3. Landscape reshaping and theoretical properties
The principal theoretical motivation for CVaR-VQA is that, for diagonal Hamiltonians, the practical goal is often not to minimize average measured energy but to increase the probability of sampling low-energy bit strings. CVaR enforces this by discarding or downweighting high-energy outcomes and concentrating the objective on the lower tail. The foundational analysis explicitly states that, for classical optimization problems mapped to diagonal Hamiltonians, aggregating measurement samples by CVaR is more natural than using the expected value (Barkoutsos et al., 2019).
This reshaping can alter the local minima structure. A central proposition in the original work states that a local minimum of 5 does not necessarily correspond to a local minimum of 6, and vice versa (Barkoutsos et al., 2019). The paper’s two-qubit example makes the point starkly: for
7
the expectation objective is constant in 8, so every parameter value is a global minimum and no optimization is possible, whereas at 9 the CVaR objective becomes
0
which has nontrivial local structure and drives the state toward increased ground-state sampling probability (Barkoutsos et al., 2019).
The global minima story is more nuanced. If the variational form can reach the ground state and 1 is diagonal, then a global optimum of the expectation objective also minimizes 2 for any 3. Conversely, states with overlap 4 on the ground state are global minima of 5 for 6, even if they are not minima of the expectation objective (Barkoutsos et al., 2019). This is one source of the method’s practical utility: the objective can already be minimized once the circuit has moved sufficient probability mass onto optimal samples, without requiring the full distribution to be sharply concentrated.
The same papers also emphasize two caveats that are often overlooked. First, small 7 produces a “soft cap”: once the overlap with the optimum exceeds 8, further increasing that overlap need not improve the CVaR objective (Barkoutsos et al., 2019). Second, CVaR does not remove ansatz expressivity bottlenecks. In the QAOA analysis of the original study, small-depth circuits can remain “flat,” keeping amplitudes exponentially small on individual bit strings; CVaR can still guide optimization by focusing on the low-energy tail, but shallow-depth reachability remains a bottleneck (Barkoutsos et al., 2019). Later experiments found the same qualitative pattern: Ascending-CVaR improved QAOA overlap relative to fixed CVaR, yet absolute overlaps remained low at shallow depth for Max-Cut, Number Partitioning, and Portfolio Optimization (Kolotouros et al., 2021).
4. Scheduled CVaR objectives and smooth surrogates
A major extension of the fixed-9 formulation is the evolving-risk objective known as Ascending-CVaR. In that approach the objective at optimization stage 0 is
1
with a risk level 2 that increases during training. The reported schedules include a linear update
3
with 4 in 15–20 qubit VQE experiments, and a sigmoid schedule with 5 that spends more early iterations at small 6 and was beneficial for harder Number Partitioning instances (Kolotouros et al., 2021).
The theoretical justification is continuation across objective landscapes. The paper reports that if the ansatz can achieve maximum overlap 7 with the ground state, then for any 8, minimizing 9 preserves the same global optimum as minimizing the energy. It also reports that if 0 is a global minimizer for 1 with 2, then it is also a global minimizer for any 3 with 4, while local minima can change as 5 changes (Kolotouros et al., 2021). The intended effect is to use small-6 phases to identify promising basins and larger-7 phases to erase suboptimal minima without perturbing the true optimum.
Not all schedules widen the tail. In dynamic portfolio optimization, a descending adaptive schedule was introduced:
8
with 9, 0, and 1, decreasing every 48 NFT parameter updates (Haghighi, 8 Jun 2026). There the purpose is explicitly different: start with a broad, stable signal to steer amplitude toward feasible or near-feasible regions, then tighten the tail to refine high-quality samples. The literature therefore contains both widening and tightening schedules. This suggests that schedule direction is task-dependent: Ascending-CVaR addresses spurious local minima in unconstrained or lightly constrained landscapes, whereas descending schedules can prioritize feasibility discovery before tail refinement.
A separate line of development places CVaR inside a smooth tilted-loss framework. The Quantum Tilted Loss is defined by
2
It interpolates continuously between expectation minimization and tail-sensitive objectives, preserves the true global minima, and admits exact parameter-shift gradients for suitable gate generators (Qiu et al., 4 May 2026). For 3, the paper proves the entropic lower bound
4
and defines the corresponding EVaR surrogate by optimizing the right-hand side over 5 (Qiu et al., 4 May 2026). The same work formalizes the trade-off that sharper tilting can improve trainability while increasing finite-shot variance, shifting the bottleneck from vanishing gradients to measurement sampling variance.
5. Applications and empirical record
The empirical record spans simulation, small-device demonstrations, utility-scale sampling workflows, and constrained optimization with nonlinear penalties. Across these settings, the common pattern is that tail-focused objectives improve the probability of sampling good bit strings, but the effective choice of 6, optimizer, and shot budget remains problem-dependent.
The original CVaR-VQA study evaluated maximum stable set, Max3Sat, Number Partitioning, MaxCut, Market Split, and Portfolio Optimization across 340 random instances and 14,280 test cases. In simulation, CVaR-VQE improved rapidly as 7 decreased and circuit depth increased; with VQE depth 8 and 9, almost all instances reached at least 0 overlap within 50 normalized iterations, whereas at 1 only about 2 did so. On IBM Q Poughkeepsie, a 6-qubit portfolio optimization experiment with 8192 shots per evaluation found that 3 and 4 quickly increased ground-state overlap to the 5-level in each of five runs, while 6 remained very small and largely flat (Barkoutsos et al., 2019).
Ascending-CVaR extended this line in noiseless emulation for Max-Cut, Number Partitioning, and Portfolio Optimization using VQE and QAOA. In VQE, Ascending-CVaR reportedly outperformed both the expectation objective and constant CVaR on all tested cases, achieving up to ten times greater overlap in Portfolio Optimization and Number Partitioning and an 7 average improvement in Max-Cut. In hard Number Partitioning instances, standard objectives failed in almost all cases, constant CVaR found the correct solution in about 8 of cases, and Ascending-CVaR did so in about 9 (Kolotouros et al., 2021).
Later work generalized the sampling-based pattern to constrained losses evaluated directly on bit strings. In multi-dimensional knapsack, a slack-free step-penalty formulation combined with CVaR at 00 and 4000 shots per evaluation outperformed slack-based QUBO on all 12 reported instances in mean optimality gap; for the largest reported instance, pet7 with 01, the median gap was about 02 for the finite-sampling mean objective and about 03 for CVaR (Lee et al., 22 Apr 2026). In ETF construction, a 109-qubit hardware experiment on IBM Heron processors, using 04 shots, 05, and local-search post-processing, reached a best relative solution error of 06 on a simplified 109-bond instance (Agliardi et al., 19 Aug 2025). In 150-qubit dynamic portfolio optimization, simulator studies selected an adaptive CVaR schedule, PSO07NFT optimizer scheduling, and hardware-aware layouts; subsequent ibm_quebec experiments found that the heavy-hex-native deep-chain layout achieved the best final CVaR-tail value and best sampled objective among the tested layouts, although no quantum advantage over SCIP was observed (Haghighi, 8 Jun 2026).
| Study | Setting | Reported outcome |
|---|---|---|
| (Barkoutsos et al., 2019) | 340 random instances; VQE/QAOA; also 6-qubit IBM hardware | Lower 08 improved overlap and convergence; hardware runs at 09 outperformed 10 |
| (Kolotouros et al., 2021) | 15–20 qubits; Max-Cut, Number Partitioning, Portfolio; VQE and QAOA | Ascending-CVaR exceeded constant CVaR and expectation; hard Number Partitioning reached about 11 success |
| (Lee et al., 22 Apr 2026) | MDKP with slack-free custom penalty; 4000 shots | Custom penalty + CVaR beat slack-based QUBO on all 12 instances |
| (Agliardi et al., 19 Aug 2025) | 109-qubit ETF construction on IBM Heron | Best post-processed relative gap 12 |
| (Haghighi, 8 Jun 2026) | 150-qubit dynamic portfolio optimization | Adaptive CVaR and hardware-native layouts improved practical performance; no quantum advantage over SCIP |
6. Limitations, misconceptions, and evaluation methodology
A recurring misconception is that smaller 13 is unconditionally superior. The literature does not support that conclusion. Small 14 sharpens focus on elite samples, but it also raises estimator variance, introduces a soft cap on rewarded overlap, and can destabilize derivative-free optimization if shot counts are not increased accordingly (Barkoutsos et al., 2019). A later evaluation framework, applied to a 16-qubit QUBO using RealAmplitudes and COBYLA, found that 15 dominated lower 16 values in feasibility and quality at moderate-to-high shot counts, with diminishing gains beyond about 20,000 shots for most 17 (Mamedaliev et al., 21 Jan 2026). Another study likewise reported that fixed 18 without shot scaling performs poorly in dynamic portfolio optimization (Haghighi, 8 Jun 2026). This suggests that the effective risk level depends on the interaction among problem structure, ansatz, optimizer, and sampling budget.
A second misconception is that CVaR-VQA is defined only for Ising/QUBO expectation estimation. Several newer formulations use the quantum circuit purely as a sampler and compute a nonlinear classical loss on each observed bit string. This is how hinge-penalty ETF construction, step-penalty knapsack optimization, and dynamic multi-period portfolio objectives avoid explicit slack-variable Hamiltonians or costly exact expectation computation for nonlinear penalties (Agliardi et al., 19 Aug 2025). The trade-off is that qubit savings and modeling flexibility come at the price of heavier shot budgets and explicit classical post-processing.
Because sampling-based VQAs are stochastic at multiple levels, evaluation has itself become a research topic. A dedicated framework introduced three complementary metrics—feasibility, quality, and reproducibility—and a quality diagram that visualizes the trade-off between success probability and resource consumption. Reproducibility is quantified by Shannon entropy over binned outcome distributions, and algorithm selection is framed as a thresholded decision rule under resource constraints (Mamedaliev et al., 21 Jan 2026). This perspective treats CVaR-VQAs not as single trajectories but as distributions over performance outcomes.
The main technical limitations remain measurement complexity, non-smoothness, hardware noise, and ansatz reachability. The empirical CVaR objective depends on order statistics and can be discontinuous under finite sampling; exact parameter-shift does not apply directly to the hard-truncated estimator, which is one reason smooth surrogates such as QTL and EVaR have been proposed (Qiu et al., 4 May 2026). Device noise and readout error distort energy histograms and quantiles, making careful calibration important (Kolotouros et al., 2021). More fundamentally, if the ansatz cannot reach the ground-state subspace, CVaR preserves the true optimum as an objective target but cannot make it attainable (Qiu et al., 4 May 2026).
Sampling-based CVaR-VQA is therefore best understood not as a single algorithmic recipe but as a family of tail-sensitive hybrid optimization procedures. Its defining idea is stable across variants: use finite-shot samples from a parametrized quantum circuit, evaluate a classical optimization loss on those samples, and train on the lower tail rather than the full mean. The subsequent literature shows that this family encompasses fixed-risk CVaR, evolving risk schedules, smooth entropic surrogates, custom penalties for constrained optimization, and hardware-aware large-qubit workflows, with performance gains that are substantial in many reported settings but are tightly conditioned by shot allocation, optimizer design, and circuit expressivity (Barkoutsos et al., 2019).