Sample-Based Quantum Diagonalization (SQD)
- SQD is a quantum–classical hybrid framework that projects a many-body Hamiltonian onto a reduced, sampled subspace to estimate ground and excited state energies efficiently.
- It combines quantum hardware sampling with classical diagonalization and configuration recovery protocols to mitigate noise and reduce resource overhead.
- SQD has been applied in quantum chemistry and materials science, offering competitive accuracy and scalability for electronic structure and correlated lattice problems.
Sample-Based Quantum Diagonalization (SQD) is a quantum–classical hybrid algorithmic framework for estimating ground and excited state energies of many-body quantum systems, particularly relevant for electronic structure and correlated lattice problems. SQD leverages quantum hardware to sample many-body wavefunction configurations and then relies on classical processing—usually via subspace diagonalization—to extract physical observables within a reduced, physically motivated subspace. By replacing full Hilbert space diagonalization with the diagonalization of a carefully selected subspace, SQD significantly reduces the quantum and classical resource requirements required for accurate quantum simulation on near-term quantum hardware.
1. Algorithmic Structure and Projection Formalism
SQD operates by projecting the Hamiltonian , typically defined in a second-quantized form, onto a subspace constructed from a sampled set of configurations (usually Slater determinants encoded as computational basis states or bitstrings):
The electronic or lattice wavefunction is then expanded as:
Solving the projected eigenvalue equation
or, when non-orthogonality is present, the generalized form
returns low-energy eigenstates and associated energies with accuracy determined by the representability of the selected subspace .
Generation of the basis is accomplished via quantum circuit sampling of a trial state , often derived from hardware-efficient ansätze such as the Local Unitary Cluster Jastrow (LUCJ) or other parametrizations informed by coupled-cluster amplitudes. Due to device noise or ansatz limitations, raw samples may violate required quantum numbers (particle number, spin projection). A self-consistent configuration recovery (S-CORE) protocol is applied iteratively: sampled bitstrings are filtered and repaired to match target quantum numbers, and occupation numbers are updated across Diagonalization–Sampling cycles to ensure consistency.
2. Subspace Construction, Sampling, and Configuration Recovery
Subspace construction in SQD is fundamentally stochastic. Multiple quantum measurement outcomes are gathered, forming sets which are grouped into batches . Each batch yields a projected Hamiltonian, which is diagonalized classically (often by the Davidson algorithm). Configuration recovery proceeds by matching empirical occupation numbers with physical ones, typically employing a soft-constraint method for approximate symmetry restoration (e.g., through a penalty term in open-shell problems).
The measurement bottleneck is governed by the quantum state's amplitude distribution over determinants. This is analytically mapped to a coupon-collector problem: estimating the expected number of measurement shots required to sample all important determinants (i.e., all bitstrings with for some threshold) is a combinatorial problem controlled by the state's amplitude skew. For determinants, the expected number is
where is the probability for each determinant. In the uniform amplitude case, , but for distributions dominated by a few configurations (e.g., strongly single-reference), measurement overhead grows sharply (Reinholdt et al., 13 Jan 2025, Khamadja et al., 25 Sep 2025).
Redundancy in sampled configurations—i.e., repeatedly drawing already-seen determinants—introduces diminishing returns as the number of samples increases, limiting the efficiency of the method for high-accuracy, strongly correlated targets.
3. Extensions: Krylov, Randomization, and Symmetry Adaptation
Subspace design has significant impact on the convergence and feasibility of SQD. Beyond naïve sampling, the algorithm can be extended with:
- Sample-Based Krylov Quantum Diagonalization (SKQD): Instead of sampling only the trial state, one generates a Krylov subspace by applying short-time evolution:
and samples each to enrich the configuration set. SKQD comes with provable polynomial-time convergence so long as the ground state is "concentrated" (i.e., amplitude weight lies on bitstrings with probabilities exceeding ) (Yu et al., 16 Jan 2025, Piccinelli et al., 4 Aug 2025).
- qDRIFT Randomized Compilation (SqDRIFT): To address the depth barrier of SKQD for complex Hamiltonians, the qDRIFT protocol constructs approximate time-evolution unitaries as random sequences:
with drawn according to Hamiltonian coefficients . This reduces expected circuit depth while controlling the approximation error . The union of outputs from many such randomized circuits forms the sampling pool for subspace diagonalization, preserving the convergence guarantees of SKQD under reasonable hardware constraints (Piccinelli et al., 4 Aug 2025).
- Symmetry-Adapted SQD: The subspace can be symmetrized to respect lattice, point-group, or particle-number symmetries by expanding each sampled configuration to its orbit under group actions:
with obtained from group representation transformations and representation matrix . Sparsity of (as in the momentum basis) ensures that the symmetrized subspace does not grow intractably and can dramatically improve energy convergence and the ability to capture key order parameters (e.g., superconducting correlations in ladder models) (Nogaki et al., 1 May 2025).
4. Applications and Performance in Quantum Chemistry and Materials Science
SQD has been applied across a range of molecular and materials problems:
- Molecules: SQD has been used to compute potential energy surfaces (PES) and excited states of dimers, open- and closed-shell molecules (e.g., CH), and challenging multi-reference systems. In each case, energies are recovered within a few milliHartree (often kcal/mol) of classical CASCI or HCI when the configuration pool is adequately sampled (Kaliakin et al., 11 Oct 2024, Barison et al., 1 Nov 2024, Liepuoniute et al., 7 Nov 2024).
- Materials: The method remains competitive for periodic systems and band gap computation by combining DFT-projected electronic structures with quantum circuit sampling, successfully demonstrating scalability up to nearly 50 qubits (Duriez et al., 13 Mar 2025).
- Embedded/Fragment Approaches: SQD has been incorporated with density-matrix embedding theory (DMET) to isolate active regions in large molecules, allowing efficient simulation of spatially extended systems—e.g., hydrogen atom rings and cyclohexane conformers—with resource counts compatible with current hardware (Shajan et al., 15 Nov 2024).
- Alchemical Free Energy Calculations: SQD-corrected configuration interaction is integrated within hybrid workflows for hydration free energy predictions, achieving FCI-quality corrections to MM-based paths (Bazayeva et al., 25 Jun 2025).
Benchmark results across these domains indicate that performance is dictated primarily by the completeness of the sampled subspace—which is ultimately limited by the stochastic sampling bottleneck discussed above. Configurations contributing small weight can dominate the correlation energy, necessitating advanced state-preparation or sampling techniques as system size and complexity increase.
5. Advantages, Limitations, and Hybrid Extensions
Advantages:
- Measurement Efficiency: SQD avoids the high cost of expectation value measurement over noncommuting Pauli operators required by VQE, focusing on stochastic projective measurements, with the quantum hardware acting purely as a sampling oracle.
- Robustness to Device Noise: Sampling is often more robust than variational optimization to hardware imperfections, and noise may inadvertently aid in discovering rare (important) configurations (Khamadja et al., 25 Sep 2025).
- Classical/Quantum Separation: All nontrivial diagonalization is performed classically in a reduced subspace, permitting classical certification, error mitigation (variance extrapolation), and further stochastic hybridization (e.g., SQD-generated trial functions for phaseless AFQMC (Danilov et al., 7 Mar 2025)).
Limitations:
- Sampling Inefficiency: The determinant discovery process (i.e., recovering all determinants with significant amplitude) is an instance of a coupon collector problem with nonuniform probabilities—highly skewed amplitude distributions dramatically increase measurement cost. There exists a trade-off between subspace size, compactness, and accuracy, quantified analytically (Reinholdt et al., 13 Jan 2025, Khamadja et al., 25 Sep 2025).
- Compactness vs. Discovery Rate Trade-off: Attempts to bias sampling toward less-discovered determinants (by adjusting probability distributions, e.g., flattening via ) yield larger, less compact configuration expansions, increasing the classical post-processing cost. Keeping CI expansions compact slows new determinant discovery, impeding accuracy (Reinholdt et al., 13 Jan 2025).
- Scalability: For strongly correlated or large systems, neither hardware nor classical resources currently suffice to guarantee systematic convergence to chemical accuracy via pure SQD.
Hybridizations, such as using truncated SQD expansions as trial wave functions for phaseless auxiliary-field quantum Monte Carlo (ph-AFQMC) with variance extrapolation (Danilov et al., 7 Mar 2025), or combining with entanglement forging for qubit reduction (Smith et al., 11 Aug 2025), provide means to extend SQD's reach while mitigating core bottlenecks.
6. Future Directions and Outlook
The analytic identification of the sampling bottleneck and convergence guarantees via concentration and sparsity assumptions provide a rigorous roadmap for future development. Approaches for mitigating determinant redundancy and improving the coverage of rare but important configurations include:
- Constructing amplitude-flattening ansätze for state preparation.
- Applying importance sampling, active learning, or adaptive measurement allocation to focus on underrepresented determinants.
- Incorporating classical SCI heuristics in hybrid subspace expansion.
- Combining symmetry adaptation and qubit-reduction schemes to minimize hardware requirements (Khamadja et al., 25 Sep 2025).
A plausible implication is that as error-mitigation protocols, circuit compilation strategies (qDRIFT), and hybrid quantum–classical workflows mature, SQD and its extensions will establish the foundation for quantum advantage in correlated electron simulations—conditional on efficient sampling of all physically relevant configurations and the effective management of redundancy.
7. Comparative Table: SQD vs. Classical and Variational Quantum Approaches
Aspect | SQD/Core Approach | VQE/qEOM | Classical SCI |
---|---|---|---|
Quantum Role | Sampling (determinants) | Variational optimization | Deterministic selection |
Measurement Cost | Bitstring sampling | Many Pauli bases | None |
Bottleneck | Sampling redundancy, coverage | Optimization, measurement | Heuristic screening |
Symmetry Restoration | Via configuration recovery/SCORE | Enforced by ansatz | Explicit criteria |
Scalability | Limited by sampling, recovery | Limited by measurement/gradient | Set by heuristics, memory |
Ground/Excited States | General, via extended subspace | Requires qEOM or expansion | Selected by design |
SQD's main advantage—minimal measurement overhead—comes with the cost of scaling bottlenecks inherent to stochastic discovery and the classical diagonalization step. Its continued utility will depend on further algorithmic innovations addressing the interlocking challenges of sampling efficiency and subspace completeness.