Sample Fidelity Overview
- Sample fidelity is a metric that quantifies the similarity between observed or generated data and a target distribution or quantum state using information-theoretic, geometric, or statistical measures.
- It enables rigorous evaluation of measurement protocols and generative models by comparing statistical distances such as Kullback-Leibler, Wasserstein, or mutual information metrics.
- The framework supports advanced protocols in quantum state certification, synthetic data evaluation, and adaptive sampling, thereby informing experimental design and model optimization.
Sample fidelity quantifies the degree to which observed samples, generated data, or physical measurements match a desired distribution, quantum state, or measurement objective. Across statistics, machine learning, and quantum information, sample fidelity frameworks provide principled, quantitative metrics for evaluating representativeness, information transfer, or closeness—enabling optimization, certification, and comparison of measurement or generative protocols.
1. General Definitions and Formalisms
Sample fidelity appears in diverse forms, but core definitions involve information-theoretic, geometric, or statistical similarity measures. In physical measurement, fidelity is defined as the Shannon mutual information between unknown physical quantities and measured outcomes: where is the prior on the quantity of interest and is the conditional probability of outcome given (Bahder, 2011). This yields a comprehensive figure of merit that incorporates all uncertainty and possible outcomes.
In quantum information, Uhlmann fidelity between two quantum states , is
for mixed states, or for pure states , 0 (Wang, 24 Jun 2026, Bina et al., 2013). For quantum operations (channels), sample fidelity is 1, with averaging over all input states yielding the operation fidelity (Chełstowski et al., 2022).
In statistical settings such as synthetic data, sample fidelity is the average distance between low-dimensional marginals of synthetic and real data distributions, typically measured via total variation, Kullback-Leibler, or Wasserstein distances (Platzer et al., 2021).
2. Sample Complexity and Optimal Estimation
The sample complexity of estimating fidelity to a specified precision is a central consideration, particularly for quantum state certification and generative models. For estimation of fidelity between an unknown quantum state and a rank-2 reference state to error 3, the optimal sample complexity is 4, with a lower bound of 5. In the case where both the unknown and reference states are rank 6, sample complexity increases to 7 (Wang, 24 Jun 2026).
For pure-state quantum fidelity, the query-to-sample conversion (samplizer) yields a tight upper and lower bound of 8, outperforming traditional SWAP-test approaches (Wang et al., 2024). Generic protocols for estimating overlaps between functional representations of states (e.g., via Wigner or Pauli characteristic functions) achieve
9
where 0 is an 1 norm of the target function—serving as a complexity parameter for both discrete variable and continuous variable systems (Fawzi et al., 2024). Lower bounds on sample complexity for quantum tomography under fidelity constraints are 2 for rank-3 4-dimensional mixed states (Yuen, 2022).
3. Application Domains and Metrics
Sample fidelity serves a broad range of purposes:
- Measurement Quality: As a mutual information measure, sample fidelity quantifies the average information gained per trial, accommodating prior knowledge as a natural part of the Bayesian framework. It enables direct numerical comparison between measurement protocols (e.g., quantum vs classical interferometers), and guides selection of schemes maximizing expected information per sample (Bahder, 2011).
- Quantum State and Channel Certification: Fidelity is a primary tool in verifying whether a prepared or transmitted quantum state or channel output matches its intended form within a specified tolerance. Sample complexity and tomography techniques are tightly linked to the achievable fidelity (Wang, 24 Jun 2026, Yuen, 2022, Fawzi et al., 2024, Chełstowski et al., 2022).
- Generative Models and Synthetic Data: Fidelity is used as a summary metric for how well generated samples (e.g., from GANs or synthetic tabular data models) replicate the statistical structure of a target dataset. Evaluation is often performed by averaging statistical distances over all 1-way, 2-way, or higher-order attribute marginals between synthetic and real samples (Platzer et al., 2021).
- GAN and Deep Generator Evaluation: In image synthesis, sample fidelity is measured with perceptual or proxy metrics (e.g., Fréchet Inception Distance), feature-space discriminability, or downstream classification consistency, such as the DBN+RBM or ES score in the GM metric (GM et al., 2021, Zeng et al., 2020). Wavelet filtering and noise homogenization can directly improve sample fidelity on these metrics.
- Quantum Circuit Sampling: In random circuit sampling, the cross-entropy benchmarking (XEB) fidelity quantifies how well sample outputs correspond to the ideal quantum distribution, providing a rigorous operational bound for both theoretical and experimental sampling (Kalachev et al., 2021).
4. Computation and Optimization of Sample Fidelity
The computation of sample fidelity depends on context:
- Direct Optimization: Given a statistical or cumulative model 5 and sample 6, the sample fidelity statistic used in maximum fidelity estimation (MFE) is
7
which is bounded, coordinate-independent, and can be maximized to estimate parameters or assess fit (Kinkhabwala, 2013).
- Marginal Distance Averaging: For synthetic data, the average total variation distance between all 8-way marginals of real and generated datasets forms the fidelity metric
9
with reference to a held-out real sample as baseline (Platzer et al., 2021).
- Quantum Procedures: In quantum systems, sample-efficient direct fidelity estimation (DFE) is possible for stabilizer or low-magic targets, but is exponentially costly for generic states. Magic-reduction protocols (e.g., phase-stripping) can drastically reduce overhead for phase-close or nearly stabilizer states by transforming the basis and converting exponential-depth diagonal gates into efficient measurements plus classical processing (Park et al., 10 Feb 2026).
- Operation Fidelity for Channels: The probability distribution 0 of sample fidelity is governed by the joint numerical range of the Hermitian and anti-Hermitian parts of the Kraus operators, supporting explicit geometric and spectral integration for key classes (unitary channels, Schur channels, mixed-unitary channels) (Chełstowski et al., 2022).
5. Limitations, Caveats, and Best Practices
Fidelity is a powerful, but not universally sufficient, indicator of physical or statistical resource similarity:
- Resource Blindness: High fidelity in quantum states (1) does not guarantee equivalence in entanglement, nonclassicality, or discord. States with similar global fidelities may differ in essential properties—including being separable vs. entangled, or classical vs. nonclassical (Bina et al., 2013).
- Diagnostic Supplementation: For robust resource assessment, fidelity should be complemented by constraints (e.g., on energy, photon number), resource-specific entanglement or nonclassicality metrics, or explicit full-state tomography. When fidelity is used as a summary statistic, it must be interpreted in light of the restricted family of states under consideration and possible noise or model restrictions.
- Normalization and Baselines: In synthetic data, fidelity to holdout real samples provides a crucial baseline to distinguish genuine generative modeling from trivial memorization or overfitting (Platzer et al., 2021).
- Optimal Design: In experimental or algorithmic contexts, maximizing sample fidelity per resource cost—whether by tuning input energies, filtering noise, or balancing high-fidelity and low-fidelity evaluations—delivers practical gains in estimation, detection, and optimization (Bahder, 2011, Gong et al., 2022, Chen et al., 2023).
6. Practical Recipes and Impact
Sample fidelity metrics have driven the design of adaptive multi-fidelity sampling frameworks, optimal quantum tomography protocols, and fidelity-regularized model selection in generative AI. Across these domains:
- Adaptive Acquisition: Information gain per cost, integrated with surrogate Gaussian-process models, judiciously allocates queries across fidelity levels in surrogate-assisted Bayesian optimization and safety analysis, accelerating convergence and improving robustness (Gong et al., 2022, Chen et al., 2023).
- Quantum Certification: Protocols can certify state-preparation or channel implementation to within specified fidelity gaps, with sample complexity guarantees and operational acceptance thresholds (Wang, 24 Jun 2026, Fawzi et al., 2024).
- Generative Model Evaluation: Latent-space and classifier-based sample fidelity metrics in GM-score or related evaluation frameworks align model selection with both statistical and perceptual discriminability (GM et al., 2021).
- Tensor Network Sampling: Classical random-circuit simulators rigorously bound sample fidelity (e.g., XEB fidelity) by explicit contraction strategies, supporting computational claims in the quantum supremacy context (Kalachev et al., 2021).
In summary, sample fidelity functions as a foundational concept uniting information-theoretic, geometric, and operational criteria for judging the closeness of samples to their desired targets, underpinning rigorous methodology in quantum information, statistics, experimental design, and generative modeling.