- The paper demonstrates that both cloning and learning stabilizer states require Θ(n) samples, showing no structural advantage in cloning over learning.
- It employs advanced representation theory and coding-theoretic techniques, including character measurements and random purification, to derive tight lower bounds.
- The findings have practical implications for quantum cryptography and error correction by reinforcing the security and limits of cloning structured quantum states.
Cloning is as Hard as Learning for Stabilizer States: An Authoritative Synthesis
Introduction and Problem Context
The fundamental No-Cloning theorem asserts that non-orthogonal quantum states cannot be cloned, and even approximate cloning of arbitrary pure states demands resources commensurate with full quantum state tomography. However, much of modern quantum learning theory, both for classical and quantum tasks, focuses not on arbitrary unknown states, but rather on structured families—such as stabilizer states—where computational or sample advantages over generic tomography can sometimes be exploited. This motivates a central theoretical question: For structured quantum states, can approximate cloning ever be significantly easier (in terms of sample complexity) than learning?
This work resolves the question emphatically for n-qubit stabilizer states: cloning is as hard as learning, up to constant factors. The sample complexity for either task is Θ(n). This result generalizes the classical finding that for certain structured distributions, sample amplification is as hard as learning, and integrates this perspective into the quantum domain via heavy use of modern representation theory and connections to hidden subgroup problems.
Technical Frameworks
The study is grounded in two parallel frameworks:
Structured Sample Amplification (Classical)
Sample amplification asks: given t random samples from an unknown member of a class of probability distributions, can one efficiently generate additional samples that are indistinguishable from fresh samples from the same distribution? For general distributions over a k-element support, amplification can be quadratically easier than learning ([axelrod2019sampleamplificationincreasingdataset]).
Here, the authors focus on structured function classes (e.g., parities, Reed-Muller codes). They formalize amplification for distribution classes of the form {(Un,f)}f∈F, where Un is the uniform distribution over n bits and f is drawn from a function class F.
Structured Quantum Cloning
Quantum cloning is formulated as: given t i.i.d. copies of a quantum state from a structured set (such as stabilizer states), can one generate Θ(n)0 copies approximating true i.i.d. samples, with total trace distance at most Θ(n)1? The main structural focus is on stabilizer states, but the results extend to representations associated with Abelian State Hidden Subgroup Problems (Abelian StateHSPs).
Main Results
1. Sample Amplification Lower Bounds
For any Boolean function class Θ(n)2, let Θ(n)3 be its teaching dimension. If Θ(n)4, then structured sample amplification necessarily incurs error proportional to the probability that Θ(n)5 samples uniquely specify Θ(n)6. For Θ(n)7-bit parity functions, sample amplification from Θ(n)8 samples incurs at least Θ(n)9 total variation distance—nearly the same as the sample complexity for learning parities, which is t0. Thus, amplification is no easier than learning for parities.
Succinctly:
| Function Class |
Learning Complexity |
Amplification Error Lower Bound |
| t1-bit parities |
t2 |
t3 if t4 |
| t5-dim linear subspace |
t6 |
In terms of code-theoretic erasure probability |
The result is generalized to any linear subspace of Boolean functions, using properties of the related linear code and its dual, specifically the behavior under random erasures.
2. Quantum Cloning Lower Bounds
Stabilizer States
For the class of t7-qubit stabilizer states, the optimal sample complexity to clone with small constant error (t8) is t9. The result is established by considering an Abelian StateHSP associated with stabilizer states: k0-independent generators are required to identify the hidden stabilizer group, and character measurements (Bell sampling) are shown to be optimal for extracting subgroup information. Thus, any cloning map capable of producing fresh copies from k1 input states would violate fundamental linear independence and information-theoretic bounds.
General Abelian StateHSPs
A general framework for cloning lower bounds is established: for an Abelian group k2 and a hidden subgroup k3, one cannot clone the hiding states better than one can learn—sample complexity lower bounds are linked to the teaching dimension or generator rank of the dual group k4 and the code-theoretic erasure resilience.
3. Technical Machinery
- Representation Theory: The proofs leverage block-diagonalization under the Clifford group (for stabilizer states) and the use of character POVMs (optimal projective measurements corresponding to irreducible representations).
- Random Purification Channel: To extend from mixed hiding states to pure stabilizer states, a structured random purification channel is introduced, generalizing recent developments ([tang2025conjugatequerieshelp], [walter2025randompurificationchannelarbitrary]).
- Coding Theory: The linear independence thresholds for sample amplification directly relate to the code distance and random erasure recovery probabilities for the associated linear function code of k5.
Implications and Connections
Theoretical Implications
- No Structural Separation: Even for highly structured and efficiently learnable quantum states, like stabilizers, cloning cannot be performed with fewer samples than learning. There is no sample-complexity separation available via structure for these prominent classes, contrasting sharply with the situation for unstructured distributions or quantum states.
- Coding-Theoretic Reduction: The analysis unifies quantum learning, classical sample amplification, and code theory. This provides new directions for both quantum tomography and classical learning theory, especially regarding combinatorial dimensions (teaching dimension, code distance) and their operational consequences.
- Optimality of Measurements: Character measurements (e.g., Bell sampling for stabilizer states) are shown to be information-theoretically optimal for extracting latent subgroup symmetry information, not just sufficient.
Practical Implications
- Limits of Efficient Cloning: This result systematically limits any approach to machine-learning-inspired quantum data amplification for structured classes relevant to fault-tolerant quantum computation and error correction (stabilizer codes).
- Cryptographic Security: The findings reinforce the security assumptions underlying schemes like quantum money or quantum authentication protocols based on stabilizers and similar codes—difficulty of cloning persists even under symmetry constraints.
Contrasting Amplification and Learning for Unstructured Classes
While structured classes like parities offer no advantage, for the completely unstructured class of all Boolean functions, sample amplification is quadratically easier than learning, matching results for arbitrary classical distributions ([axelrod2019sampleamplificationincreasingdataset]). This contrast is sharp, with structure imposing an informational bottleneck.
Open Questions and Future Directions
Several avenues remain for extending these results:
- Beyond Stabilizers: Does cloning remain as hard as learning for other classes (e.g., hypergraph states, tensor network states, or outputs of shallow circuits)? For instance, for degree-k6 polynomials (Reed-Muller codes), does the erasure threshold similarly dictate cloning hardness?
- Restricted Distinguishers: What changes if distinguishers are computationally bounded? This has relevance for cryptography and pseudorandomness.
- Combinatorial Dimensions: Can one characterize sample amplification and quantum cloning complexity for arbitrary function/state classes via suitable combinatorial dimensions, analogous to VC or teaching dimension?
- Average-Case Hardness: The results here are worst-case; average-case bounds remain to be systematically explored.
- Amplification with Access Restrictions: How does the sample complexity change if ancilla or entangling operations are restricted?
Conclusion
This paper closes a central question in quantum learning theory for a broad and important class of structured quantum states, demonstrating that no sample-complexity separation exists between learning and cloning for stabilizer states. The connection between classical sample amplification, quantum cloning, and linear/coding-theoretic structures is rigorously established, and a suite of technical tools—representation theory, random purification, and coding theory—are marshaled to provide tight, nonasymptotic lower bounds. The implications are substantial for both foundational quantum information and for practical tasks in quantum cryptography and tomography, and they outline a template for further study of quantum machine learning under symmetry and structure.