q-iPrune: One-Shot Pruning for QNNs
- The paper introduces q-iPrune, a one-shot, structured pruning algorithm that leverages q-deformed Lie groups and quantum geometric redundancy metrics to streamline quantum circuits.
- It employs a noise-calibrated deformation parameter and task-conditioned q-overlap distance to identify and remove algebraically consistent redundant gates with explicit error guarantees.
- Empirical evaluations on classification and VQE tasks demonstrate up to 60% gate reduction with minimal performance loss, ensuring robust optimization under NISQ conditions.
q-iPrune is a one-shot, structured pruning framework for quantum neural networks (QNNs) that leverages -deformed Lie group representations and a task-conditioned quantum geometric redundancy metric. Designed for the noisy intermediate-scale quantum (NISQ) regime, q-iPrune rigorously formulates and eliminates gate-level redundancy by exploiting both the algebraic structure of -groups and the operational similarities of gates on task-relevant state ensembles. It provides explicit, task-conditioned error guarantees, polynomial computational complexity, and integrates a noise-adaptive deformation parameter, distinguishing it from heuristic or gradient-based alternatives (Shao et al., 30 Dec 2025).
1. Algebraic Structure: -Deformation and Hardware Adaptation
q-iPrune replaces the canonical Lie group with its Drinfeld–Jimbo -deformation, denoted . The deformation is controlled by a continuous parameter , smoothly interpolating between the fully commutative limit () and the conventional non-commutative algebra (). The core components are:
- Deformation Function: with ; as , and standard is recovered.
- -Lie Algebra: Generators satisfy , , , with .
- Noise-Adaptive Scaling: Scaled generators yield commutators , capturing the decoherence-driven commutative contraction as .
- Gate Parametrization: Gate operators are constructed as , using the -exponential map.
Two-qubit gates (e.g., CNOT) are -deformed via the Hopf coproduct, resulting in unitary -generalizations such as . Hardware noise is modeled through , allowing the algebra to adapt to physical device imperfections.
2. Redundancy Detection via -Subgroups
Redundancy identification in q-iPrune is restricted to "algebraically consistent" -subgroups. Given the full gate multiset , q-iPrune partitions into disjoint subsets (, ) where each is closed under (approximate) composition and inversion within or as appropriate. Within each subgroup, a single representative gate is chosen (commonly the medoid under the redundancy metric). All comparisons and redundancy assessments are confined to the corresponding subgroup, ensuring that any gate replacement preserves the local group-theoretic structure of the quantum circuit.
3. Task-Conditioned -Overlap Distance
The operational similarity of gates is quantified using the task-conditioned -overlap distance, defined on a finite ensemble (e.g., data encodings or VQE intermediates). The -inner product is introduced: with and . This induces the norm .
The task-conditioned -overlap distance for compiled unitaries is
This quantity measures the average -weighted angular deviation of and on the ensemble. A gate is classified as -redundant with respect to if . This redundancy implies a guaranteed bound on expectation shifts of any observable :
4. One-Shot Structured Pruning Algorithm
q-iPrune performs a single traversal of each -subgroup, comparing all members to the designated reference gate. Gates within the -redundancy threshold are removed; those exceeding it are retained. The algorithm is as follows:
- Compute the redundancy threshold: for a given task deviation .
- Initialize the set of kept gates as empty.
- For each -subgroup :
- Select (the medoid under ).
- Add to .
- For each :
- Compute .
- Keep if ; otherwise, discard.
- Return .
There are no iterative retraining or gradient-based updates; each gate is processed once. The pruning is thus “one-shot” and structured, reflecting only algebraically and operationally justified redundancy.
5. Rigorous Theoretical Guarantees
Three main guarantees are established for q-iPrune:
- Completeness of Redundancy Pruning: Only gates meeting the -redundancy criterion are removed. Gates with are always kept (Theorem 4.1).
- Circuit-Level Functional Bound: Replacing gates by reference representatives, the trace distance between the original and pruned circuit outputs is bounded as
with analogous bounds for observable drift (Theorem 4.2).
- Polynomial Computational Complexity: If each computation costs work, the overall pruning cost is , and medoid selection by all-pairs distance is . There is no exponential scaling with Hilbert space size (Theorem 4.3).
These structural guarantees imply strict control over functional degradation and operational feasibility in the NISQ context.
6. Noise-Calibrated Deformation Parameter
The parameter modulates two aspects:
- Non-commutativity: , interpolating between fully commutative and standard quantum regimes.
- Redundancy Thresholds: Because affects and hence in the -inner product, smaller (corresponding to higher physical noise) typically increases the spectral bound and reduces the allowed . This results in more conservative pruning under high noise.
In practical scenarios, is calibrated to match device decoherence characteristics, such as via randomized benchmarking.
7. Empirical Performance and Applicability
q-iPrune was validated on standard QNN benchmarks, including:
- Classification: 8 qubit, depth-12 circuits for MNIST "4 vs 9", Fashion-MNIST "Sandal vs Boot", and synthetic Bars-and-Stripes, with up to 480 gates.
- VQE: 4-qubit transverse-field Ising Model circuits, 240 gates.
Key results (with and ):
| Task | Replacement % | Base Metric | Pruned Metric | Drop |
|---|---|---|---|---|
| Classification | 60% | 72.77% acc. | 72.90% acc. | −0.13% |
| TFIM VQE | 60% | 0.3976 energy | 0.3970 energy |
Higher noise (larger ) or tolerance () yields less redundancy and larger (but still bounded) accuracy degradation. In all cases, the experimental accuracy and fidelity drops were well below the theoretical bounds (which are conservative and may be clipped at 100%).
q-iPrune thus delivers substantial circuit compression while certifying retention of task-relevant functionality, with robustness to hardware imperfections via the deformation parameter (Shao et al., 30 Dec 2025).