Quantum Split Learning Overview
- Quantum Split Learning is a framework that partitions quantum machine learning and optimization tasks into distributed or hybrid quantum-classical executions.
- It employs methodologies such as de Finetti-based channel splitting, hybrid quantum neural networks, and circuit-level partitioning to address scalability, privacy, and noise challenges.
- Practical implementations demonstrate measurable accuracy gains, runtime reductions, and improved fidelity in applications like image classification and quantum optimization.
Quantum split learning encompasses a class of frameworks and strategies for partitioning quantum machine learning tasks, quantum computations, or quantum optimization jobs so that they can be executed in distributed, staged, or hybrid classical–quantum fashion. The concept is unified by the principle of dividing the quantum learning process, whether at the level of data, model layers, quantum circuits, or job iterations, across hardware resources or system boundaries to maximize practical performance, scalability, and security. The following sections synthesize key methodologies and implications from the literature, covering de Finetti-based channel splitting, distributed and hybrid split learning with quantum neural networks, circuit-level splitting to resolve barren plateaus, split-parallel architectures exploiting physical symmetries, robust Markov modeling via environmental splitting, and fidelity-optimized job scheduling on heterogeneous quantum hardware.
1. Foundational Theory: de Finetti Channel Splitting and Inductive Quantum Learning
In inductive quantum supervised learning, the impossibility of quantum state cloning limits the exact splitting of a learning protocol into independent training and application phases. Nevertheless, under a non-signalling condition (outputs on individual test instances are independent of the other test inputs), a rigorous quantum de Finetti theorem for channels establishes that any non-signalling quantum learning protocol can be approximated, with error vanishing asymptotically in the number of test instances, by a measure-and-prepare protocol. This protocol first measures the training data with a POVM $\{M_\mu\}$, then applies a learned channel $\Phi_\mu$ independently to each of the $k$ test instances:

$$\Lambda(\rho_{\mathrm{train}} \otimes \rho_{\mathrm{test}}) \;\approx\; \sum_{\mu} \operatorname{Tr}\!\left[M_\mu\, \rho_{\mathrm{train}}\right] \Phi_\mu^{\otimes k}(\rho_{\mathrm{test}}).$$
This splitting, realized via local operations and classical communication (LOCC), conceptually parallels the classical inductive paradigm and endows quantum split learning with structural risk minimization and sample complexity properties from computational learning theory. The main limitation is that the split approximation only holds asymptotically, implying loss of quantum information at the training measurement step but enabling practical decomposability and hybrid quantum-classical analysis (Monràs et al., 2016).
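To make the LOCC structure concrete, the following numpy sketch simulates a measure-and-prepare protocol on a single-qubit toy problem. The POVM, the two learned channels, and all states are illustrative assumptions, not taken from Monràs et al.:

```python
import numpy as np

def apply_channel(kraus_ops, rho):
    """Apply a CPTP map given by its Kraus operators."""
    return sum(K @ rho @ K.conj().T for K in kraus_ops)

def measure_and_prepare(rho_train, povm, channels, test_states):
    # Probability of each training-measurement outcome mu: Tr[M_mu rho_train].
    probs = [np.trace(M @ rho_train).real for M in povm]
    # Outcome-averaged learned channel, applied to each test instance
    # independently (no joint operation across test instances).
    return [
        sum(p * apply_channel(ks, rho) for p, ks in zip(probs, channels))
        for rho in test_states
    ]

# Toy instance: computational-basis measurement of one training qubit;
# outcome 0 selects the identity channel, outcome 1 a bit-flip channel.
P0, P1 = np.diag([1.0, 0.0]), np.diag([0.0, 1.0])
I = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
rho_train = np.diag([0.75, 0.25])
test_states = [np.diag([1.0, 0.0])] * 3  # k = 3 identical test instances
for rho_out in measure_and_prepare(rho_train, [P0, P1], [[I], [X]], test_states):
    print(np.round(rho_out, 3))
```

The key structural point is visible in the code: the training register is touched only once, by a measurement, and every test instance is processed by the same outcome-conditioned channel with no joint quantum operation across instances.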
2. Hybrid, Distributed, and Federated Quantum Split Learning Architectures
Quantum split learning methods are formalized in architectures where quantum machine learning models are divided between clients and servers, between neural network layers, or between quantum and classical processing nodes.
Hybrid Quantum Split Learning (HQSL)
HQSL permits classical clients to collaborate in training with a hybrid quantum server. Clients transmit only intermediate representations (“smashed data”)—processed using novel, qubit-efficient data loading protocols—to the server, which hosts quantum layers. The server-side quantum processing improves classification accuracy (by over 3% on Fashion-MNIST; over 1.5% on Speech Commands) and F1-score compared to purely classical models. Privacy risks due to reconstruction attacks are mitigated by Laplacian noise injection tuned to quantum encoding gates, with leakage quantified using distance correlation between client input and smashed data. Scalability studies with up to 100 clients confirm that privacy and performance trade-offs are manageable via careful noise calibration (Cowlessur et al., 25 Sep 2024).
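As a concrete illustration of the privacy mechanism, the sketch below adds Laplacian noise to simulated smashed data and quantifies leakage with the empirical distance correlation. The client network, noise scales, and data shapes are illustrative assumptions; only the noise-injection and leakage-metric pattern follows the paper's description:

```python
import numpy as np

def noisy_smashed_data(smashed, b, rng):
    """Inject Laplacian noise before transmitting to the quantum server."""
    return smashed + rng.laplace(loc=0.0, scale=b, size=smashed.shape)

def distance_correlation(x, y):
    """Empirical distance correlation between samples x and y (rows = samples)."""
    def centered_dist(z):
        d = np.linalg.norm(z[:, None, :] - z[None, :, :], axis=-1)
        return d - d.mean(0) - d.mean(1)[:, None] + d.mean()
    A, B = centered_dist(x), centered_dist(y)
    dcov2 = (A * B).mean()
    return np.sqrt(dcov2 / np.sqrt((A * A).mean() * (B * B).mean()))

rng = np.random.default_rng(1)
inputs = rng.normal(size=(64, 8))                     # client inputs
smashed = np.tanh(inputs @ rng.normal(size=(8, 4)))   # client-side cut layer
for b in (0.01, 0.1, 1.0):                            # more noise -> lower leakage
    leak = distance_correlation(inputs, noisy_smashed_data(smashed, b, rng))
    print(f"b={b}: dCor={leak:.3f}")
```

Sweeping the noise scale `b` reproduces the trade-off discussed above: larger noise lowers the distance correlation between inputs and smashed data (less leakage) at the cost of a noisier signal reaching the server's quantum layers.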
Quantum Split Neural Network Learning with Cross-Channel Pooling
In QSL, local quantum convolutional neural networks (QCNNs) process private data and send only quantum-derived feature representations to a centralized server, which applies further quantum layers for classification. Cross-channel pooling (C2Pool) aggregates quantum amplitudes obtained from local tomography with classical features to form a pooled representation, reducing both communication cost and privacy leakage. The system demonstrates a 1.64% top-1 accuracy gain over quantum federated learning (QFL) and excellent scalability on MNIST, FashionMNIST, and CIFAR10 (Yun et al., 2022).
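A minimal sketch of the pooling pattern follows; the aggregation scheme here (channel-averaged, renormalized amplitudes concatenated with classical features) is an illustrative assumption rather than the paper's exact C2Pool definition:

```python
import numpy as np

def cross_channel_pool(quantum_amps, classical_feats):
    """quantum_amps: (channels, dim) amplitudes from local tomography;
    classical_feats: (d,) classical feature vector."""
    pooled_q = quantum_amps.mean(axis=0)           # pool across quantum channels
    pooled_q /= np.linalg.norm(pooled_q) + 1e-12   # renormalize the amplitudes
    return np.concatenate([pooled_q, classical_feats])

amps = np.abs(np.random.default_rng(0).normal(size=(3, 8)))  # 3 local QCNN channels
feats = np.array([0.2, 0.5, 0.1, 0.7])
z = cross_channel_pool(amps, feats)
print(z.shape)  # (12,): the pooled representation sent to the server
```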
Distributed Quantum Learning with Co-Management
DQuLearn generalizes quantum split learning to distributed quantum hardware, dividing deep learning subtasks (such as image segments for a quantum CNN) among multiple quantum workers under orchestrated classical management. Dynamic scheduling based on worker resource availability supports multi-tenancy and interleaved client execution, delivering up to a 68.7% reduction in training runtime and a tripling of circuit throughput in experiments on IBM-Q and Google Cloud (Jr. et al., 2023).
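The co-management idea can be illustrated with a simple earliest-available-worker scheduler; worker names, runtimes, and the greedy policy are illustrative assumptions, not DQuLearn's actual scheduling algorithm:

```python
import heapq

def schedule(subtasks, workers):
    """Greedy assignment: each subtask goes to whichever quantum worker
    frees up first. Returns (task, worker, start_time) triples."""
    heap = [(0.0, w) for w in workers]  # min-heap of (time_next_free, worker)
    heapq.heapify(heap)
    plan = []
    for task, runtime in subtasks:
        free_at, worker = heapq.heappop(heap)
        plan.append((task, worker, free_at))
        heapq.heappush(heap, (free_at + runtime, worker))
    return plan

# Hypothetical image segments with estimated circuit runtimes (seconds).
subtasks = [(f"segment_{i}", rt) for i, rt in enumerate([3.0, 1.5, 2.0, 4.0, 1.0])]
workers = ["quantum_worker_A", "quantum_worker_B"]  # hypothetical worker pool
for task, worker, start in schedule(subtasks, workers):
    print(f"{task} -> {worker} at t={start}")
```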
3. Circuit-Level Splitting Strategies and Optimization Implications
Quantum split learning is closely related to circuit-splitting strategies developed to resolve fundamental optimization bottlenecks.
Classical Splitting of Parametrized Quantum Circuits
This technique partitions an $n$-qubit variational ansatz into independent $m$-qubit blocks (with $m \ll n$), each optimized locally. The cost function decomposes as a sum over blocks, $C(\boldsymbol{\theta}) = \sum_i C_i(\boldsymbol{\theta}_i)$, with gradient variances decaying at worst polynomially in $n$ (versus the exponential decay characteristic of barren plateaus in fully entangled circuits). The approach is validated on classical and quantum datasets, showing competitive or superior accuracy in high-dimensional regimes, rapid parallelizable gradient computation, reduced two-qubit gate counts, and robustness against noise. Notably, classical splitting targets gradient trainability rather than privacy or resource aggregation, distinguishing its primary objective from distributed quantum split learning (Tüysüz et al., 2022).
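The decomposition can be demonstrated with single-qubit blocks: the total cost is a sum of block costs, and each parameter-shift derivative touches only its own block, so gradients are computed block-locally and in parallel. The single-RY-rotation-per-block ansatz below is an illustrative simplification:

```python
import numpy as np

def block_cost(theta):
    """Cost of one 1-qubit block: <psi(theta)| Z |psi(theta)> = cos(theta)."""
    psi = np.array([np.cos(theta / 2), np.sin(theta / 2)])  # RY(theta)|0>
    Z = np.diag([1.0, -1.0])
    return psi @ Z @ psi

def total_cost(thetas):
    return sum(block_cost(t) for t in thetas)  # C = sum_i C_i

def parameter_shift_grad(thetas, shift=np.pi / 2):
    """Each partial derivative involves only its own block's circuit."""
    return np.array([
        0.5 * (block_cost(t + shift) - block_cost(t - shift)) for t in thetas
    ])

thetas = np.random.default_rng(0).uniform(0, np.pi, size=6)  # 6 blocks
for _ in range(50):  # plain gradient descent on the block-decomposed cost
    thetas -= 0.2 * parameter_shift_grad(thetas)
print(round(total_cost(thetas), 4))  # approaches -6 (each block -> -1)
```

Because the blocks never entangle, each gradient component depends on a fixed-size subcircuit, which is the mechanism behind the polynomial (rather than exponentially vanishing) gradient variance.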
Split-Parallelizing of Quantum Convolutional Neural Networks (sp-QCNN)
The sp-QCNN architecture leverages known translational symmetry by splitting the circuit at the pooling stage (without discarding qubits) and applying identical unitary transformations to each branch. Measurement of the resulting equivalent observables (e.g., $\langle Z_i \rangle$ and its translates) is parallelized and averaged, reducing statistical variance by a factor of order $n$ and greatly accelerating stochastic gradient optimization. Applied to quantum phase recognition for translationally symmetric cluster Ising model ground states, sp-QCNN achieves classification accuracy comparable to conventional QCNNs but with an order-of-$n$ reduction in statistical error (Chinzei et al., 2023).
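A toy simulation of the measurement-efficiency argument: when $n$ branch observables share the same expectation value, averaging their outcomes shrinks the shot-noise variance by roughly $1/n$. Treating branch outcomes as independent samples is a simplifying assumption, and all values below are simulated, not hardware data:

```python
import numpy as np

rng = np.random.default_rng(0)
n, shots, p_up = 8, 200, 0.7   # n equivalent branches; <Z> = 2*p_up - 1 = 0.4

def estimate(n_branches):
    """Estimate <Z> by averaging over shots and over equivalent branches."""
    outcomes = rng.choice([1, -1], p=[p_up, 1 - p_up], size=(shots, n_branches))
    return outcomes.mean()

single = [estimate(1) for _ in range(2000)]    # conventional: one observable
parallel = [estimate(n) for _ in range(2000)]  # sp-QCNN: n symmetric branches
print(f"var(single)   = {np.var(single):.5f}")
print(f"var(parallel) = {np.var(parallel):.5f}  (~1/{n} of single)")
```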
4. Split Markov Processes and Physical Implementations
The split hidden quantum Markov model (SHQMM) extends HQMMs by partitioning the environment's Hilbert space into multiple subspaces with a hierarchy of conditional density matrices. Kraus operators act on these splits, enabling more expressive modeling of environmental correlations and robustness to initialization. The learning algorithm uses gradient descent on the Stiefel manifold, preserving the complex orthogonality constraints on the Kraus operators. Empirical results demonstrate improvements in the description accuracy (DA) metric (roughly 23% higher than HQMM), scalability in the number of hidden states, and stability under various boundary conditions. Quantum transport systems provide a physical substrate for implementing SHQMM. A plausible implication is that environmental splitting may enhance memory compression and computational expressivity for time-series and quantum data analysis (Li et al., 2023).
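The constraint-preserving update can be sketched as follows: stacking the Kraus operators vertically yields a matrix $K$ with $K^\dagger K = I$ (the completeness constraint), i.e., a point on the complex Stiefel manifold, and a QR-based retraction restores the constraint after each Euclidean gradient step. The quadratic loss below is a placeholder for demonstration, not the SHQMM objective:

```python
import numpy as np

def retract(K):
    """QR retraction: map a perturbed matrix back onto the Stiefel manifold."""
    Q, R = np.linalg.qr(K)
    # Sign-fix the columns via the diagonal of R to make the map well defined.
    return Q * np.sign(np.diag(R))

rng = np.random.default_rng(0)
d, n_kraus = 2, 3  # hidden dimension and number of Kraus operators
K = retract(rng.normal(size=(d * n_kraus, d))
            + 1j * rng.normal(size=(d * n_kraus, d)))

target = np.eye(d * n_kraus, d)  # placeholder target for the demo loss
for _ in range(100):
    grad = K - target             # gradient of 0.5 * ||K - target||^2
    K = retract(K - 0.1 * grad)   # Euclidean step followed by retraction
print(np.allclose(K.conj().T @ K, np.eye(d), atol=1e-8))  # constraint holds
```

The point of the retraction is that every iterate remains a valid set of Kraus operators, so the learned model is a physical quantum channel throughout training.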
5. Job-Split Scheduling for Fidelity and Throughput on Heterogeneous Quantum Hardware
QuSplit introduces job splitting at the quantum optimization level, adaptive to heterogeneous and noisy hardware backends (IBM Quantum, Amazon Braket). Each variational job (e.g., VQE, VQC, QAOA) is divided into a "head" stage executed on high-noise/high-availability backends and a "tail" stage switched to high-fidelity (low-noise) processors. A genetic-algorithm-based scheduler optimizes the mapping by maximizing a combined fitness function of the form

$$\mathrm{Fitness}(\mathrm{Sol}) = \alpha \cdot \mathrm{TH}(\mathrm{Sol}) + \beta \cdot \mathrm{FI}(\mathrm{Sol}),$$

where TH(Sol) is system throughput, FI(Sol) is aggregate fidelity, and $\alpha, \beta$ weight the two objectives. Experiments reveal that job splitting yields higher final fidelity and faster convergence, supported by empirical VQE runs on IBM Strasbourg. Scalability is confirmed for increasing workloads and processor counts. This suggests that "quantum split learning" at the job level is central for resource-efficient, practical quantum computing in the NISQ era (Li et al., 21 Jan 2025).
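A schematic genetic-algorithm scheduler in this spirit is sketched below: each gene is the iteration at which one job migrates from the fast, noisy backend to the high-fidelity one, and fitness combines throughput with fidelity. All rates, weights, and the linear fidelity model are illustrative assumptions, not QuSplit's actual parameters:

```python
import random

N_JOBS, TOTAL_ITERS = 4, 100
FAST_RATE, SLOW_RATE = 4.0, 1.0  # iterations/sec: noisy vs. high-fidelity backend
ALPHA, BETA = 0.5, 0.5           # weights on throughput vs. fidelity

def fitness(split_points):
    runtime = sum(s / FAST_RATE + (TOTAL_ITERS - s) / SLOW_RATE
                  for s in split_points)
    throughput = N_JOBS / runtime
    # More tail iterations on the low-noise backend -> higher final fidelity.
    avg_fidelity = sum(0.6 + 0.4 * (TOTAL_ITERS - s) / TOTAL_ITERS
                       for s in split_points) / N_JOBS
    return ALPHA * throughput + BETA * avg_fidelity

def evolve(pop_size=30, gens=60, rng=random.Random(0)):
    pop = [[rng.randrange(TOTAL_ITERS) for _ in range(N_JOBS)]
           for _ in range(pop_size)]
    for _ in range(gens):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: pop_size // 2]
        children = []
        for _ in range(pop_size - len(survivors)):
            a, b = rng.sample(survivors, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            if rng.random() < 0.3:                            # point mutation
                child[rng.randrange(N_JOBS)] = rng.randrange(TOTAL_ITERS)
            children.append(child)
        pop = survivors + children
    return max(pop, key=fitness)

best = evolve()
print("split points:", best, "fitness:", round(fitness(best), 4))
```

The evolved split points make the throughput-fidelity tension explicit: pushing a split later buys throughput on the fast backend while sacrificing tail iterations on the high-fidelity processor.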
6. Comparative Analysis, Limitations, and Future Directions
Quantum split learning unifies a spectrum of methodologies sharing the objective of decomposing learning or optimization tasks, via data partitioning, layer-wise division, circuit block splitting, split Markov modeling, or staged job execution, across distributed or hybrid systems. The main practical advantages are improvements in scalability, fidelity, privacy, and resource utilization. However, the split structures often entail trade-offs: loss of quantum coherence or information upon measurement (channel splitting), only asymptotic equivalence to ideal global protocols (the de Finetti approximation), and, for circuit splitting, limits to expressivity if blocks are too small. Real-world performance hinges on a careful balance between local computation, classical communication, and hybrid quantum-classical management. Future research will likely continue to refine split learning architectures, investigate privacy metrics such as distance correlation for leakage control, and optimize split ratios in dynamic quantum system environments.
Table: Quantum Split Learning Variants and Core Features
| Variant | Split Dimension | Primary Advantage |
|---|---|---|
| de Finetti Channel Splitting | Protocol phase (train/apply) | Hybrid decomposability, risk analysis |
| Hybrid/Distributed QSL | Data/model layers, device pool | Scalability, privacy, accuracy |
| Circuit/Classical Splitting | Circuit blocks/subcircuits | Gradient trainability, noise robustness |
| sp-QCNN | Circuit branches (symmetry) | Measurement efficiency, accuracy |
| SHQMM | Environmental Hilbert subspaces | Modeling expressivity, robustness |
| QuSplit Job Scheduling | Job stages/iterations | Fidelity-throughput optimization |
A plausible implication is that, as quantum hardware resources diversify and applications scale, quantum split learning—especially at the protocol, algorithm, and hardware orchestration levels—will be foundational for realizing quantum advantage under practical constraints.