Quantum Embeddings: Theory & Applications
- Quantum embeddings are nonlinear mappings from classical data to quantum state space using trainable circuits for enhanced feature representation.
- They employ specific circuit layers such as Ry rotations and entangling gates to compute quantum kernels and facilitate quantum metric learning.
- Empirical results show that QES-optimized architectures achieve competitive accuracy with fewer parameters and shallower circuit depths compared to classical models.
A quantum embedding is a nonlinear (often trainable) map from classical input space into a quantum (Hilbert) space, realized via a parameterized quantum circuit, with the goal of exploiting the representational and geometric power of quantum states for use in supervised, unsupervised, or downstream quantum/classical algorithms. Quantum embeddings serve as the foundational feature map in quantum machine learning, quantum representation learning, and quantum embedding theories for many-body physics, and appear in quantum-inspired classical models as efficient or interpretable ways of encoding structure.
1. Mathematical Formalism and Circuit Structure
Let $x \in \mathbb{R}^d$ denote a classical feature vector. A quantum embedding is a map
$$\Phi : x \mapsto |\psi(x)\rangle \in \mathcal{H},$$
where $\mathcal{H}$ is the Hilbert space of $n$ qubits. Typically, one prepares $|\psi(x)\rangle$ via a parameterized quantum circuit (ansatz):
$$|\psi(x)\rangle = U(x, \theta)\,|0\rangle^{\otimes n},$$
with $U(x, \theta)$ combining input-dependent rotations, entangling unitaries, and potentially trainable blocks:
$$U(x, \theta) = W(\theta)\, E\, S(x),$$ where
- $S(x)$ is the input feature block (typically $y$-axis rotations $R_y$),
- $E$ is a sequence of CNOTs or other entangling gates,
- $W(\theta)$ represents trainable single-qubit rotations.
The quantum kernel associated with $\Phi$ is
$$k(x, x') = |\langle \psi(x) | \psi(x') \rangle|^2,$$
enabling kernel-based learning and quantum metric learning paradigms (Nguyen et al., 2021).
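The circuit structure and kernel above can be sketched in a small pure-NumPy statevector simulation. This is an illustrative toy, not the paper's implementation; the function names, qubit ordering, and the 3-qubit example values are assumptions.

```python
import numpy as np

def ry(theta):
    """Single-qubit R_y rotation matrix."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

def apply_single(state, gate, qubit, n):
    """Apply a 2x2 gate to one qubit of an n-qubit statevector."""
    ops = [gate if q == qubit else np.eye(2) for q in range(n)]
    full = ops[0]
    for op in ops[1:]:
        full = np.kron(full, op)
    return full @ state

def apply_cnot(state, control, target, n):
    """Apply CNOT(control -> target); qubit 0 is the most significant bit."""
    dim = 2 ** n
    new = np.zeros_like(state)
    for i in range(dim):
        bits = [(i >> (n - 1 - q)) & 1 for q in range(n)]
        if bits[control] == 1:
            bits[target] ^= 1
        j = sum(b << (n - 1 - q) for q, b in enumerate(bits))
        new[j] = state[i]
    return new

def embed(x, theta, edges, n):
    """|psi(x)> = W(theta) E S(x) |0...0>: R_y encoding, CNOT entangler, trainable R_y."""
    state = np.zeros(2 ** n)
    state[0] = 1.0
    for q in range(n):                  # S(x): input-dependent R_y rotations
        state = apply_single(state, ry(x[q]), q, n)
    for (c, t) in edges:                # E: directed CNOT entangling pattern
        state = apply_cnot(state, c, t, n)
    for q in range(n):                  # W(theta): trainable R_y rotations
        state = apply_single(state, ry(theta[q]), q, n)
    return state

def quantum_kernel(x1, x2, theta, edges, n):
    """k(x, x') = |<psi(x)|psi(x')>|^2 (state fidelity)."""
    return abs(np.vdot(embed(x1, theta, edges, n), embed(x2, theta, edges, n))) ** 2
```

Since every block is unitary, the embedded state stays normalized and $k(x, x) = 1$ for any input, which is a quick sanity check on the simulation.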
2. Search Space, Entanglement, and the QES Algorithm
The expressivity and efficiency of quantum embeddings are intimately tied to the structure of the entangling layers. Each entangling block can be represented as an ordered, directed multi-graph $G = (V, E)$, where $V$ is the set of qubits and $E$ denotes the directed CNOTs. The "genotype" is a length-$L$ sequence of directed edges (CNOTs), with $L$ termed the entanglement level.
For $n$ qubits, the total number of possible distinct directed CNOT edges is $n(n-1)$. The reduced search space of depth-$D$ embedding circuits (fixing $L$ entanglers per layer) is
$$|\mathcal{S}| = \big(n(n-1)\big)^{L D}.$$
Searching this space is intractable for large $n$ or $L$.
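The counting is easy to make concrete: each entangler slot draws from the $n(n-1)$ ordered qubit pairs, and a depth-$D$ circuit with $L$ entanglers per layer has $L \cdot D$ such slots. A minimal sketch (function names are illustrative):

```python
from itertools import permutations

def cnot_edges(n):
    """All distinct directed CNOT edges on n qubits: the n*(n-1) ordered pairs."""
    return list(permutations(range(n), 2))

def search_space_size(n, L, D):
    """Number of depth-D circuits with L directed CNOTs per entangling layer."""
    return (n * (n - 1)) ** (L * D)

edges = cnot_edges(4)                    # 12 directed edges on 4 qubits
size = search_space_size(4, L=3, D=2)    # 12**6 = 2,985,984 candidate genotypes
```

Even at 4 qubits the space runs into the millions, which is why exhaustive enumeration gives way to surrogate-guided search.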
Quantum Embedding Search (QES) tackles this via a sequential model-based optimization (SMBO) algorithm using a surrogate (e.g., TPE), which proposes candidate entanglement patterns maximizing an acquisition function (expected improvement), thereby efficiently exploring for optimal embedding architectures (Nguyen et al., 2021).
3. Empirical Performance and Practitioner Guidelines
Extensive empirical benchmarking demonstrates that QES-found architectures match or surpass hand-designed embeddings and approach classical model accuracy on synthetic, UCI, and autoencoder-compressed low-qubit datasets:
- On a 4-dimensional synthetic dataset: QES-TPE (depth 2) achieved 83.5% accuracy, matching SVM/XGBoost and outperforming manual entangling designs by 2–7%.
- On the Iris dataset (4 features, 3 classes): QES-TPE (depth 2, 23 params) reached 95.33% accuracy, using fewer parameters than the best fair NN baseline.
- On higher-dimensional tabular data autoencoded to 4 qubits: QES-TPE circuits reached 97–98% test accuracy with 60–75 params, matching NNs with 200 params.
Best-practice guidelines identified in (Nguyen et al., 2021) include:
- Fix the single-qubit encoding and trainable rotations to $R_y$ gates,
- Use shallow ansatz depth (1–2) to avoid excessive noise and barren plateaus,
- Start with a low entanglement level $L$ and increase it incrementally until validation accuracy plateaus,
- Use an initial random sample of $20$ embeddings, then $300$ surrogate-guided proposals, with $50$–$100$ full trainings as a compute budget.
- For high-feature/low-qubit settings, reduce input dimensionality with a classical autoencoder prior to embedding.
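The last guideline can be sketched with a linear encoder (PCA via SVD) standing in for the trained classical autoencoder; the data, dimensions, and function names below are hypothetical stand-ins, not the paper's pipeline.

```python
import numpy as np

def fit_linear_encoder(X, k):
    """Fit a k-dimensional linear encoder (PCA via SVD) as a stand-in
    for the classical autoencoder used before quantum embedding."""
    mu = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, Vt[:k]                    # mean and top-k principal directions

def encode(X, mu, components):
    """Project high-dimensional features down to k inputs (one per qubit)."""
    return (X - mu) @ components.T

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 16))           # 16 raw features (hypothetical data)
mu, comps = fit_linear_encoder(X, k=4)
Z = encode(X, mu, comps)                 # shape (100, 4): ready for 4-qubit R_y encoding
```

The compressed coordinates then feed the single-qubit rotation angles of a 4-qubit embedding, keeping the circuit width fixed regardless of the raw feature count.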
4. Quantum Embedding Search: Surrogate Modeling and Optimization
The QES algorithm formulates embedding search as
$$e^\star = \arg\min_{e \in \mathcal{S}} f(e),$$
where $f(e)$ quantifies performance (validation loss) of the QNN instantiated with entanglement pattern $e$. The surrogate is trained on observed pairs $(e, f(e))$ and is used to propose new candidates maximizing expected improvement (EI):
$$\mathrm{EI}(e) = \mathbb{E}_{p(f \mid e)}\big[\max(f^\star - f, 0)\big],$$
where $p(f \mid e)$ is the modeled distribution over losses and $f^\star$ is the best loss observed so far. The surrogate-based approach drastically reduces the number of expensive full trainings required per run, leading to substantial computational efficiency, particularly critical in NISQ simulation regimes (Nguyen et al., 2021).
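The SMBO loop can be sketched as follows. Note the simplifications: the TPE surrogate is replaced by a toy k-nearest-neighbor Gaussian estimate over Hamming distance between genotypes, and the "full training" is an arbitrary callback; all names and defaults are illustrative assumptions.

```python
import math
import random

def expected_improvement(mu, sigma, f_best):
    """Closed-form EI for minimization under a Gaussian prediction N(mu, sigma^2)."""
    if sigma <= 0:
        return max(f_best - mu, 0.0)
    z = (f_best - mu) / sigma
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2)))
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)
    return (f_best - mu) * cdf + sigma * pdf

def hamming(a, b):
    """Positions where two length-L genotypes (edge sequences) differ."""
    return sum(x != y for x, y in zip(a, b))

def surrogate(candidate, history, k=3):
    """Toy surrogate: mean/std of losses of the k nearest evaluated genotypes."""
    nearest = sorted(history, key=lambda h: hamming(h[0], candidate))[:k]
    losses = [l for _, l in nearest]
    mu = sum(losses) / len(losses)
    var = sum((l - mu) ** 2 for l in losses) / len(losses)
    return mu, math.sqrt(var) + 1e-3     # floor sigma to keep EI well-defined

def qes_smbo(evaluate, edges, L, n_init=5, n_prop=50, n_iters=10, seed=0):
    """SMBO loop: random initial designs, then evaluate the EI-maximizing proposal."""
    rng = random.Random(seed)
    sample = lambda: tuple(rng.choice(edges) for _ in range(L))
    history = [(g, evaluate(g)) for g in {sample() for _ in range(n_init)}]
    for _ in range(n_iters):
        f_best = min(l for _, l in history)
        props = [sample() for _ in range(n_prop)]
        best = max(props, key=lambda g: expected_improvement(*surrogate(g, history), f_best))
        history.append((best, evaluate(best)))           # one expensive full training
    return min(history, key=lambda h: h[1])
```

Each iteration spends one expensive evaluation (a full QNN training in the real workflow) on the candidate the cheap surrogate rates most promising, which is the source of the efficiency gain.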
5. Expressivity, Parameter Efficiency, and Quantum Advantage
The QES framework enables discovery of quantum embedding circuits that are both parameter- and depth-efficient:
- For moderate qubit counts $n$, QES-optimized circuits reach accuracy competitive with classical NNs or boosting methods with far fewer parameters.
- t-SNE visualization of quantum-encoded representations shows improved class separation, indicating the embedding circuit's ability to remap data geometry beneficially.
- Empirically, QES-TPE closes the performance gap to classical SVM/XGBoost, often with a fraction of the number of tunable weights and shallow circuit depths.
A key observation is that the quantum kernel defined by the optimized embedding matches or surpasses classical kernels in class-separation capacity, confirming the utility of trainable quantum feature maps and the role of circuit structure search in practical QML (Nguyen et al., 2021).
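Class-separation capacity of a kernel can be quantified with kernel-target alignment. The sketch below uses the exact single-qubit $R_y$ fidelity kernel $k(a, b) = \cos^2\!\big((a-b)/2\big)$ on toy 1-D angles as a stand-in for a trained multi-qubit embedding kernel; the data and names are illustrative.

```python
import numpy as np

def kernel_target_alignment(K, y):
    """Alignment <K, yy^T>_F / (||K||_F ||yy^T||_F): a standard proxy for
    how well a kernel separates two classes (labels y in {-1, +1})."""
    Y = np.outer(y, y)
    return float(np.sum(K * Y) / (np.linalg.norm(K) * np.linalg.norm(Y)))

def toy_kernel(a, b):
    """Fidelity kernel of a single-qubit R_y embedding: |<psi(a)|psi(b)>|^2."""
    return np.cos((a - b) / 2) ** 2

angles = np.array([0.0, 0.1, -0.1, np.pi, np.pi + 0.1, np.pi - 0.1])
y = np.array([1, 1, 1, -1, -1, -1])
K = toy_kernel(angles[:, None], angles[None, :])
score = kernel_target_alignment(K, y)    # higher alignment = better separation
```

Here the two classes sit near opposite poles of the Bloch sphere, so within-class kernel entries are near 1 and cross-class entries near 0, pushing the alignment toward its maximum for a non-negative kernel ($1/\sqrt{2} \approx 0.71$).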
6. Reproducibility, Implementation Trade-offs, and Outlook
QES and related quantum embedding workflows lend themselves to reproducibility on NISQ hardware or simulators:
- Each search consists of a fixed budget of trials, each entailing (a) an embedding proposal, (b) full circuit training, (c) validation, and (d) a surrogate model update.
- Single-run compute cost is governed by the number of full trainings, with 4-qubit, depth-2 circuits on simulators consuming $2$–$4$ GPU-days per search at low entanglement levels.
- Shallow-layer, low-parameter embeddings are critical for robustness under hardware noise, and for maintaining tractable classical simulation times.
A plausible implication is that SMBO-driven quantum embedding search will remain a key component of NISQ-era hybrid QML practice, as hardware constraints and model selection demands outpace naive circuit enumeration or heuristic design.
7. Relation to Quantum Kernel Theory and Machine Learning Paradigms
Quantum embeddings form the theoretical bridge between quantum machine learning feature maps, quantum kernel methods, and entanglement structure optimization:
- The map $x \mapsto |\psi(x)\rangle$ is directly analogous to the nonlinear feature map $\phi(x)$ in kernel SVMs,
- The QES framework generalizes classical "circuit architecture search" to the quantum domain, with direct impact on expressivity and generalization,
- The quantum kernel induced by the embedding provides access to analytically optimal measurements (e.g., Helstrom for trace-distance optimality),
- Tuning the mapping and associated circuit architecture controls metric learning capacity—maximizing between-class trace distance or Hilbert-Schmidt distance leads to empirically optimal minimum-risk classification (Nguyen et al., 2021, Lloyd et al., 2020).
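The distances named above are straightforward to compute from class-averaged density matrices. A minimal NumPy sketch on a toy single-qubit $R_y$ embedding (all values illustrative):

```python
import numpy as np

def density(states):
    """Class-average density matrix rho = (1/m) sum_i |psi_i><psi_i|."""
    return sum(np.outer(s, s.conj()) for s in states) / len(states)

def hilbert_schmidt_distance(rho, sigma):
    """D_HS(rho, sigma) = Tr[(rho - sigma)^2], the quantity maximized
    in quantum metric learning."""
    d = rho - sigma
    return float(np.real(np.trace(d @ d)))

def trace_distance(rho, sigma):
    """D_tr(rho, sigma) = (1/2) sum |eig(rho - sigma)|, which fixes the
    Helstrom minimum error for distinguishing the two class ensembles."""
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

# Toy single-qubit R_y embedding |psi(a)> = [cos(a/2), sin(a/2)]:
psi = lambda a: np.array([np.cos(a / 2), np.sin(a / 2)])
rho_a = density([psi(0.0), psi(0.2)])              # class A embedded near |0>
rho_b = density([psi(np.pi), psi(np.pi - 0.2)])    # class B embedded near |1>
sep = trace_distance(rho_a, rho_b)                 # near 1: classes well separated
```

Maximizing either distance during embedding training pushes the class ensembles toward orthogonal regions of state space, directly lowering the Helstrom error bound.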
The development of algorithmic quantum embedding search substantiates the critical insight that representational power in quantum ML arises from precise control over the mapping of input data to quantum state space, with efficient optimization algorithms serving as practical enablers in both simulation and hardware environments.