Sinkclass Autoencoder for Quantum Dimensionality Reduction
- Sinkclass autoencoder is a dimensionality reduction architecture that integrates Sinkhorn divergence with label-driven regularization to yield bimodal latent embeddings for quantum applications.
- It employs a deep encoder-decoder framework with explicit noise and label branches to compress 67-dimensional Higgs boson data into a 16-dimensional, discriminative representation.
- Empirical results show that, with optimized hyperparameters, Sinkclass outperforms variational and classifier autoencoders, achieving a QSVM AUC of up to 0.74.
The Sinkclass autoencoder is a dimensionality reduction architecture designed to facilitate quantum machine learning, particularly for high-dimensional data sets outside the present reach of quantum algorithms. By integrating optimal-transport regularization (Sinkhorn divergence) with explicit label-driven latent space separation, the Sinkclass autoencoder enhances class-discriminative, low-dimensional embeddings suitable for subsequent quantum support vector machine (QSVM) classification. Developed in the context of Higgs boson detection at the LHC with 67 input features, it enables quantum classifiers to operate on compressed yet maximally informative inputs, outperforming variational and classifier autoencoders as well as conventional classical reduction techniques (Odagiu et al., 1 Dec 2025).
1. Architectural Overview
The Sinkclass autoencoder processes datasets with large feature counts, exemplified by 67-dimensional feature vectors normalized to . Its network configuration is composed of four major components:
- Encoder (): A stack of six fully-connected layers with decreasing dimensionality , using implicit ReLU activations. The mapping produces compact latent representations.
- Conditional Latent-Space Regularizer ("Noise Generator"):
- Gaussian branch: Two layers , taking noise .
- Label branch: One layer , with the true class bit ($0$ or $1$) as input.
- Fusion network: Concatenates outputs and propagates through three layers , yielding the “target” latent vector .
- Decoder (): Six fully-connected layers, mirroring the encoder , reconstructing from .
- Classifier (): A conventional feed-forward module attached to , predicting signal versus background with a sigmoid output, structurally analogous to that in a “Classifier AE”.
2. Loss Function Composition
The Sinkclass objective incorporates three contributions:
- Reconstruction error ():
- Binary cross-entropy classifier loss ():
- Sinkhorn divergence ():
Entropically regularized optimal-transport divergence calculated between the batch of encoded latent vectors and the batch of generator outputs (see Patrini et al. 2020 for mathematical detail).
The total loss minimized is:
The hyperparameters and determine the regularization/classification trade-off.
3. Comparison with Alternative Representations
Ablation studies and direct architectural comparisons reveal the differentiating theoretical and empirical features of Sinkclass AE:
- Standard AE involves only , resulting in unstructured latent space.
- Variational AE substitutes with a Kullback-Leibler regularizer, enforcing strict unimodal (Gaussian) latent space, which can lead to signal/background overlap.
- Classifier AE includes , promoting class-separation in the latent encoding but at the expense of substantial reconstruction quality loss due to absent distributional regularization.
- Sinkclass AE applies Sinkhorn divergence to softly guide the encoder’s output toward the learned, label-conditioned generator manifold (), and jointly employs to assure class separation. This mechanism yields a bimodal, disentangled latent representation that maintains both discriminativity and reconstruction fidelity.
4. Training Protocol and Hyperparameterization
- Dataset partitioning: 1.15 million samples for training, 0.144 million each for validation and test (all normalized).
- Pre-filtering: Physics-driven event selection based on , , and object multiplicity.
- Optimization: Adam optimizer with a learning rate of $0.001$, batch size $128$, and early stopping on validation loss.
- Hyperparameter selection:
5. Empirical Evaluation and Benchmarking
The performance of the Sinkclass autoencoder is assessed via downstream QSVM AUC on the $16$D reduced representations. Results across five test splits ($3600$ events per split) are summarized:
| Model variant | QSVM AUC | |
|---|---|---|
| Sinkclass (MSE-opt) | 0.0008, 0.90 | 0.50 ± 0.01 |
| Sinkclass (BCE-opt) | 0.20, 0.02 | 0.74 ± 0.01 |
| Classifier AE (BCE-opt) | 0.60 (only weighted) | 0.72 ± 0.01 |
| RBM (classical baseline) | — | 0.65 ± 0.016 |
| PCA / ICA / … | — | 0.53–0.60 |
- Best QSVM AUC: is achieved by Sinkclass AE tuned for BCE minimization.
- Hyperparameter sensitivity:
- yields minimal reconstruction error (MSE), but QSVM AUC collapses to $0.50$.
- produces a maximal separation in latent space and optimal QSVM AUC, with a slight increase in reconstruction error.
6. Theoretical Underpinnings and Latent Structure
The Sinkhorn regularization () ensures that the encoder’s latent distribution aligns via a learnable, non-linear transport from a simple base (Gaussian plus label information), as opposed to strictly matching a fixed Gaussian prior. This approach enables preservation of class-conditioned multimodal structure in latent space, mitigating the mode-collapse or overlap observed in variational autoencoders. The BCE classifier term () further drives the system toward a bimodal latent embedding, mapping signal and background events into spatially disjoint clusters. This structured latent space is well suited for quantum embeddings, particularly the $16$-qubit regime, thereby translating to improved downstream QSVM discrimination.
7. Applications and Broader Significance
The Sinkclass autoencoder’s design relaxes constraints that often limit variational and classifier autoencoders in quantum-classifier settings. By delivering strong class separation and information retention during dimensionality reduction, it opens quantum machine learning workflows to a wider class of high-dimensional scientific datasets, exemplified in LHC signal-versus-background searches. The recipe provided for effective regularization and hyperparameter balancing directly extends the reach of quantum classifiers and suggests potential utility in broader scientific and industrial applications where quantum resources are bottlenecked by feature dimensionality (Odagiu et al., 1 Dec 2025).