
Quantum Temporal CNN (QTCNN)

Updated 14 December 2025
  • Quantum Temporal Convolutional Neural Network (QTCNN) is a hybrid architecture that combines classical multi-scale temporal encoders with quantum convolution circuits to predict equity returns.
  • It employs efficient angle embedding and hierarchical, parameter-sharing quantum layers to harness superposition and entanglement for robust sequential modeling in noisy financial environments.
  • Empirical evaluations on the JPX Tokyo Stock Exchange dataset demonstrate superior out-of-sample Sharpe ratios, highlighting its performance advantage over classical models.

The Quantum Temporal Convolutional Neural Network (QTCNN) is a hybrid quantum–classical architecture engineered for cross-sectional equity return prediction in noisy, nonstationary financial environments. QTCNN integrates a classical temporal encoder with hierarchical, parameter-efficient quantum convolutional circuits, designed to leverage quantum features such as superposition and entanglement for more robust and generalizable sequential modeling. The primary modeling pipeline consists of a classical temporal encoder ("TConv") that extracts multi-scale temporal features, followed by quantum angle embedding and convolutional processing, and culminating in a hybrid classification head. Empirical evaluation on the JPX Tokyo Stock Exchange dataset demonstrates superior out-of-sample Sharpe ratios compared to both classical and other quantum deep learning baselines (Chen et al., 7 Dec 2025).

1. Architectural Overview

QTCNN is composed of three major modules: (1) a classical temporal CNN encoder (TConv), (2) a quantum convolutional circuit, and (3) a hybrid classification stage. Input data $X \in \mathbb{R}^{T \times F}$, representing multi-day sequences of technical indicators per equity, is processed through these stages as follows:

  • Classical Temporal Encoder: Extracts a temporally compact feature $z \in \mathbb{R}^{n_q}$.
  • Quantum Convolutional Circuit: Embeds $z$ via angle embedding onto $n_q$ qubits, applies $L$ hierarchical quantum convolution and pooling layers, and outputs a single-qubit expectation value $q$.
  • Hybrid Classification Head: Concatenates $q$ and $z$, propagates the result through an MLP, and outputs the probability prediction $\hat{y}$ via a sigmoid activation.

The canonical instantiation sets the qubit count $n_q = 8$, sequence length $T = 20$, and uses dozens of feature channels $F$ derived from engineered technical indicators.

2. Classical Temporal Encoder Design

Given a sequential input $X = [x_{t-T+1}, \ldots, x_t] \in \mathbb{R}^{T \times F}$, the TConv encoder applies a stack of one-dimensional temporal convolutions:

  • Convolutional Layer: $h^{(\ell)}_{t,f'} = \sum_{\tau=0}^{K-1} \sum_{f} W^{(\ell)}_{\tau, f, f'} \, h^{(\ell-1)}_{t-\tau, f}$
  • Nonlinear Activation: ReLU or a variant.
  • Global Average Pooling (GAP): $g_f = \frac{1}{T} \sum_{t=1}^{T} h^{(L)}_{t,f}$
  • MLP Projection: $z = W_{\mathrm{MLP}} \cdot \mathrm{GAP}[h^{(L)}] + b \in \mathbb{R}^{n_q}$

This encoder provides efficient multi-scale temporal pattern extraction, capturing, for example, momentum and volatility fluctuations. The projection yields an $n_q$-dimensional latent representation $z$ suitable for quantum encoding.
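
As a concrete illustration, the encoder's computation can be sketched in NumPy: a causal one-dimensional convolution over time, ReLU, global average pooling, and a linear projection to $n_q$ dimensions. The shapes, kernel width, and random initialization below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def tconv_encoder(X, W_conv, W_mlp, b, K=3):
    """Sketch of the TConv encoder: causal 1-D temporal convolution,
    ReLU, global average pooling, and a linear projection to n_q dims."""
    T, F = X.shape
    C = W_conv.shape[2]                       # W_conv: (K, F, C)
    H = np.zeros((T, C))
    for t in range(T):
        for tau in range(min(K, t + 1)):      # causal: only past taps
            H[t] += X[t - tau] @ W_conv[tau]  # sum over input channels f
    H = np.maximum(H, 0.0)                    # ReLU
    g = H.mean(axis=0)                        # global average pooling over time
    return W_mlp @ g + b                      # z in R^{n_q}

# Toy shapes matching the paper's canonical setting (T=20, n_q=8)
rng = np.random.default_rng(0)
T, F, C, n_q = 20, 5, 16, 8
z = tconv_encoder(rng.normal(size=(T, F)),
                  rng.normal(size=(3, F, C)) * 0.1,
                  rng.normal(size=(n_q, C)) * 0.1,
                  np.zeros(n_q))
print(z.shape)  # (8,)
```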

3. Quantum Convolutional Module

3.1 Data Encoding (“Angle Embedding”)

The feature vector $z = (z_1, \ldots, z_{n_q})$ is mapped to a quantum state using $R_Y$ rotations applied independently to each qubit:

$|\psi_0(z)\rangle = \bigotimes_{j=1}^{n_q} R_Y(z_j)\, |0\rangle^{\otimes n_q}$

where $R_Y(\theta) = \exp(-i \theta Y / 2)$.
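
The angle embedding can be reproduced directly in NumPy; this sketch builds $|\psi_0(z)\rangle$ as a tensor product of $R_Y(z_j)|0\rangle$ factors (a simulation sketch, not the paper's PennyLane code):

```python
import numpy as np

def ry(theta):
    """Single-qubit R_Y(theta) = exp(-i*theta*Y/2) as a 2x2 matrix (real-valued)."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

def angle_embed(z):
    """Tensor product of R_Y(z_j)|0> over all qubits: the state |psi_0(z)>."""
    state = np.array([1.0])
    for zj in z:
        state = np.kron(state, ry(zj) @ np.array([1.0, 0.0]))
    return state

z = np.array([0.3, 1.2, -0.7])        # 3 qubits for a small demonstration
psi = angle_embed(z)
print(psi.shape, np.isclose(np.linalg.norm(psi), 1.0))  # (8,) True
```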

3.2 Hierarchical Quantum Convolution and Pooling

Each convolutional layer applies a shared six-parameter unitary to adjacent qubit pairs $(i, i+1)$:

$U_{\mathrm{conv}}(\theta) = \mathrm{CNOT}_{i,i+1}\,[R_Y(\theta_1) \otimes R_Z(\theta_2)]\, \mathrm{CNOT}_{i+1,i}\,[R_Y(\theta_3) \otimes R_Z(\theta_4)]\,[R_Y(\theta_5) \otimes R_Z(\theta_6)]$

  • Parameters $\theta^{(\ell)} \in \mathbb{R}^6$ are shared across all pairs within layer $\ell$, reducing the number of trainable quantum parameters.
  • After each layer, pooling discards alternate qubits (odd indices), halving the register size. Applied recursively over $L$ layers, this yields depth $L_{\mathrm{eff}} \leq \lfloor \log_2(n_q) \rfloor$ and a final single-qubit state.
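
A minimal NumPy construction of the six-parameter two-qubit block is shown below. The basis ordering ($|q_0 q_1\rangle$) and the convention that the rightmost factor in the operator product acts first are illustrative assumptions:

```python
import numpy as np

def ry(t):
    c, s = np.cos(t / 2), np.sin(t / 2)
    return np.array([[c, -s], [s, c]], dtype=complex)

def rz(t):
    return np.diag([np.exp(-1j * t / 2), np.exp(1j * t / 2)])

# CNOTs in the |q0 q1> basis: control on qubit 0 and on qubit 1, respectively
CNOT_01 = np.array([[1,0,0,0],[0,1,0,0],[0,0,0,1],[0,0,1,0]], dtype=complex)
CNOT_10 = np.array([[1,0,0,0],[0,0,0,1],[0,0,1,0],[0,1,0,0]], dtype=complex)

def u_conv(theta):
    """Shared six-parameter block U_conv(theta); rightmost factor applied first."""
    a = np.kron(ry(theta[0]), rz(theta[1]))
    b = np.kron(ry(theta[2]), rz(theta[3]))
    c = np.kron(ry(theta[4]), rz(theta[5]))
    return CNOT_01 @ a @ CNOT_10 @ b @ c

U = u_conv(np.array([0.1, 0.2, 0.3, 0.4, 0.5, 0.6]))
print(np.allclose(U.conj().T @ U, np.eye(4)))  # True: the block is unitary
```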

3.3 Quantum Measurement

The surviving qubit of the final state is measured in the $Z$ basis to obtain the output $q \in [-1, 1]$:

$q = \langle Z \rangle = \langle \psi_{\mathrm{final}} | Z | \psi_{\mathrm{final}} \rangle$
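
Computing $\langle Z \rangle$ from a statevector is a weighted sum of measurement probabilities with signs $\pm 1$. This sketch assumes big-endian qubit ordering (an illustrative convention):

```python
import numpy as np

def expect_z(state, qubit, n):
    """<Z> on one qubit of an n-qubit statevector (big-endian qubit order):
    +1 weight where that qubit's bit is 0, -1 where it is 1."""
    probs = np.abs(state) ** 2
    signs = np.array([1.0 if ((i >> (n - 1 - qubit)) & 1) == 0 else -1.0
                      for i in range(len(state))])
    return float(probs @ signs)

# Sanity checks: |0> gives +1; an equal superposition gives 0
plus = np.array([1.0, 1.0]) / np.sqrt(2)
print(expect_z(np.array([1.0, 0.0]), 0, 1))  # 1.0
print(round(expect_z(plus, 0, 1), 6))        # 0.0
```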

4. Hybrid Classification and Training

The quantum output $q$ is concatenated with the classical embedding $z$, forming $h_0 = [q; z] \in \mathbb{R}^{n_q + 1}$. A two-layer MLP propagates $h_0$ using ReLU activations, with intermediate dimensionalities $[64, 32]$, and applies a sigmoid to yield the probability prediction $\hat{y}$.
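
A NumPy sketch of this head follows, with the $[64, 32]$ hidden widths from the text; the random weight initialization is purely for illustration:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def hybrid_head(q, z, params):
    """Two-layer ReLU MLP on h0 = [q; z] with a sigmoid output."""
    h = np.concatenate([[q], z])             # h0 in R^{n_q + 1}
    W1, b1, W2, b2, W3, b3 = params
    h = np.maximum(W1 @ h + b1, 0.0)         # 64 hidden units
    h = np.maximum(W2 @ h + b2, 0.0)         # 32 hidden units
    return sigmoid(W3 @ h + b3)[0]           # scalar probability y_hat

rng = np.random.default_rng(1)
n_q = 8
params = (rng.normal(size=(64, n_q + 1)) * 0.1, np.zeros(64),
          rng.normal(size=(32, 64)) * 0.1, np.zeros(32),
          rng.normal(size=(1, 32)) * 0.1, np.zeros(1))
y_hat = hybrid_head(0.42, rng.normal(size=n_q), params)
print(0.0 < y_hat < 1.0)  # True: sigmoid output is a valid probability
```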

4.1 Loss Function and Optimization

Classification targets are binary labels $y \in \{0, 1\}$ identifying extreme long/short equity candidates. The binary cross-entropy objective is:

$\mathcal{L}_{\mathrm{BCE}} = -\frac{1}{N} \sum_{i=1}^{N} \left[ y_i \log \hat{y}_i + (1 - y_i) \log(1 - \hat{y}_i) \right]$

QTCNN is trained with AdamW (learning rate $1 \times 10^{-3}$, batch size 128, 50 epochs).
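
The BCE objective is easy to verify numerically; the `eps` clipping below is a standard numerical guard against $\log 0$, not something specified in the paper:

```python
import numpy as np

def bce_loss(y, y_hat, eps=1e-12):
    """Mean binary cross entropy over N samples."""
    y_hat = np.clip(y_hat, eps, 1.0 - eps)   # guard against log(0)
    return float(-np.mean(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat)))

y = np.array([1, 0, 1, 0])
loss = bce_loss(y, np.array([0.9, 0.1, 0.8, 0.2]))
print(round(loss, 4))  # 0.1643
```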

4.2 Quantum Gradient Evaluation

For quantum circuit parameters $\theta_j$, gradients of observables are computed via the parameter-shift rule:

$\frac{\partial \langle O \rangle}{\partial \theta_j} = \frac{1}{2} \left[ \langle O \rangle_{\theta_j + \pi/2} - \langle O \rangle_{\theta_j - \pi/2} \right]$

These gradients interface with classical optimization frameworks through PennyLane’s automatic differentiation.
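
The rule can be checked on a one-qubit case where the gradient is known analytically: for $R_Y(\theta)|0\rangle$, $\langle Z \rangle = \cos\theta$, so the gradient should equal $-\sin\theta$ exactly:

```python
import numpy as np

def expect_z_after_ry(theta):
    """<Z> for R_Y(theta)|0>, which equals cos(theta) analytically."""
    state = np.array([np.cos(theta / 2), np.sin(theta / 2)])
    return state[0] ** 2 - state[1] ** 2

def param_shift_grad(f, theta):
    """Parameter-shift rule: exact gradient from two shifted circuit evaluations."""
    return 0.5 * (f(theta + np.pi / 2) - f(theta - np.pi / 2))

theta = 0.7
g = param_shift_grad(expect_z_after_ry, theta)
print(np.isclose(g, -np.sin(theta)))  # True: matches d/dtheta cos(theta)
```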

5. Benchmarking and Empirical Performance

5.1 Dataset, Feature Engineering, and Labeling

The principal evaluation uses daily JPX Tokyo Stock Exchange data (2017–2022), focusing on Universe0 stocks ($\sim$2,000 equities). Feature engineering comprises momentum, volatility, and volume signals, all z-scored and winsorized cross-sectionally. Labels are assigned with weak supervision, selecting the top/bottom $p = 200$ equities per day as positive/negative classes.
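
A sketch of the cross-sectional preprocessing applied per day is shown below; the exact winsorization quantiles are an assumption, as the paper does not state them:

```python
import numpy as np

def winsorize_zscore(x, lo=0.01, hi=0.99):
    """Per-day cross-sectional cleanup: clip to the [1%, 99%] quantiles
    (winsorize), then z-score across the stock universe. Quantile
    bounds are illustrative assumptions."""
    ql, qh = np.quantile(x, [lo, hi])
    x = np.clip(x, ql, qh)
    return (x - x.mean()) / (x.std() + 1e-12)

rng = np.random.default_rng(2)
raw = rng.normal(size=2000)   # one feature across ~2,000 equities on one day
raw[0] = 50.0                 # an extreme outlier to be clipped
f = winsorize_zscore(raw)
print(abs(float(f.mean())) < 1e-9, float(f.max()) < 5.0)  # True True
```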

  • Training set: first 80% temporal partition
  • Testing set: last 20% (out-of-sample, OOS)
  • Subsampling with stride $k = 11$: $\sim$100 training days, 22 OOS days per run
  • Evaluation metric: annualized Sharpe ratio of long/short portfolios ($K = 200$ stocks per side), with bootstrap 95% confidence intervals
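
The evaluation metric reduces to a short computation on the daily long/short return series; the 252-day annualization factor and zero risk-free rate are conventional assumptions, not stated in the source:

```python
import numpy as np

def annualized_sharpe(daily_returns, periods=252):
    """Annualized Sharpe ratio of a daily return series
    (zero risk-free rate assumed, as is typical for long/short portfolios)."""
    mu = np.mean(daily_returns)
    sigma = np.std(daily_returns, ddof=1)
    return float(mu / sigma * np.sqrt(periods))

# Illustrative series over a 22-day OOS window, as in the paper's setup
rng = np.random.default_rng(3)
r = rng.normal(loc=0.0005, scale=0.01, size=22)
print(np.isfinite(annualized_sharpe(r)))  # True
```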

5.2 Comparative Results

| Model       | OOS Sharpe Ratio | 95% CI  |
|-------------|------------------|---------|
| QTCNN       | 0.538            | ±0.042  |
| QNN         | 0.467            | ±0.038  |
| QCNN        | 0.361            | ±0.298  |
| QLSTM       | 0.333            | ±0.051  |
| Transformer | 0.313            | ±0.036  |
| LSTM        | 0.044            | ±0.029  |

QTCNN achieves a Sharpe ratio of 0.538, outperforming the best classical baseline (Transformer) by approximately 72% and surpassing quantum convolutional variants lacking temporal encoders by $\sim$49%. This suggests a critical role for the temporal encoder and the parameter-efficient quantum convolution structure in enhancing predictive robustness and generalization.

6. Quantum Circuit Implementation and Complexity

  • Qubit count: $n_q = 8$
  • Quantum depth: $L \leq 3$ (determined by the pooling hierarchy)
  • Entanglement layout: nearest-neighbor ring CNOTs, NISQ-compatible
  • Simulation framework: PennyLane, using the default.qubit/lightning backends
  • Per-iteration wall time: QTCNN $\approx$ 0.25 s vs. LSTM $\approx$ 0.002 s (hardware: RTX 3090 GPU + i9 CPU)

The implementation leverages parameter sharing and pooling to optimize sample efficiency and scalability with limited qubit resources.

7. Limitations and Prospective Directions

Key limitations include the modest OOS sample (22 days) and extensive subsampling ($\sim$9% of available days), which reduce statistical power. All experiments use idealized (noiseless) qubit simulation; future work must address robustness to quantum noise (e.g., depolarizing and amplitude-damping channels) and physical device constraints. Feature engineering is restricted to price–volume technical indicators; incorporating sentiment, macroeconomic, or order-book data could enrich the representational scope. Generalizability to other global equity markets and asset classes remains unverified. Theoretical investigation into expressivity, barren-plateau phenomena, and the conditions for quantum advantage in financial time-series models also remains open.

In summary, QTCNN represents a hybrid paradigm in quantum machine learning, synthesizing classical multi-scale temporal encoding with hierarchical, parameter-sharing quantum circuits to address predictive challenges in complex financial data. Empirical results indicate substantial outperformance relative to canonical deep learning and hybrid quantum approaches, underscoring the practical relevance of quantum-enhanced forecasting for quantitative finance (Chen et al., 7 Dec 2025).
