
Discretized NeuTucF Model for Sparse Turbulence Data

Updated 12 December 2025
  • The paper introduces a neural variant of Tucker factorization that discretizes continuous wind field data, achieving state-of-the-art performance in turbulence estimation.
  • It employs quantile-based binning and a four-mode neural Tucker decomposition to embed discrete indices and capture complex spatio-temporal interactions.
  • Empirical results show lower MAE and RMSE compared to baselines, demonstrating robust imputation and regression of atmospheric turbulence quantities.

The discretized NeuTucF model is a neural variant of Tucker factorization tailored to model sparse, multi-dimensional turbulence data using quantized, discrete input features. Developed to estimate turbulence quantities such as the Richardson number from wind profile radar data, NeuTucF achieves state-of-the-art performance on continuous yet sparse three-dimensional wind fields by embedding discretized indices into low-rank representations and capturing multiway spatio-temporal interactions via a four-mode neural Tucker decomposition. This methodology enables imputation and regression of missing entries in real-world atmospheric datasets, outperforming common baseline models (Fan et al., 5 Dec 2025).

1. Input Discretization and Quantization

The model operates on four continuous variables: altitude ($h$) and the three wind speed components ($u$, $v$, $w$). Each variable undergoes the following transformation pipeline:

  • Standardization: Each feature $X$ is z-normalized,

$$X_{\rm std} = \frac{X - \mu}{\sigma},$$

where $\mu$ and $\sigma$ are the empirical mean and standard deviation.

  • Quantile-based binning: Each standardized feature is partitioned into $K$ equally populated bins. Quantile boundaries $B_k$ are computed as

$$B_k = Q\left(\frac{k}{K}\right), \quad k = 0, 1, \dots, K,$$

where $Q(\cdot)$ denotes the empirical quantile function.

  • Piecewise discretization: Each real-valued standardized input $x$ is mapped to its corresponding bin index using

$$\mathrm{idx}(x) = \begin{cases} 0, & x < B_1, \\ i, & B_i \le x < B_{i+1}, \quad i = 1, \dots, K-1, \\ K, & x \ge B_K. \end{cases}$$

Applying this scheme to all features produces discrete indices $p, i, j, k \in \{0, \dots, K\}$ for each observation. All quantized data are then aggregated into a four-way array

$$\mathcal Z = \mathcal D(\mathcal Y) \in \mathbb R^{(K+1) \times (K+1) \times (K+1) \times (K+1)},$$

where $\mathcal D$ denotes the discretization operator applied to the raw observations $\mathcal Y$, with $\mathcal Z_{p,i,j,k}$ encoding the (possibly missing) Richardson number at the joint quantized coordinates.
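
As a concrete illustration, here is a minimal NumPy sketch of this discretization pipeline (the function name and bin count are illustrative, not from the paper):

```python
import numpy as np

def discretize(x, K):
    """Map a continuous feature to quantile-bin indices in {0, ..., K}."""
    # Standardization: z-normalize with the empirical mean and std.
    x_std = (x - x.mean()) / x.std()
    # Quantile boundaries B_0, ..., B_K at probabilities k/K.
    B = np.quantile(x_std, np.linspace(0.0, 1.0, K + 1))
    # Piecewise rule: x < B_1 -> 0; B_i <= x < B_{i+1} -> i; x >= B_K -> K.
    # searchsorted over [B_1, ..., B_K] with side="right" implements exactly this.
    return np.searchsorted(B[1:], x_std, side="right")

# Illustrative usage: bin altitude and the three wind components.
K = 9  # placeholder bin count; the paper's K is not restated here
h, u, v, w = (np.random.randn(1000) for _ in range(4))
p, i, j, k = (discretize(f, K) for f in (h, u, v, w))
```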

2. Tucker-Based Neural Model Architecture

The NeuTucF model encodes interactions between quantized features using a four-mode neural Tucker factorization:

  • Factor matrices (embeddings): For each mode $n = 1, 2, 3, 4$ (corresponding to $h, u, v, w$), a learnable matrix

$$U^{(n)} \in \mathbb R^{(K+1) \times r}$$

provides $r$-dimensional embeddings for each discrete value. For instance, $e^{(p)} = U^{(1)}_{p,:}$ embeds the height bin index $p$.

  • Core tensor: A dense core tensor

$$\mathcal G \in \mathbb R^{r \times r \times r \times r}$$

encodes higher-order interactions between modes. All experiments fix $r = 5$.

  • Tucker reconstruction: Classically, the estimated value at $(p, i, j, k)$ is

$$\hat y_{pijk} = \sum_{a,b,c,d} \mathcal G_{abcd}\, U^{(1)}_{p,a}\, U^{(2)}_{i,b}\, U^{(3)}_{j,c}\, U^{(4)}_{k,d}.$$
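
For concreteness, this classical reconstruction is a single contraction of the core with one embedding row per mode. A minimal torch.einsum sketch (shapes and variable names are illustrative):

```python
import torch

K, r = 9, 5                                    # illustrative bin count and rank
G = torch.randn(r, r, r, r)                    # core tensor
U = [torch.randn(K + 1, r) for _ in range(4)]  # factor matrices for h, u, v, w

def tucker_entry(p, i, j, k):
    # hat{y}_{pijk} = sum_{a,b,c,d} G[a,b,c,d] U1[p,a] U2[i,b] U3[j,c] U4[k,d]
    return torch.einsum("abcd,a,b,c,d->", G, U[0][p], U[1][i], U[2][j], U[3][k])
```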

3. Neural Interaction Tensor Construction

Rather than multiplying full mode-factor matrices as in classical Tucker, NeuTucF constructs an "interaction tensor" from the outer product of the relevant embeddings:

$$\mathcal T_{pijk} = e^{(p)} \circ e^{(i)} \circ e^{(j)} \circ e^{(k)} \in \mathbb R^{r \times r \times r \times r},$$

where $\circ$ denotes the outer product. Entrywise,

$$\mathcal T_{pijk}[a,b,c,d] = e^{(p)}_a\, e^{(i)}_b\, e^{(j)}_c\, e^{(k)}_d.$$

The flattened interaction tensor $\mathrm{vec}(\mathcal T_{pijk})$ is then linearly projected using the flattened core $\mathrm{vec}(\mathcal G)$:

$$\hat y_{pijk} = \sigma\left( \mathrm{vec}(\mathcal G)^\top \mathrm{vec}(\mathcal T_{pijk}) \right),$$

where $\sigma$ is a sigmoid or identity activation. This neural mapping allows fully expressive four-way nonlinear interactions parameterized by the core tensor.
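
A minimal PyTorch sketch of this forward pass (the class name, initialization scale, and module layout are assumptions; the paper specifies PyTorch but not these details):

```python
import torch
import torch.nn as nn

class NeuTucF(nn.Module):
    """Four-mode neural Tucker factorization over quantized indices (sketch)."""
    def __init__(self, K, r=5, activation=torch.sigmoid):
        super().__init__()
        # One (K+1) x r embedding table per mode (h, u, v, w).
        self.emb = nn.ModuleList(nn.Embedding(K + 1, r) for _ in range(4))
        # Dense core tensor G in R^{r x r x r x r}.
        self.core = nn.Parameter(0.1 * torch.randn(r, r, r, r))
        self.act = activation

    def forward(self, p, i, j, k):
        # Embed each discrete index: e^(p), e^(i), e^(j), e^(k), each (batch, r).
        ep, ei, ej, ek = (E(x) for E, x in zip(self.emb, (p, i, j, k)))
        # vec(G)^T vec(T) with T = e^(p) outer e^(i) outer e^(j) outer e^(k);
        # the einsum contracts directly, never materializing T.
        y = torch.einsum("abcd,na,nb,nc,nd->n", self.core, ep, ei, ej, ek)
        return self.act(y)
```

Contracting in a single einsum is numerically identical to flattening $\mathcal T_{pijk}$ and projecting with $\mathrm{vec}(\mathcal G)$, but avoids allocating the $r^4$ interaction tensor for every sample.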

4. Training Objective and Optimization

Training minimizes the mean squared error (MSE) over all observed entries $\Omega$:

$$\mathcal L = \frac{1}{|\Omega|} \sum_{(p,i,j,k) \in \Omega} \left( \hat y_{pijk} - y_{pijk} \right)^2 + \lambda \left( \|\mathcal G\|_F^2 + \sum_{n=1}^4 \|U^{(n)}\|_F^2 \right),$$

where $y_{pijk}$ are ground-truth Richardson numbers and $\lambda$ controls the $L_2$ weight regularization.

The implementation uses PyTorch and the Adam optimizer (learning rate $10^{-3}$), training for approximately 100 epochs in minibatches. This configuration suits high-dimensional, sparse regression over wind field tensors.
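
Under these stated settings, here is a training-loop sketch reusing the NeuTucF class above; the synthetic data and the regularization weight $\lambda$ are placeholders:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Synthetic stand-ins for observed (p, i, j, k, y) tuples from Omega.
N, K = 1024, 9
idx = [torch.randint(0, K + 1, (N,)) for _ in range(4)]
y_obs = torch.randn(N)
loader = DataLoader(TensorDataset(*idx, y_obs), batch_size=128, shuffle=True)

model = NeuTucF(K=K, r=5, activation=lambda x: x)  # identity activation
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
lam = 1e-4  # placeholder: the paper's lambda is not restated here

for epoch in range(100):
    for p, i, j, k, y in loader:
        mse = torch.mean((model(p, i, j, k) - y) ** 2)
        # Frobenius penalty on the core and all factor (embedding) matrices.
        reg = model.core.pow(2).sum() + sum(E.weight.pow(2).sum() for E in model.emb)
        loss = mse + lam * reg
        opt.zero_grad()
        loss.backward()
        opt.step()
```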

5. Inference and Reconstruction

After training, the model can impute or regress missing entries $(p,i,j,k) \notin \Omega$ by passing their discrete indices through the same embedding, outer-product, and linear-mapping pipeline. With identity activation, the inference formula is equivalent to Tucker-factor reconstruction:

$$\hat y_{pijk} = \sum_{a,b,c,d} \mathcal G_{abcd}\, U^{(1)}_{p,a}\, U^{(2)}_{i,b}\, U^{(3)}_{j,c}\, U^{(4)}_{k,d}.$$

This unified parametrization allows estimation anywhere within the quantized input domain without explicit tensor completion.
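
A short numerical check of this equivalence, using the sketch class above with identity activation (hypothetical, for illustration only):

```python
import torch

model = NeuTucF(K=9, r=5, activation=lambda x: x)
p, i, j, k = (torch.tensor([3]) for _ in range(4))

# Neural pipeline: embed -> outer product -> flattened-core projection.
y_neural = model(p, i, j, k)

# Classical Tucker reconstruction from the same core and factor rows.
rows = [E.weight[x.item()] for E, x in zip(model.emb, (p, i, j, k))]
y_tucker = torch.einsum("abcd,a,b,c,d->", model.core, *rows)

assert torch.allclose(y_neural.squeeze(), y_tucker)
```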

6. Empirical Results and Performance Metrics

Model efficacy is evaluated via five-fold cross-validation using mean absolute error (MAE), root-mean-square error (RMSE), and coefficient of determination ($R^2$):

| Model | MAE ($\downarrow$) | RMSE ($\downarrow$) | $R^2$ ($\uparrow$) |
| --- | --- | --- | --- |
| M1 (NeuTucF) | $11.64 \pm 2.47$ | $20.11 \pm 3.76$ | $0.353 \pm 0.195$ |
| Baselines (M2–M5) | Higher | Higher | Lower |

All baseline models (M2–M5) exhibit higher MAE and RMSE and lower $R^2$ than NeuTucF under matched embedding size and Tucker rank ($r = 5$). The chosen discretization and low-rank parametric structure yield the best cross-validated error; no explicit ablation of bin count or Tucker rank is reported, but these hyperparameters are fixed for fair comparison (Fan et al., 5 Dec 2025).

7. Context, Significance, and Limitations

The discretized NeuTucF approach addresses the challenge of estimating turbulence quantities from sparse, incomplete, or irregularly sampled atmospheric measurements, particularly in settings where only wind profile radar is available. Discretization adapts continuous inputs for compatibility with embedding-based neural tensor factorization models, while the full four-way Tucker core captures complex spatio-temporal dependencies. The method demonstrates robust imputation and regression accuracy across a variety of low-altitude turbulence datasets. A plausible implication is that similar discretized neural factorization architectures could be adapted for regression tasks involving other high-dimensional, spatio-temporal geophysical data.

However, the model's effectiveness is contingent on the appropriateness of the chosen binning strategy and Tucker rank. The paper reports no explicit sensitivity analysis on these hyperparameters, leaving open the question of how choices such as $K$ (the number of quantile bins) affect generalization or bias (Fan et al., 5 Dec 2025).

