Papers
Topics
Authors
Recent
Search
2000 character limit reached

Truncated Tucker Decomposition with Error Control

Updated 16 May 2026
  • The paper introduces error-controlled truncated Tucker decomposition algorithms that guarantee near-optimal low multilinear rank approximations using methods like HOSVD and randomized sketching.
  • The paper presents interpolatory CUR-type approaches that directly sample tensor fibers to preserve data structure while providing probabilistic error bounds.
  • The paper further investigates randomized, streaming, and rank-adaptive strategies to enhance scalability and accuracy in high-dimensional tensor approximations.

Truncated Tucker Decomposition with Error Control

The truncated Tucker decomposition with error control targets the computation of best-approximation or near-best-approximation low multilinear rank representations of a tensor, with explicit guarantees on the Frobenius norm error. This paradigm is critical in high-dimensional settings where efficiency, rigor, and structure preservation must be reconciled, and has motivated extensive algorithmic innovation. Approaches now include randomized sketching, interpolatory CUR/Tucker, streaming updates, and parallel implementations, all equipped with rigorous a priori or probabilistic error bounds.

1. Problem Formulation and Classical Solution

Given a dd-way tensor XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d} and target multilinear ranks (r1,,rd)(r_1, \dots, r_d), the truncated Tucker decomposition seeks orthonormal factor matrices UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n} and a core tensor GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d} minimizing the Frobenius error: minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F The classical (deterministic, optimal) approach is the truncated higher-order singular value decomposition (HOSVD), which first computes the mode-nn unfoldings X(n)X_{(n)}, extracts the leading rnr_n left singular vectors for each mode as UnU_n, and forms the core via mode products. The approximation error satisfies

XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}0

where XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}1 are the singular values of mode-XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}2 unfoldings. This forms the baseline against which all efficient and randomized truncation algorithms with error control are measured (Cao et al., 2022).

2. Interpolatory and CUR-Type Tucker Decompositions

To preserve data structure such as sparsity or nonnegativity, interpolatory or CUR-type Tucker decompositions select columns (or fibers) directly from the original tensor using sampling methods. A leading strategy is the L-DEIM (Leverage-augmented Discrete Empirical Interpolation Method), which operates as follows:

  • For each mode XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}3: (1) Unfold XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}4 along mode XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}5; (2) Approximate singular value decomposition (SVD) to extract XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}6 right singular vectors; (3) Select XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}7 columns using L-DEIM—first selecting XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}8 indices via classic DEIM, then augmenting with additional columns by leverage-score ranking.
  • Assemble the indices into sampled fiber sets XRI1×I2××Id\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_d}9, retrieve the corresponding columns to form (r1,,rd)(r_1, \dots, r_d)0.
  • Compute the core by

(r1,,rd)(r_1, \dots, r_d)1

where (r1,,rd)(r_1, \dots, r_d)2 denotes the Moore–Penrose pseudoinverse.

Error control is realized by probabilistic bounds (for the randomized L-DEIM–HOID variant) parameterized by an oversampling parameter (r1,,rd)(r_1, \dots, r_d)3, SVD quality parameters (r1,,rd)(r_1, \dots, r_d)4, (r1,,rd)(r_1, \dots, r_d)5, and the singular value decay. Explicitly, for oversampling (r1,,rd)(r_1, \dots, r_d)6, error satisfies

(r1,,rd)(r_1, \dots, r_d)7

with failure probability controlled by (r1,,rd)(r_1, \dots, r_d)8 (typically set so that the failure rate is (r1,,rd)(r_1, \dots, r_d)9 in practice) (Cao et al., 2022).

These structure-preserving decompositions are particularly advantageous when factor interpretability or data format adherence is necessary.

3. Randomized and Streaming Algorithms with Error Guarantees

Randomized algorithms dramatically reduce the computational cost of Tucker truncation and inherently facilitate error estimation via probabilistic analysis. Core methods include:

  • Randomized T-HOSVD/ST-HOSVD with power iteration and/or adaptive shifts: Each mode is sketched with a Gaussian random projection (optionally followed by power iterations and adaptive shift in the power method to enhance spectral gap), and an orthonormal basis is extracted (Che et al., 2023, Che et al., 5 Jun 2025). With oversampling UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}0, power parameter UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}1, and shift parameter UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}2, error is bounded (with high probability) in terms of higher moments of the singular value tail and decays rapidly with UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}3 and UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}4:

UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}5

where UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}6 decay with UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}7 and shifting makes UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}8 often sufficient (Che et al., 5 Jun 2025).

  • Oblivious Subspace Embeddings (OSEs) for compressed HOOI: Each mode is compressed by a JL-optimal embedding, with embedding dimension UnRIn×rnU_n \in \mathbb{R}^{I_n \times r_n}9. The HOOI iterates then operate in the compressed domain, with relative error guarantees:

GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}0

enabling rigorous GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}1-approximate solutions with substantial time and memory savings (Pietrosanu et al., 2024).

  • Streaming Adaptive ST-HOSVD: For tensors growing along a streaming mode (e.g., time), factor matrices and the core are updated incrementally in one pass without storing the entire history. Component ranks and SVD tolerances are dynamically chosen to keep the total Frobenius error below a user-specified GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}2 at all times (De et al., 2023).

These approaches deliver near-optimal error, sharp time-memory tradeoffs, and adaptivity to streaming and parallel hardware.

4. Advanced Error-Control Strategies and Rank Adaptivity

Rank-adaptive algorithms determine the minimum multilinear ranks needed to meet a prescribed absolute or relative error threshold, rather than require fixed ranks as input. In the rank-adaptive HOOI framework, each mode truncation rank GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}3 is dynamically set at each sweep by evaluating the tail singular-value energy against error budgets derived from the global constraint

GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}4

Rank reduction is monotonic (non-increasing per sweep) with convergence guaranteed after finitely many updates, and local optimality follows from the Eckart–Young theorem. Empirical results show that rank-adaptive methods recover the true underlying multilinear rank in synthetic and real data, leading to significantly more compact decompositions for a given target error (Xiao et al., 2021).

Hybrid truncation methods (e.g., in ATC) further split the total error budget between Tucker truncation and quantization, delivering tight error control post-compression through careful error tracking and blockwise parallel quantization (Baert et al., 2021).

5. Mode-Parallel and Scalable Implementations

For high-order tensors or those with extremely large modes, mode-parallel randomized algorithms enable resource-efficient Tucker truncation. Subsampled randomized HOSVD (Sub-R-HOSVD) applies randomized range-finding to suitably sampled fibers from each mode independently. With sampling budget GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}5 (where GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}6 denotes the coherence of the mode-GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}7 unfolding), expectations of the truncation error can be tightly controlled: GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}8 This does not require explicit formation of the mode unfoldings and is fully compatible with distributed memory and parallel compute environments (Iannacito et al., 22 Mar 2026).

Block-Krylov iterative sketching (rBKI-TK) further enhances error and noise-robustness for high-noise or low-signal data, attaining error within a low factor of the deterministic best multilinear-rank error with very low oversampling and power–iteration cost (Qiu et al., 2022).

6. Practical Recommendations and Comparative Perspectives

Empirical findings across large synthetic and real datasets support the following guidelines:

  • Use oversampling GRr1××rd\mathcal{G} \in \mathbb{R}^{r_1 \times \cdots \times r_d}9–minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F0 for randomized SVDs or OSEs per mode.
  • For L-DEIM or CUR-Tucker, set the DEIM parameter minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F1.
  • Adaptive-shifted power iterations with minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F2 or minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F3 suffice to closely match deterministic HOSVD accuracy.
  • Rank selection can be automated via singular-value energy partitioning to meet prescribed minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F4 thresholds (Cao et al., 2022, Che et al., 5 Jun 2025, Xiao et al., 2021).
  • For streaming or online scenarios, incremental SVDs and error splitting per non-streaming mode are mandatory for time-unbounded data (De et al., 2023).

In the presence of strong prior knowledge on tensor structure (e.g., sparsity, nonnegativity), interpolatory Tucker (e.g., L-DEIM–HOID) is preferred for structure preservation. Conversely, for extremely large, distributed, or high-order data, randomized or mode-parallel approaches are recommended for scalability and efficient error control.

A summary comparison of prominent algorithmic classes is given below.

Approach Error Guarantee Scalability Structure Preservation
Classical HOSVD/ST-HOSVD Sum of tail squares: minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F5 Moderate No
Randomized T-HOSVD/ST-HOSVD Probabilistic: minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F6 High No
L-DEIM–CUR-HOID Multiplicative factor on tail singular values High Yes (fibers/columns chosen)
OSE-compressed HOOI Relative minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F7 of best rank approximation Very High No
Block Krylov (rBKI-TK) minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F8 times best (deterministic) error Very High No
Streaming ST-HOSVD Global minUn,GXG×1U1×2×dUdF\min_{U_n,\, \mathcal{G}} \|\mathcal{X} - \mathcal{G} \times_1 U_1 \times_2 \cdots \times_d U_d\|_F9 enforced slice-by-slice Highest No

7. Limitations and Ongoing Directions

Limitations of current paradigm variants include:

  • Multiplicative error growth factors or additional tail-energy terms in interpolatory or randomized decompositions versus optimal HOSVD.
  • Non-optimality of random sketching under extreme coherence unless sampling budgets are increased.
  • Structured random embeddings (e.g., SRFT, CountSketch with adaptive shift) lack complete theoretical development in the context of Tucker error control (Che et al., 5 Jun 2025).
  • Block-Krylov and fiber-sampling techniques are sensitive to rank overestimation in low-signal regimes.
  • Multi-shift or Rayleigh-quotient adaptive strategies may further accelerate randomized computations, but their full analysis remains open.

Practical rank-adaptive stopping and distributed parallelization strategies are active areas of development, as is generalization to non-Euclidean domains (e.g., tensors with graph structure or in nonstandard data types).


References:

(Cao et al., 2022) – CUR-type Tucker via L-DEIM with error bounds (Che et al., 2023) – Randomized/approximate T-HOSVD and error analysis (Che et al., 5 Jun 2025) – Randomized (S)T-HOSVD with adaptive shift power scheme (Pietrosanu et al., 2024) – OSE-compressed HOOI with JL-type error control (Qiu et al., 2022) – rBKI-TK, randomized block Krylov with rigorous bounds (Baert et al., 2021) – Hybrid (ST-HOSVD + quantization) and error tracking (Xiao et al., 2021) – Rank-adaptive HOOI, local optimality and convergence (De et al., 2023) – Streaming, error-controlled Tucker update (Iannacito et al., 22 Mar 2026) – Mode-parallel randomized Tucker with sampling bounds

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Truncated Tucker Decomposition with Error Control.