Near-Optimal Tensor PCA via Normalized Stochastic Gradient Ascent with Overparameterization (2510.14329v1)

Published 16 Oct 2025 in math.OC

Abstract: We study the Order-$k$ ($k \geq 4$) spiked tensor model for the tensor principal component analysis (PCA) problem: given $N$ i.i.d. observations of a $k$-th order tensor generated from the model $\mathbf{T} = \lambda \cdot v_^{\otimes k} + \mathbf{E}$, where $\lambda > 0$ is the signal-to-noise ratio (SNR), $v_$ is a unit vector, and $\mathbf{E}$ is a random noise tensor, the goal is to recover the planted vector $v_$. We propose a normalized stochastic gradient ascent (NSGA) method with overparameterization for solving the tensor PCA problem. Without any global (or spectral) initialization step, the proposed algorithm successfully recovers the signal $v_$ when $N\lambda² \geq \widetilde{\Omega}(d^{\lceil k/2 \rceil})$, thereby breaking the previous conjecture that (stochastic) gradient methods require at least $\Omega(d^{k-1})$ samples for recovery. For even $k$, the $\widetilde{\Omega}(d^{k/2})$ threshold coincides with the optimal threshold under computational constraints, attained by sum-of-squares relaxations and related algorithms. Theoretical analysis demonstrates that the overparameterized stochastic gradient method not only establishes a significant initial optimization advantage during the early learning phase but also achieves strong generalization guarantees. This work provides the first evidence that overparameterization improves statistical performance relative to exact parameterization that is solved via standard continuous optimization.