An efficient uniqueness theorem for overcomplete tensor decomposition (2404.07801v1)
Abstract: We give a new, constructive uniqueness theorem for tensor decomposition. It applies to order 3 tensors of format $n \times n \times p$ and can prove uniqueness of decomposition for generic tensors up to rank $r=4n/3$ as soon as $p \geq 4$. One major advantage over Kruskal's uniqueness theorem is that our theorem has an algorithmic proof, and the resulting algorithm is efficient. Like the uniqueness theorem, it applies in the range $n \leq r \leq 4n/3$. As a result, we obtain the first efficient algorithm for overcomplete decomposition of generic tensors of order 3. For instance, prior to this work it was not known how to efficiently decompose generic tensors of format $n \times n \times n$ and rank $r=1.01n$ (or rank $r \leq (1+\epsilon) n$, for some constant $\epsilon >0$). Efficient overcomplete decomposition of generic tensors of format $n \times n \times 3$ remains an open problem. Our results are based on the method of commuting extensions pioneered by Strassen for the proof of his $3n/2$ lower bound on tensor rank and border rank. In particular, we rely on an algorithm for the computation of commuting extensions recently proposed in a companion paper, and on the classical diagonalization-based "Jennrich algorithm" for undercomplete tensor decomposition.
- MD Atkinson and S Lloyd. Bounds on the ranks of some 3-tensors. Linear Algebra and its Applications, 31:19–31, 1980.
- Pseudospectral shattering, the sign function, and diagonalization in nearly matrix multiplication time. In 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 529–540, 2020. arXiv preprint arxiv.org/1912.08805.
- Smoothed analysis of tensor decompositions. In Proc. 46th annual ACM symposium on Theory of Computing (STOC), pages 594–603, 2014.
- Commuting extensions and cubature formulae. Numerische Mathematik, 101:479–500, 2005.
- Canonical polyadic decomposition of third-order tensors: Reduction to generalized eigenvalue decomposition. SIAM Journal on Matrix Analysis and Applications, 35(2):636–660, 2014.
- Learning sums of powers of low-degree polynomials in the non-degenerate case. In Proc. 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 889–899, 2020.
- Dimitri Grigoriev. Multiplicative complexity of a pair of bilinear forms and of the polynomial multiplication. In Proc. International Symposium on Mathematical Foundations of Computer Science (MFCS), pages 250–256, 1978.
- Richard Harshman. Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multimodal factor analysis. UCLA working papers in phonetics, 1970.
- Johan Håstad. Tensor rank is NP-complete. Journal of Algorithms, 11(4):644–654, 1990.
- Fast spectral algorithms from sum-of-squares proofs: tensor decomposition and planted sparse vectors. In Proc. 48th Annual Symposium on Theory of Computing (STOC), pages 178–191, 2016.
- Matrix Analysis. Cambridge University Press (second edition), 2013.
- Joseph JaJa. Optimal evaluation of pairs of bilinear forms. In Proc. 10th annual ACM Symposium on Theory of Computing (STOC), pages 173–183, 1978.
- Pascal Koiran. On tensor rank and commuting matrices. arXiv preprint arXiv:2006.02374, 2020.
- Pascal Koiran. Uniqueness and computation of commuting extensions. https://arxiv.org/abs/2401.01302, 2024.
- Absolute reconstruction for sums of powers of linear forms: degree 3 and beyond. Computational Complexity, 32(2):1–66, 2023.
- Complete decomposition of symmetric tensors in linear time and polylogarithmic precision. In CIAC (International Conference on Algorithms and Complexity), 2023. https://arxiv.org/abs/2211.07407.
- Undercomplete decomposition of symmetric tensors in linear time, and smoothed analysis of the condition number. https://arxiv.org/abs/2403.00643, 2024.
- Derandomization and absolute reconstruction for sums of powers of linear forms. Theoretical Computer Science, 887, 2021. arXiv preprint arXiv:1912.02021.
- Joseph Kruskal. Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics. Linear algebra and its applications, 18(2):95–138, 1977.
- JM Landsberg. Kruskal’s theorem. https://arxiv.org/abs/0902.0543, 2009.
- Joseph M. Landsberg. Tensors: Geometry and applications, volume 128 of Graduate Studies in Mathematics. American Mathematical Society, 2012.
- A decomposition for three-way arrays. SIAM Journal on Matrix Analysis and Applications, 14(4):1064–1083, 1993.
- Thomas Lickteig. Typical tensorial rank. Linear Algebra and Its Applications, 69:95–120, 1985.
- Polynomial-time tensor decompositions with sum-of-squares. In Proc. 57th Annual Symposium on Foundations of Computer Science (FOCS), pages 438–446, 2016.
- Ankur Moitra. Algorithmic aspects of machine learning. Cambridge University Press, 2018.
- Lower bounds on arithmetic circuits via partial derivatives. Computational Complexity, 6(3):217–234, 1996. Conference version in FOCS’95.
- Ran Raz. Tensor-rank and lower bounds for arithmetic formulas. Journal of the ACM, 60(6):1–15, 2013.
- John Rhodes. A concise proof of Kruskal’s theorem on tensor decomposition. Linear Algebra and its Applications, 432(7):1818–1824, 2010.
- J. T. Schwartz. Fast probabilistic algorithms for verification of polynomials identities. Journal of the ACM, 27:701–717, 1980.
- Volker Strassen. Rank and optimal computation of generic tensors. Linear Algebra and its Applications, 52:645–685, 1983.
- Joseph Swernofsky. Tensor rank is hard to approximate. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM), 2018.
- The link between sufficient conditions by Harshman and by Kruskal for uniqueness in Candecomp/Parafac. Journal of Chemometrics, 23(7-8):321–323, 2009.
- Richard Zippel. Probabilistic algorithms for sparse polynomials. In Symbolic and algebraic computation, pages 216–226. Springer, 1979.