Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
134 tokens/sec
GPT-4o
10 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Tensor cumulants for statistical inference on invariant distributions (2404.18735v1)

Published 29 Apr 2024 in math.ST, cs.CC, cs.DS, math.PR, stat.ML, and stat.TH

Abstract: Many problems in high-dimensional statistics appear to have a statistical-computational gap: a range of values of the signal-to-noise ratio where inference is information-theoretically possible, but (conjecturally) computationally intractable. A canonical such problem is Tensor PCA, where we observe a tensor $Y$ consisting of a rank-one signal plus Gaussian noise. Multiple lines of work suggest that Tensor PCA becomes computationally hard at a critical value of the signal's magnitude. In particular, below this transition, no low-degree polynomial algorithm can detect the signal with high probability; conversely, various spectral algorithms are known to succeed above this transition. We unify and extend this work by considering tensor networks, orthogonally invariant polynomials where multiple copies of $Y$ are "contracted" to produce scalars, vectors, matrices, or other tensors. We define a new set of objects, tensor cumulants, which provide an explicit, near-orthogonal basis for invariant polynomials of a given degree. This basis lets us unify and strengthen previous results on low-degree hardness, giving a combinatorial explanation of the hardness transition and of a continuum of subexponential-time algorithms that work below it, and proving tight lower bounds against low-degree polynomials for recovering rather than just detecting the signal. It also lets us analyze a new problem of distinguishing between different tensor ensembles, such as Wigner and Wishart tensors, establishing a sharp computational threshold and giving evidence of a new statistical-computational gap in the Central Limit Theorem for random tensors. Finally, we believe these cumulants are valuable mathematical objects in their own right: they generalize the free cumulants of free probability theory from matrices to tensors, and share many of their properties, including additivity under additive free convolution.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (89)
  1. Random matrices and complexity of spin glasses. Communications on Pure and Applied Mathematics, 66(2):165–201, 2013.
  2. On the heat equation and the index theorem. Inventiones Math., 19:279–330, 1973.
  3. Homotopy analysis for tensor PCA. In Conference on Learning Theory, pages 79–104. PMLR, 2017.
  4. Finite free cumulants: multiplicative convolutions, genus expansion and infinitesimal distributions. Transactions of the American Mathematical Society, 376(06):4383–4420, 2023.
  5. Random tensor theory: extending random matrix theory to mixtures of random product states. Communications in Mathematical Physics, 310(1):25–74, 2012.
  6. Graph matrices: norm bounds and applications. arXiv preprint arXiv:1604.03423, 2016.
  7. Cumulants for finite free convolution. Journal of Combinatorial Theory, Series A, 155:244–266, 2018.
  8. Teodor Banica. The orthogonal Weingarten formula in compact form. Letters in Mathematical Physics, 91(2):105–118, 2010.
  9. Reducibility and statistical-computational gaps from secret leakage. In 33rd Annual Conference on Learning Theory (COLT 2020), pages 648–847. PMLR, 2020.
  10. On the Fourier coefficients of high-dimensional random geometric graphs. arXiv preprint arXiv:2402.12589, 2024.
  11. De Finetti-style results for Wishart matrices: Combinatorial structure and phase transitions. arXiv preprint arXiv:2103.14011, 2021.
  12. Phase transitions for detecting latent geometry in random graphs. Probability Theory and Related Fields, 178(3):1215–1289, 2020.
  13. How to iron out rough landscapes and get optimal performances: averaged gradient descent and its application to tensor PCA. Journal of Physics A: Mathematical and Theoretical, 53(17):174003, 2020.
  14. Testing for high-dimensional geometry in random graphs. Random Structures & Algorithms, 49(3):503–532, 2016.
  15. The Franz-Parisi criterion and computational trade-offs in high dimensional statistics. Advances in Neural Information Processing Systems, 35:33831–33844, 2022.
  16. Multiplicative approximations for polynomial optimization over the unit sphere. Electron. Colloquium Comput. Complex., TR16-185, 2016.
  17. Sum-of-squares certificates for maxima of random tensors on the sphere. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2017, volume 81 of LIPIcs, pages 31:1–31:20. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017.
  18. Universality in p𝑝pitalic_p-spin glasses with correlated disorder. Journal of Statistical Mechanics: Theory and Experiment, 2013(02):L02003, 2013.
  19. A nearly tight sum-of-squares lower bound for the planted clique problem. SIAM Journal on Computing, 48(2):687–735, 2019.
  20. Reconfiguration of graphs with connectivity constraints. In International Workshop on Approximation and Online Algorithms, pages 295–309. Springer, 2018.
  21. Béla Bollobás. The asymptotic number of unlabelled regular graphs. Journal of the London Mathematical Society, 2(2):201–206, 1982.
  22. Remi Bonnin. Universality of the Wigner-Gurau limit for random tensors. arXiv preprint arxiv:2404.14144, 2024.
  23. Richard Brauer. On algebras which are connected with the semisimple continuous groups. Annals of Mathematics, 38(4):857–872, 1937.
  24. All real eigenvalues of symmetric tensors. SIAM Journal on Matrix Analysis and Applications, 35(4):1582–1601, 2014.
  25. The tensor Harish-Chandra–Itzykson–Zuber integral II: Detecting entanglement in large quantum systems. Communications in Mathematical Physics, pages 1–48, 2023.
  26. The tensor Harish-Chandra–Itzykson–Zuber integral I: Weingarten calculus and a generalization of monotone Hurwitz numbers. Journal of the European Mathematical Society, 2023.
  27. Second order freeness and fluctuations of random matrices, III. Higher order freeness and free cumulants. Documenta Mathematica, 12, 07 2006.
  28. Statistical and computational phase transitions in group testing. In Conference on Learning Theory, pages 4764–4781. PMLR, 2022.
  29. Benoît Collins. Moments and cumulants of polynomial random variables on unitary groups, the Itzykson-Zuber integral, and free probability. International Mathematics Research Notices, 2003(17):953–982, 2003.
  30. Integration with respect to the haar measure on unitary, orthogonal and symplectic group. Communications in Mathematical Physics, 264(3):773–795, 2006.
  31. Fast algorithm for overcomplete order-3 tensor decomposition. arXiv preprint arXiv:2202.06442, 2022.
  32. Low-degree hardness of detection for correlated Erdős-Rényi graphs. arXiv preprint arXiv:2311.15931, 2023.
  33. Estimating rank-one spikes from heavy-tailed noise via self-avoiding walks. arXiv preprint arXiv:2008.13735, 2020.
  34. Detection of dense subhypergraphs by low-degree polynomials. arXiv preprint arXiv:2304.08135, 2023.
  35. Oleg Evnin. Melonic dominance and the largest eigenvalue of a large random tensor. Letters in Mathematical Physics, 111(3):66, 2021.
  36. S. Geršgorin. Über die abgrenzung der eigenwerte einer matrix. Bulletin de l’Académie des Sciences de l’URSS. Classe des sciences mathématiques et na, pages 749–754, 1931.
  37. Sum-of-squares lower bounds for Sherrington-Kirkpatrick via planted affine planes. In 61st Annual Symposium on Foundations of Computer Science (FOCS 2020), pages 954–965. IEEE, 2020.
  38. Hardness of random optimization problems for Boolean circuits, low-degree polynomials, and Langevin dynamics. SIAM Journal on Computing, 53(1):1–46, 2024.
  39. Razvan Gurau. Universality for random tensors. In Annales de l’IHP Probabilités et statistiques, volume 50, pages 1474–1525, 2014.
  40. Răzvan Gheorghe Gurău. Random tensors. Oxford University Press, 2017.
  41. Razvan Gurau. On the generalization of the Wigner semicircle law to real symmetric tensors. arXiv preprint arXiv:2004.02660, 2020.
  42. Representations and Invariants of the Classical Groups. Cambridge University Press, 1998.
  43. Seifollah Louis Hakimi. On realizability of a set of integers as degrees of the vertices of a linear graph II. Uniqueness. Journal of the Society for Industrial and Applied Mathematics, 11(1):135–147, 1963.
  44. Matthew B Hastings. Classical and quantum algorithms for tensor principal component analysis. Quantum, 4:237, 2020.
  45. The power of sum-of-squares for detecting hidden structures. In 58th Annual Symposium on Foundations of Computer Science (FOCS), pages 720–731. IEEE, 2017.
  46. Samuel Hopkins. Statistical inference and the sum of squares method. PhD thesis, Cornell University, 2018.
  47. Asymptotic formulaæ in combinatory analysis. Proceedings of the London Mathematical Society, 2(1):75–115, 1918.
  48. Efficient Bayesian estimation from few samples: community detection and related problems. In 58th Annual Symposium on Foundations of Computer Science (FOCS), pages 379–390. IEEE, 2017.
  49. Tensor principal component analysis via sum-of-square proofs. In Peter Grünwald, Elad Hazan, and Satyen Kale, editors, Proc. 28th Conference on Learning Theory, volume 40 of Proceedings of Machine Learning Research, pages 956–1006, 2015.
  50. Fast spectral algorithms from sum-of-squares proofs: tensor decomposition and planted sparse vectors. In Proceedings of the Forty-Eighth Annual ACM Symposium on Theory of Computing, pages 178––191, 2016.
  51. L. Isserlis. On a formula for the product-moment coefficient of any order of a normal frequency distribution in any number of variables. Biometrika, 12:134–139, 1918.
  52. Statistical thresholds for tensor PCA. Annals of Applied Probability, 30(4):1910–1933, 2020.
  53. Almost-orthogonal bases for inner product polynomials. arXiv preprint arXiv:2107.00216, 2021.
  54. Sum-of-squares lower bounds for densest k𝑘kitalic_k-subgraph. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 84–95, 2023.
  55. Is planted coloring easier than planted clique? In The Thirty Sixth Annual Conference on Learning Theory, pages 5343–5372. PMLR, 2023.
  56. Notes on computational hardness of hypothesis testing: Predictions using the low-degree likelihood ratio. In Paula Cerejeiras and Michael Reissig, editors, Mathematical Analysis, its Applications and Computation, pages 1–50, Cham, 2022. Springer International Publishing.
  57. Testing thresholds for high-dimensional sparse random geometric graphs. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pages 672–677, 2022.
  58. Laurent Massoulié. Community detection thresholds and the weak ramanujan property. In Proceedings of the forty-sixth annual ACM symposium on Theory of computing, pages 694–703, 2014.
  59. High-temperature expansions and message passing algorithms. Journal of Statistical Mechanics: Theory and Experiment, 2019(11):113301, 2019.
  60. Dan Mikulincer. A CLT in Stein’s distance for generalized Wishart matrices and higher order tensors. arXiv preprint arXiv:2002.10846, 2020.
  61. A critical point for random graphs with a given degree sequence. Random Structures & Algorithms, 6(2-3):161–180, 1995.
  62. A graph integral formulation of the circuit partition polynomial. Combinatorics, Probability & Computing, 20(6):911, 2011.
  63. A statistical model for tensor PCA. In Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2, NIPS’14, page 2897–2905. MIT Press, 2014.
  64. Free Probability and Random Matrices, volume 35 of Fields Institute Monographs. Springer, 2017.
  65. Spectral methods from tensor networks. In 51st Annual ACM SIGACT Symposium on Theory of Computing (STOC 2019), pages 926–937, 2019.
  66. Lectures on the combinatorics of free probability, volume 13. Cambridge University Press, 2006.
  67. What is…a free cumulant? Notices of the American Mathematical Society, 58:300–301, 2011.
  68. A new framework for tensor PCA based on trace invariants. 2020.
  69. Random tensor theory for tensor decomposition. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7):7913–7921, 2022.
  70. Mohamed Ouerfelli. New perspectives and tools for Tensor Principal Component Analysis and beyond. PhD thesis, Université Paris-Saclay, 2022.
  71. Sub-exponential time sum-of-squares lower bounds for principal components analysis. Advances in Neural Information Processing Systems, 35:35724–35740, 2022.
  72. Claudio Procesi. The invariant theory of n×n𝑛𝑛n\times nitalic_n × italic_n matrices. Advances in mathematics, 19(3):306–381, 1976.
  73. Tensor eigenvalues and their applications, volume 39. Springer, 2018.
  74. Liqun Qi. Eigenvalues of a real supersymmetric tensor. Journal of Symbolic Computation, 40(6):1302–1324, 2005.
  75. Liqun Qi. Eigenvalues and invariants of tensors. Journal of Mathematical Analysis and Applications, 325(2):1363–1377, 2007.
  76. Strongly refuting random CSPs below the spectral threshold. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, pages 121–131, 2017.
  77. Strongly refuting random CSPs below the spectral threshold. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, page 121–131, 2017.
  78. Is it easier to count communities than find them? In 14th Innovations in Theoretical Computer Science Conference (ITCS 2023). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2023.
  79. Guilhem Semerjian. Matrix denoising: Bayes-optimal estimators via low-degree polynomials. arXiv preprint arxiv:2402.16719, 2024.
  80. Eliran Subag. The complexity of spherical p𝑝pitalic_p-spin models — a second moment approach. The Annals of Probability, 45(5):3385–3450, 2017.
  81. Computational barriers to estimation from low-degree polynomials. The Annals of Statistics, 50(3):1833–1858, 2022.
  82. The Kikuchi hierarchy and tensor PCA. In 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, pages 1446–1468. IEEE Computer Society, 2019.
  83. Alexander S Wein. Average-case complexity of tensor decomposition for low-degree polynomials. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing (STOC 2023), pages 1685–1698, 2023.
  84. Hans Wenzl. On the structure of Brauer’s centralizer algebras. Annals of Mathematics, 128(1):173–193, 1988.
  85. Hermann Weyl. The Classical Groups: Their Invariants and Representations. 1946.
  86. G. C. Wick. The evaluation of the collision matrix. Physical Review, 80:268–272, 1950.
  87. Todd G Will. Switching distance between graphs with the same degrees. SIAM Journal on Discrete Mathematics, 12(3):298–306, 1999.
  88. Sharp analysis of power iteration for tensor PCA. arXiv preprint arXiv:2401.01047, 2024.
  89. Paul Zinn-Justin. Jucys-Murphy elements and Weingarten matrices. arXiv preprint arXiv:0907.2719, 2009.
Citations (4)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com