Cryptographic Hardness of Score Estimation
Abstract: We show that $L2$-accurate score estimation, in the absence of strong assumptions on the data distribution, is computationally hard even when sample complexity is polynomial in the relevant problem parameters. Our reduction builds on the result of Chen et al. (ICLR 2023), who showed that the problem of generating samples from an unknown data distribution reduces to $L2$-accurate score estimation. Our hard-to-estimate distributions are the "Gaussian pancakes" distributions, originally due to Diakonikolas et al. (FOCS 2017), which have been shown to be computationally indistinguishable from the standard Gaussian under widely believed hardness assumptions from lattice-based cryptography (Bruna et al., STOC 2021; Gupte et al., FOCS 2022).
- “An improved constant in Banaszczyk’s transference theorem” In arXiv preprint arXiv:1907.09020, 2019
- Wojciech Banaszczyk “New bounds in some transference theorems in the geometry of numbers” In Mathematische Annalen 296 Springer-Verlag, 1993, pp. 625–635
- “Reducibility and statistical-computational gaps from secret leakage” In Conference on Learning Theory, 2020, pp. 648–847 PMLR
- “Nearly d-linear convergence bounds for diffusion models via stochastic localization” In The Twelfth International Conference on Learning Representations, 2024
- Dominique Bakry, Ivan Gentil and Michel Ledoux “Analysis and geometry of Markov diffusion operators” Springer, 2014
- “In Search of Non-Gaussian Components of a High-Dimensional Distribution.” In Journal of Machine Learning Research 7.2, 2006
- Adam Block, Youssef Mroueh and Alexander Rakhlin “Generative modeling with denoising auto-encoders and Langevin sampling” In arXiv preprint arXiv:2002.00107, 2020
- “Public-Key Encryption from Continuous LWE” In Cryptology ePrint Archive, 2022
- Afonso S Bandeira, Amelia Perry and Alexander S Wein “Notes on computational-to-statistical gaps: predictions using statistical physics” In Portugaliae Mathematica 75.2, 2018, pp. 159–186
- “Computational lower bounds for sparse PCA” In arXiv preprint arXiv:1304.0828, 2013
- “Continuous LWE” In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021
- Ben Brubaker “In Neural Networks, Unbreakable Locks Can Hide Invisible Doors” In Quanta magazine, 2023 URL: https://www.quantamagazine.org/cryptographers-show-how-to-hide-invisible-backdoors-in-ai-20230302/
- “Sum-of-squares proofs and the quest toward optimal algorithms” In arXiv preprint arXiv:1404.5236, 2014
- Clément L Canonne “A short note on an inequality between KL and TV” In arXiv preprint arXiv:2202.07198, 2022
- Yi-Hsiu Chen, Kai-Min Chung and Jyun-Jie Liao “On the complexity of simulating auxiliary input” In Annual International Conference on the Theory and Applications of Cryptographic Techniques, 2018, pp. 371–390 Springer
- “Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions” In International Conference on Learning Representations (ICLR), 2023
- Miranda Christ, Sam Gunn and Or Zamir “Undetectable watermarks for language models” In arXiv preprint arXiv:2306.09194, 2023
- Sinho Chewi “Log-concave sampling”, 2024 URL: https://chewisinho.github.io/main.pdf
- “Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data” In International Conference on Machine Learning, 2023, pp. 4672–4712 PMLR
- Hongrui Chen, Holden Lee and Jianfeng Lu “Improved analysis of score-based generative modeling: User-friendly bounds under minimal smoothness assumptions” In International Conference on Machine Learning, 2023, pp. 4735–4763 PMLR
- Valentin De Bortoli “Convergence of denoising diffusion models under the manifold hypothesis” In arXiv preprint arXiv:2208.05314, 2022
- Scott Decatur, Oded Goldreich and Dana Ron “Computational sample complexity” In Proceedings of the tenth annual conference on Computational learning theory, 1997, pp. 130–142
- “Statistical query lower bounds for tensor pca” In Journal of Machine Learning Research 22.83, 2021, pp. 1–51
- “Non-gaussian component analysis via lattice basis reduction” In Conference on Learning Theory, 2022, pp. 4535–4547 PMLR
- Ilias Diakonikolas and Daniel M. Kane “Algorithmic High-Dimensional Robust Statistics” Cambridge university press Cambridge, 2023
- “Algorithms and sq lower bounds for pac learning one-hidden-layer relu networks” In Conference on Learning Theory, 2020, pp. 1514–1539 PMLR
- Ilias Diakonikolas, Daniel Kane and Lisheng Ren “Near-optimal cryptographic hardness of agnostically learning halfspaces and relu regression under gaussian marginals” In International Conference on Machine Learning, 2023, pp. 7922–7938 PMLR
- “Outcome indistinguishability” In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021, pp. 1095–1108
- “SQ Lower Bounds for Non-Gaussian Component Analysis with Weaker Assumptions” In Advances in Neural Information Processing Systems 36, 2024
- Ilias Diakonikolas, Daniel M Kane and Alistair Stewart “Statistical query lower bounds for robust estimation of high-dimensional gaussians and gaussian mixtures” In 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS), 2017, pp. 73–84 IEEE
- Ilias Diakonikolas, Daniel Kane and Nikos Zarifis “Near-optimal sq lower bounds for agnostically learning halfspaces and relus under gaussian marginals” In Advances in Neural Information Processing Systems 33, 2020, pp. 13586–13596
- Amit Daniely, Nati Linial and Shai Shalev-Shwartz “From average case complexity to improper learning complexity” In Proceedings of the forty-sixth annual ACM symposium on Theory of computing, 2014, pp. 441–448
- “Diffusion schrödinger bridge with applications to score-based generative modeling” In Advances in Neural Information Processing Systems 34, 2021, pp. 17695–17709
- “Statistical algorithms and a lower bound for detecting planted cliques” In J. ACM 64.2 New York, NY, USA: Association for Computing Machinery, 2017
- Jerome H Friedman and John W Tukey “A projection pursuit algorithm for exploratory data analysis” In IEEE Transactions on computers 100.9 IEEE, 1974, pp. 881–890
- David Gamarnik “The overlap gap property: A topological barrier to optimizing over random structures” In Proceedings of the National Academy of Sciences 118.41 National Acad Sciences, 2021, pp. e2108492118
- “Superpolynomial lower bounds for learning one-layer neural networks using gradient descent” In International Conference on Machine Learning, 2020, pp. 3587–3596 PMLR
- “Planting undetectable backdoors in machine learning models” In arXiv preprint arXiv:2204.06974, 2022
- Aparna Gupte, Neekon Vafa and Vinod Vaikuntanathan “Continuous LWE is as Hard as LWE & Applications to Learning Gaussian Mixtures” In arXiv preprint arXiv:2204.02550, 2022
- Jonathan Ho, Ajay Jain and Pieter Abbeel “Denoising diffusion probabilistic models” In Advances in neural information processing systems 33, 2020, pp. 6840–6851
- “Multicalibration: Calibration for the (computationally-identifiable) masses” In International Conference on Machine Learning, 2018, pp. 1939–1948 PMLR
- Peter J Huber “Projection pursuit” In The annals of Statistics JSTOR, 1985, pp. 435–475
- Aapo Hyvärinen “Estimation of non-normalized statistical models by score matching.” In Journal of Machine Learning Research 6.4, 2005
- “Deep residual learning for image recognition” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778
- “How to fake auxiliary input” In Theory of Cryptography Conference, 2014, pp. 566–590 Springer
- Michael Kearns “Efficient noise-tolerant learning from statistical queries” In Journal of the ACM (JACM) 45.6 ACM New York, NY, USA, 1998, pp. 983–1006
- Dmitriy Kunisky “Spectral Barriers in Certification Problems”, 2022
- Dmitriy Kunisky, Alexander S Wein and Afonso S Bandeira “Notes on computational hardness of hypothesis testing: Predictions using the low-degree likelihood ratio” In Mathematical Analysis, its Applications and Computation: ISAAC 2019, Aveiro, Portugal, July 29–August 2 Springer, 2022, pp. 1–50
- Jean B Lasserre “Global optimization with polynomials and the problem of moments” In SIAM Journal on optimization 11.3 SIAM, 2001, pp. 796–817
- Holden Lee, Jianfeng Lu and Yixin Tan “Convergence for score-based generative modeling with polynomial complexity” In Advances in Neural Information Processing Systems 35, 2022, pp. 22870–22882
- Holden Lee, Jianfeng Lu and Yixin Tan “Convergence of score-based generative modeling for general data distributions” In International Conference on Algorithmic Learning Theory, 2023, pp. 946–985 PMLR
- “Better key sizes (and attacks) for LWE-based encryption” In Topics in Cryptology–CT-RSA 2011: The Cryptographers’ Track at the RSA Conference 2011, San Francisco, CA, USA, February 14-18, 2011. Proceedings, 2011, pp. 319–339 Springer
- “Let us build bridges: Understanding and extending diffusion generative models” In arXiv preprint arXiv:2208.14699, 2022
- “Lattice-based cryptography” In Post-quantum cryptography Springer, 2009, pp. 147–191
- “Computational barriers in minimax submatrix detection” In The Annals of Statistics 43.3 Institute of Mathematical Statistics, 2015
- “Deep networks as denoising algorithms: Sample-efficient learning of diffusion models in high-dimensional graphical models” In arXiv preprint arXiv:2309.11420, 2023
- NIST “Post-Quantum Cryptography” URL: https://csrc.nist.gov/Projects/Post-Quantum-Cryptography
- “Estimation of Wasserstein distances in the Spiked Transport Model” In Bernoulli 28.4 Bernoulli Society for Mathematical StatisticsProbability, 2022, pp. 2663–2688
- Pablo A Parrilo “Structured semidefinite programs and semialgebraic geometry methods in robustness and optimization” California Institute of Technology, 2000
- Chris Peikert “A decade of lattice cryptography” In Foundations and trends® in theoretical computer science 10.4 Now Publishers, Inc., 2016, pp. 283–424
- Jakiw Pidstrigach “Score-based generative models detect manifolds” In Advances in Neural Information Processing Systems 35, 2022, pp. 35852–35865
- “High-resolution image synthesis with latent diffusion models” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 10684–10695
- “Hierarchical text-conditional image generation with clip latents” In arXiv preprint arXiv:2204.06125, 2022
- Oded Regev “On lattices, learning with errors, random linear codes, and cryptography” In Journal of the ACM (JACM) 56.6 ACM New York, NY, USA, 2009, pp. 1–40
- “A statistical model for tensor PCA” In Advances in neural information processing systems 27, 2014
- Steven Roman “The Umbral Calculus” In Pure and Applied Mathematics Academic Press, 1984
- Kulin Shah, Sitan Chen and Adam Klivans “Learning mixtures of gaussians using the ddpm objective” In Advances in Neural Information Processing Systems 36, 2024
- “Photorealistic text-to-image diffusion models with deep language understanding” In Advances in neural information processing systems 35, 2022, pp. 36479–36494
- “Generative modeling by estimating gradients of the data distribution” In Advances in neural information processing systems 32, 2019
- Yang Song and Diederik P Kingma “How to train your energy-based models” In arXiv preprint arXiv:2101.03288, 2021
- “Applied stochastic differential equations” Cambridge University Press, 2019
- Shai Shalev-Shwartz, Ohad Shamir and Eran Tromer “Using more data to speed-up training time” In Artificial Intelligence and Statistics, 2012, pp. 1019–1027 PMLR
- Noah Stephens-Davidowitz “On the Gaussian Measure Over Lattices.”, 2017
- “Deep unsupervised learning using nonequilibrium thermodynamics” In International conference on machine learning, 2015, pp. 2256–2265 PMLR
- Min Jae Song, Ilias Zadik and Joan Bruna “On the Cryptographic Hardness of Learning Single Periodic Neurons” In Advances in Neural Processing Systems (NeurIPS), 2021
- Stefan Tiegel “Hardness of agnostically learning halfspaces from worst-case lattice problems” In The Thirty Sixth Annual Conference on Learning Theory, 2023, pp. 3029–3064 PMLR
- Luca Trevisan, Madhur Tulsiani and Salil Vadhan “Regularity, boosting, and efficiently simulating every high-entropy distribution” In 2009 24th Annual IEEE Conference on Computational Complexity, 2009, pp. 126–136 IEEE
- Roman Vershynin “High-dimensional probability: An introduction with applications in data science” Cambridge university press, 2018
- Pascal Vincent “A connection between score matching and denoising autoencoders” In Neural computation 23.7 MIT Press, 2011, pp. 1661–1674
- Martin J Wainwright “Constrained forms of statistical minimax: Computation, communication and privacy” In Proceedings of the International Congress of Mathematicians, 2014, pp. 13–21
- Andre Wibisono, Yihong Wu and Kaylee Yingxi Yang “Optimal score estimation via empirical Bayes smoothing” In arXiv preprint arXiv:2402.07747, 2024
- “Statistical problems with planted structures: Information theoretical and computational limits” In Information-Theoretic Methods in Data Science 383 Cambridge University Press, 2021, pp. 13
- Kaylee Yingxi Yang and Andre Wibisono “Convergence of the Inexact Langevin Algorithm and Score-based Generative Models in KL Divergence” In arXiv preprint arXiv:2211.01512, 2022
- “Statistical physics of inference: Thresholds and algorithms” In Advances in Physics 65.5 Taylor & Francis, 2016, pp. 453–552
- “Lattice-based methods surpass sum-of-squares in clustering” In Conference on Learning Theory, 2022, pp. 1247–1248 PMLR
- Yuchen Zhang, Martin J Wainwright and Michael I Jordan “Lower bounds on the performance of polynomial-time algorithms for sparse linear regression” In Conference on Learning Theory, 2014, pp. 921–948 PMLR
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.