Detecting Low-Degree Truncation (2402.08133v2)
Abstract: We consider the following basic, and very broad, statistical problem: Given a known high-dimensional distribution ${\cal D}$ over $\mathbb{R}n$ and a collection of data points in $\mathbb{R}n$, distinguish between the two possibilities that (i) the data was drawn from ${\cal D}$, versus (ii) the data was drawn from ${\cal D}|_S$, i.e. from ${\cal D}$ subject to truncation by an unknown truncation set $S \subseteq \mathbb{R}n$. We study this problem in the setting where ${\cal D}$ is a high-dimensional i.i.d. product distribution and $S$ is an unknown degree-$d$ polynomial threshold function (one of the most well-studied types of Boolean-valued function over $\mathbb{R}n$). Our main results are an efficient algorithm when ${\cal D}$ is a hypercontractive distribution, and a matching lower bound: $\bullet$ For any constant $d$, we give a polynomial-time algorithm which successfully distinguishes ${\cal D}$ from ${\cal D}|_S$ using $O(n{d/2})$ samples (subject to mild technical conditions on ${\cal D}$ and $S$); $\bullet$ Even for the simplest case of ${\cal D}$ being the uniform distribution over ${+1, -1}n$, we show that for any constant $d$, any distinguishing algorithm for degree-$d$ polynomial threshold functions must use $\Omega(n{d/2})$ samples.
- Special functions, volume 71. Cambridge University Press, Cambridge, 1999.
- M. Anthony and P. L. Bartlett. Neural Network Learning - Theoretical Foundations. Cambridge University Press, 2002.
- The expressive power of voting polynomials. Combinatorica, 14(2):1–14, 1994.
- Randomly supported independence and resistance. In Proc. 41st Annual ACM Symposium on Theory of Computing (STOC), pages 483–492. ACM, 2009.
- Martin Anthony. Classification by polynomial surfaces. Discrete Applied Mathematics, 61(2):91–103, 1995.
- Alice and Bob meet Banach, volume 223 of Mathematical Surveys and Monographs. American Mathematical Society, Providence, RI, 2017. The interface of asymptotic geometric analysis and quantum information theory.
- N. Balakrishnan and Erhard Cramer. The art of progressive censoring. Springer, 2014.
- Efficient statistics for sparse graphical models from truncated samples. In International Conference on Artificial Intelligence and Statistics, pages 1450–1458. PMLR, 2021.
- Active learning polynomial threshold functions. CoRR, abs/2201.09433, 2022.
- H. Brascamp and E. Lieb. Best constants in Young’s Inequality, its converse and its generalization to more than three functions. Advances in Mathematics, 20:151–172, 1976.
- H. Brascamp and E. Lieb. On extensions of the Brunn-Minkowski and Prékopa-Leindler theorems, including inequalities for log-concave functions and with an application to the diffusion equation. Journal of Functional Analysis, 22:366–389, 1976.
- C. Borell. Geometric bounds on the Ornstein-Uhlenbeck velocity process. Probability Theory and Related Fields, 70:1–13, 1985.
- J. Bruck and R. Smolensky. Polynomial threshold functions, AC00{}^{0}start_FLOATSUPERSCRIPT 0 end_FLOATSUPERSCRIPT functions and spectral norms. SIAM Journal on Computing, 21(1):33–42, 1992.
- T. Bloom and B. Shiffman. Zeros of random polynomials on ℂmsuperscriptℂ𝑚\mathbb{C}^{m}blackboard_C start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT. Math Res. Lett., 14:469–479, 2014.
- Polynomial threshold functions, hyperplane arrangements, and random tensors. SIAM Journal on Mathematics of Data Science, 1(4):699–729, 2019.
- Learning from satisfying assignments under continuous distributions. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 82–101. SIAM, 2020.
- Mei-Chu Chang. A polynomial bound in Freiman’s theorem. Duke Mathematical Journal, 113(3):399 – 419, 2002.
- A. Clifford Cohen. Truncated and censored samples: theory and applications. CRC Press, 2016.
- John B Conway. A course in functional analysis, volume 96. Springer, 2019.
- Learning sums of independent integer random variables. In 54th Annual IEEE Symposium on Foundations of Computer Science, pages 217–226, 2013.
- Deterministic approximate counting for juntas of degree-2 polynomial threshold functions. In Proceedings of the 29th Annual Conference on Computational Complexity (CCC), pages 229–240. IEEE, 2014.
- Learning from satisfying assignments. In Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, pages 478–497, 2015.
- On the Fourier tails of bounded functions over the discrete cube. In Proc. 38th ACM Symp. on Theory of Computing, pages 437–446, 2006.
- Learning from positive and unlabeled examples. Theor. Comput. Sci., 348(1):70–83, 2005.
- Efficient statistics, in high dimensions, from truncated samples. In 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, pages 639–649. IEEE Computer Society, 2018.
- Computationally and statistically efficient truncated regression. In Conference on Learning Theory (COLT), volume 99 of Proceedings of Machine Learning Research, pages 955–960, 2019.
- Bounded independence fools degree-2 threshold functions. In Proc. 51st IEEE Symposium on Foundations of Computer Science (FOCS), pages 11–20, 2010.
- Gaussian mean testing made simple. CoRR, abs/2210.13706, 2022.
- The optimality of polynomial regression for agnostic learning under gaussian marginals in the SQ model. In Mikhail Belkin and Samory Kpotufe, editors, Conference on Learning Theory, COLT 2021, 15-19 August 2021, Boulder, Colorado, USA, volume 134 of Proceedings of Machine Learning Research, pages 1552–1584. PMLR, 2021.
- The total variation distance between high-dimensional gaussians. arXiv:1810.08693v5, 22 May 2020, 2020.
- Convex influences. In Mark Braverman, editor, 13th Innovations in Theoretical Computer Science Conference, ITCS, volume 215 of LIPIcs, pages 53:1–53:21. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
- Testing Convex Truncation. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 4050–4082. 2023.
- Hardness results for agnostically learning low-degree polynomial threshold functions. In SODA, pages 1590–1606, 2011.
- Average sensitivity and noise sensitivity of polynomial threshold functions. SIAM Journal on Computing, 43(1):231–253, 2014.
- Truncated linear regression in high dimensions. Advances in Neural Information Processing Systems, 33:10338–10347, 2020.
- Efficient deterministic approximate counting for low-degree polynomial threshold functions. In Proceedings of the 46th Annual Symposium on Theory of Computing (STOC), pages 832–841, 2014.
- Efficient truncated linear regression with unknown noise variance. Advances in Neural Information Processing Systems, 34:1952–1963, 2021.
- R.A. Fisher. Properties and applications of HH functions. In Mathematical tables, pages 815–852, 1931.
- Efficient parameter estimation of truncated boolean product distributions. In Conference on Learning Theory (COLT), volume 125 of Proceedings of Machine Learning Research, pages 1586–1600, 2020.
- Francis Galton. An examination into the registered speeds of American trotting horses, with remarks on their value as hereditary data. Proceedings of the Royal Society of London, 62(379-387):310–315, 1897.
- C. Gotsman and N. Linial. Spectral properties of threshold functions. Combinatorica, 14(1):35–50, 1994.
- J. Hammersley. The zeros of a random polynomial. In Proc. Third Berkeley Symposium on Probability and Statistics, volume 2, pages 89–111, 1956.
- Bounding the sensitivity of polynomial threshold functions. Theory of Computing, 10(1):1–26, 2014.
- Wassily Hoeffding. A class of statistics with asymptotically normal distribution. The Collected Works of Wassily Hoeffding, pages 171–204, 1994.
- I. Ibragimov and O. Zeitouni. On roots of random polynomials. Transactions of the American Mathematical Society, 349(6):2427–2441, 1997.
- Daniel Kane. k𝑘kitalic_k-independent Gaussians fool polynomial threshold functions. In Proceedings of the 26th Conference on Computational Complexity (CCC), pages 252–261, 2011.
- Daniel Kane. A small PRG for polynomial threshold functions of Gaussians. In Proceedings of the 52nd Annual Symposium on Foundations of Computer Science (FOCS), pages 257–266, 2011.
- D. Kane. A Structure Theorem for Poorly Anticoncentrated Gaussian Chaoses and Applications to the Study of Polynomial Threshold Functions. In 53rd Annual IEEE Symposium on Foundations of Computer Science, FOCS 2012, New Brunswick, NJ, USA, October 20-23, 2012, pages 91–100, 2012.
- Daniel M Kane. The correct exponent for the gotsman-linial conjecture. In 2013 IEEE Conference on Computational Complexity, pages 56–64. IEEE, 2013.
- D. M. Kane. The correct exponent for the Gotsman-Linial Conjecture. Computational Complexity, 23(2):151–175, 2014.
- Daniel Kane. A pseudorandom generator for polynomial threshold functions of Gaussians with subpolynomial seed length. In Proceedings of the 29th Annual Conference on Computational Complexity (CCC), pages 217–228, 2014.
- D. M. Kane. A Polylogarithmic PRG for Degree 2 Threshold Functions in the Gaussian Setting. In 30th Conference on Computational Complexity, CCC 2015, June 17-19, 2015, Portland, Oregon, USA, pages 567–581, 2015.
- A polynomial restriction lemma with applications. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 615–628, 2017.
- Satisfiability and Derandomization for Small Polynomial Threshold Circuits. In Proceedings of the 22nd International Conference on Randomization and Computation (RANDOM), volume 116, pages 46:1–46:19, 2018.
- A prg for lipschitz functions of polynomials with applications to sparsest cut. In Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing, STOC ’13, page 1–10, New York, NY, USA, 2013. Association for Computing Machinery.
- A PRG for Boolean PTF of degree 2 with seed length subpolynomial in ε𝜀\varepsilonitalic_ε and logarithmic in n𝑛nitalic_n. In Proceedings of the 33rd Computational Complexity Conference (CCC), pages 2:1–2:24, 2018.
- Efficient truncated statistics with unknown truncation. In David Zuckerman, editor, 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, Baltimore, Maryland, USA, November 9-12, 2019, pages 1578–1595. IEEE Computer Society, 2019.
- M. Kearns and U. Vazirani. An Introduction to Computational Learning Theory. MIT Press, Cambridge, MA, 1994.
- Alice Lee. Table of the Gaussian “Tail” Functions; When the “Tail” is Larger than the Body. Biometrika, 10(2/3):208–214, 1914.
- R. Meka and D. Zuckerman. Pseudorandom Generators for Polynomial Threshold Functions. SIAM J. Comput., 42(3):1275–1301, 2013.
- Ryan O’Donnell. Analysis of Boolean Functions. Cambridge University Press, 2014.
- Fooling gaussian ptfs via local hyperconcentration. In Konstantin Makarychev, Yury Makarychev, Madhur Tulsiani, Gautam Kamath, and Julia Chuzhoy, editors, Proccedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 1170–1183. ACM, 2020.
- Karl Pearson. On the systematic fitting of frequency curves. Biometrika, 2:2–7, 1902.
- M. Saks. Slicing the hypercube. In Keith Walker, editor, Surveys in Combinatorics 1993, pages 211–257. London Mathematical Society Lecture Note Series 187, 1993.
- Helmut Schneider. Truncated and censored samples from normal populations. Marcel Dekker, Inc., 1986.
- Luby-velickovic-wigderson revisited: Improved correlation bounds and pseudorandom generators for depth-two circuits. In Eric Blais, Klaus Jansen, José D. P. Rolim, and David Steurer, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, (APPROX/RANDOM), volume 116 of LIPIcs, pages 56:1–56:20. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2018.
- M. Talagrand. How much are increasing sets positively correlated? Combinatorica, 16(2):243–258, 1996.
- VN Vapnik and A Ya Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications, 16(2):264, 1971.
- Santosh S. Vempala. Learning convex concepts from gaussian distributions with PCA. In 51th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2010, October 23-26, 2010, Las Vegas, Nevada, USA, pages 124–130. IEEE Computer Society, 2010.
- Paweł Wolff. Hypercontractivity of simple random variables. Studia Mathematica, 3:219–236, 2007.