Papers
Topics
Authors
Recent
2000 character limit reached

GloptiNets: Scalable Non-Convex Optimization with Certificates (2306.14932v3)

Published 26 Jun 2023 in math.OC and cs.LG

Abstract: We present a novel approach to non-convex optimization with certificates, which handles smooth functions on the hypercube or on the torus. Unlike traditional methods that rely on algebraic properties, our algorithm exploits the regularity of the target function intrinsic in the decay of its Fourier spectrum. By defining a tractable family of models, we allow at the same time to obtain precise certificates and to leverage the advanced and powerful computational techniques developed to optimize neural networks. In this way the scalability of our approach is naturally enhanced by parallel computing with GPUs. Our approach, when applied to the case of polynomials of moderate dimensions but with thousands of coefficients, outperforms the state-of-the-art optimization methods with certificates, as the ones based on Lasserre's hierarchy, addressing problems intractable for the competitors.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. Exponential convergence of sum-of-squares hierarchies for trigonometric polynomials. SIAM Journal on Optimization, 33(3):2137–2159, 2023.
  2. Infinite-Dimensional Sums-of-Squares for Optimal Control. In 2022 IEEE 61st Conference on Decision and Control (CDC), pages 577–582, December 2022. doi: 10.1109/CDC51059.2022.9992396.
  3. Convex optimization. Cambridge university press, 2004.
  4. Sub-Gaussian Mean Estimators. The Annals of Statistics, 44(6):2695–2725, 2016.
  5. Hyperbolic Cross Approximation. arXiv:2211.04889, April 2017.
  6. Deep learning. MIT press, 2016.
  7. The Moment-SOS Hierarchy, volume 4 of Optimization and Its Applications. World Scientific Publishing Europe Ltd., December 2020. doi: 10.1142/q0252.
  8. The geometric measure of multipartite entanglement and the singular values of a hypermatrix. Journal of Mathematical Physics, 51(7):072102, July 2010.
  9. Handbook of global optimization, volume 2. Springer Science & Business Media, 2013.
  10. Lasserre Hierarchy for Large Scale Polynomial Optimization in Real and Complex Variables. SIAM Journal on Optimization, 28(2):1017–1048, January 2018.
  11. Jean B. Lasserre. Global Optimization with Polynomials and the Problem of Moments. SIAM Journal on Optimization, 11(3):796–817, January 2001. doi: 10.1137/S1052623400366802.
  12. Jean Bernard Lasserre. Moments, Positive Polynomials and Their Applications, volume 1 of Series on Optimization and Its Applications. October 2009. doi: 10.1142/p665.
  13. An effective version of schmüdgen’s positivstellensatz for the hypercube. Optimization Letters, September 2022. doi: 10.1007/s11590-022-01922-5.
  14. Exploiting Constant Trace Property in Large-scale Polynomial Optimization. ACM Transactions on Mathematical Software, 48(4):40:1–40:39, December 2022.
  15. Non-parametric Models for Non-negative Functions. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 12816–12826. Curran Associates, Inc., 2020.
  16. Pablo Moscato et al. On evolution, search, optimization, genetic algorithms and martial arts: Towards memetic algorithms. Caltech concurrent computation program, C3P Report, 826(1989):37, 1989.
  17. Near-optimal estimation of smooth transport maps with kernel sums-of-squares. arXiv:2112.01907, December 2021.
  18. An Introduction to the Theory of Reproducing Kernel Hilbert Spaces. Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 2016. doi: 10.1017/CBO9781316219232.
  19. PSD Representations for Effective Probability Models. In Advances in Neural Information Processing Systems, volume 34, pages 19411–19422. Curran Associates, Inc., 2021.
  20. Finding Global Minima via Kernel Approximations. arXiv:2012.11978, December 2020.
  21. Walter Rudin. The Basic Theorems of Fourier Analysis. In Fourier Analysis on Groups, chapter 1, pages 1–34. John Wiley & Sons, Ltd, 1990. doi: 10.1002/9781118165621.ch1.
  22. Support vector machines. Springer Science & Business Media, 2008.
  23. Pascal Van Hentenryck. Machine Learning for Optimal Power Flows. INFORMS Tutorials in Operations Research, October 18.
  24. Simulated annealing. Springer, 1987.
  25. Sums of Squares and Semidefinite Program Relaxations for Polynomial Optimization Problems with Structured Sparsity. SIAM Journal on Optimization, 17(1):218–242, January 2006. doi: 10.1137/050623802.
  26. Phase Recovery, MaxCut and Complex Semidefinite Programming, July 2013.
  27. Exploiting Sparsity in Complex Polynomial Optimization. Journal of Optimization Theory and Applications, 192(1):335–359, January 2022.
  28. Chordal-TSSOS: A Moment-SOS Hierarchy That Exploits Term Sparsity with Chordal Extension. SIAM Journal on Optimization, 31(1):114–141, January 2021a. doi: 10.1137/20M1323564.
  29. TSSOS: A Moment-SOS Hierarchy That Exploits Term Sparsity. SIAM Journal on Optimization, 31(1):30–58, January 2021b. doi: 10.1137/19M1307871.
  30. G. N. Watson. A Treatise on the Theory of Bessel Functions. Cambridge University Press, 1922.
  31. Non-Convex Optimization with Certificates and Fast Rates Through Kernel Sums of Squares. In Proceedings of Thirty Fifth Conference on Learning Theory, pages 4620–4642. PMLR, June 2022.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Paper to Video (Beta)

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.