
Learning to optimize with convergence guarantees using nonlinear system theory (2403.09389v2)

Published 14 Mar 2024 in eess.SY, cs.LG, and cs.SY

Abstract: The increasing reliance on numerical methods for controlling dynamical systems and training machine learning models underscores the need for algorithms that dependably and efficiently navigate complex optimization landscapes. Classical gradient descent methods offer strong theoretical guarantees for convex problems; however, they demand meticulous hyperparameter tuning for non-convex ones. The emerging paradigm of learning to optimize (L2O) automates the discovery of algorithms with optimized performance by leveraging learning models and data; yet, it lacks a theoretical framework to analyze the convergence of the learned algorithms. In this paper, we fill this gap by harnessing nonlinear system theory. Specifically, we propose an unconstrained parametrization of all convergent algorithms for smooth non-convex objective functions. Notably, our framework is directly compatible with automatic differentiation tools, ensuring convergence by design while learning to optimize.
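To make the L2O idea concrete, here is a minimal sketch of an optimizer whose update combines a gradient step with a learned component, where convergence is preserved by construction. All names (`learned_term`, `safeguarded_step`) and the clipping rule are illustrative assumptions, not the paper's actual parametrization, which is more general and builds on nonlinear system theory:

```python
# Sketch: a "convergence by design" L2O-style update (illustrative only).
# The learned term is clipped so that the overall step remains a descent
# direction; the paper's parametrization is more general than this.

def grad(x):
    # Gradient of the toy objective f(x) = (x - 3)^2.
    return 2.0 * (x - 3.0)

def learned_term(x, g):
    # Stand-in for a trainable model (e.g. a small neural network).
    # Here it is a fixed momentum-like heuristic, purely for illustration.
    return 0.9 * g

def safeguarded_step(x, lr=0.1, c=0.5):
    g = grad(x)
    d = learned_term(x, g)
    # Enforce |d| <= c * |g| with c < 1, so the update -lr * (g + d)
    # stays a descent step regardless of what the learned model outputs.
    if abs(d) > c * abs(g):
        d = c * abs(g) * (1.0 if d > 0 else -1.0)
    return x - lr * (g + d)

x = 10.0
for _ in range(100):
    x = safeguarded_step(x)
print(round(x, 4))  # -> 3.0, the minimizer of f
```

Because the safeguard is a smooth, differentiable operation in the typical case, such a scheme remains compatible with automatic differentiation, which is the property the abstract emphasizes.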


