Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent (1910.01277v1)

Published 3 Oct 2019 in math.OC, cs.LG, and stat.ML

Abstract: Gradient descent and its variants are widely used in machine learning. However, oracle access of gradient may not be available in many applications, limiting the direct use of gradient descent. This paper proposes a method of estimating gradient to perform gradient descent, that converges to a stationary point for general non-convex optimization problems. Beyond the first-order stationary properties, the second-order stationary properties are important in machine learning applications to achieve better performance. We show that the proposed model-free non-convex optimization algorithm returns an $\epsilon$-second-order stationary point with $\widetilde{O}(\frac{d{2+\frac{\theta}{2}}}{\epsilon{8+\theta}})$ queries of the function for any arbitrary $\theta>0$.

Citations (7)

Summary

We haven't generated a summary for this paper yet.