On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Optimization (1901.10682v2)
Abstract: Extrapolation is a well-known technique for solving convex optimization problems and variational inequalities, and it has recently attracted attention for non-convex optimization. Several recent works have empirically demonstrated its success on machine learning tasks. However, it has not been analyzed for non-convex minimization, leaving a gap between theory and practice. In this paper, we analyze gradient descent and stochastic gradient descent with extrapolation for finding an approximate first-order stationary point of smooth non-convex optimization problems. Our convergence upper bounds show that the algorithms with extrapolation converge faster than their counterparts without extrapolation.
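For orientation, below is a minimal sketch of one common extrapolation scheme for gradient descent, the extra-gradient style update, in which a look-ahead step is taken first and the gradient at the look-ahead point drives the actual update. This is a generic illustration under assumed defaults (the step size `eta`, iteration count, and the test objective are hypothetical), not the paper's exact algorithm or its stochastic variant.

```python
import numpy as np

def gd_with_extrapolation(grad, x0, eta=0.1, num_iters=200):
    """Extra-gradient style descent (a generic sketch, not necessarily
    the exact scheme analyzed in the paper).

    Each iteration takes a look-ahead (extrapolation) step, then updates
    the original iterate using the gradient at the look-ahead point.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(num_iters):
        y = x - eta * grad(x)   # extrapolation: look-ahead step
        x = x - eta * grad(y)   # update using the extrapolated gradient
    return x

# Illustrative smooth non-convex objective: f(x) = x^2 + 3*sin^2(x)
f_grad = lambda x: 2 * x + 6 * np.sin(x) * np.cos(x)

x_out = gd_with_extrapolation(f_grad, x0=np.array([2.0]))
# Near an approximate first-order stationary point the gradient norm is small.
print(x_out, np.linalg.norm(f_grad(x_out)))
```

The look-ahead gradient gives the update a preview of where plain gradient descent would land, which is the intuition behind the acceleration the paper's upper bounds quantify.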