Stochastic Adaptive Gradient Descent Without Descent (2509.14969v1)
Abstract: We introduce a new adaptive step-size strategy for convex optimization with stochastic gradients that exploits the local geometry of the objective function using only a first-order stochastic oracle and without any hyper-parameter tuning. The method is a theoretically grounded adaptation of the Adaptive Gradient Descent Without Descent method to the stochastic setting. We prove the convergence of stochastic gradient descent with our step-size under various assumptions, and we show empirically that it competes with tuned baselines.
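The abstract does not spell out the stochastic step-size update, but the method it adapts, Adaptive Gradient Descent Without Descent (Malitsky & Mishchenko), chooses each step from successive iterates and gradients. The sketch below illustrates that deterministic rule driven by minibatch gradient estimates on a least-squares toy problem; the problem data, batch size, initial step, and iteration count are illustrative assumptions, and the paper's actual stochastic adaptation may differ.

```python
import numpy as np

# Minimal sketch (NOT the paper's exact rule): the deterministic AdGD step size of
# Malitsky & Mishchenko applied with stochastic minibatch gradients.
# Toy problem: least squares f(x) = (1/2n) ||A x - b||^2.

rng = np.random.default_rng(0)
n, d, batch = 1000, 20, 32                      # illustrative problem sizes
A = rng.standard_normal((n, d))
x_true = rng.standard_normal(d)
b = A @ x_true + 0.1 * rng.standard_normal(n)

def stochastic_grad(x):
    """First-order stochastic oracle: minibatch gradient of the least-squares loss."""
    idx = rng.choice(n, size=batch, replace=False)
    Ai, bi = A[idx], b[idx]
    return Ai.T @ (Ai @ x - bi) / batch

x_prev = np.zeros(d)
g_prev = stochastic_grad(x_prev)
lam_prev, theta = 1e-6, np.inf                  # tiny initial step, theta_0 = +inf
x = x_prev - lam_prev * g_prev

for k in range(2000):
    g = stochastic_grad(x)
    # Local inverse-smoothness estimate from successive iterates and gradients.
    diff_x = np.linalg.norm(x - x_prev)
    diff_g = np.linalg.norm(g - g_prev)
    local = diff_x / (2.0 * diff_g) if diff_g > 0 else np.inf
    # AdGD rule: limit how fast the step can grow, and respect local geometry.
    lam = min(np.sqrt(1.0 + theta) * lam_prev, local)
    x_prev, g_prev = x, g
    x = x - lam * g
    theta, lam_prev = lam / lam_prev, lam

print("final squared error:", np.linalg.norm(x - x_true) ** 2)
```

Note that no smoothness constant or learning-rate schedule is supplied: the step size is rebuilt at every iteration from observed iterate and gradient differences, which is the hyper-parameter-free behavior the abstract describes.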