Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Complexity of trust-region methods in the presence of unbounded Hessian approximations (2408.06243v2)

Published 12 Aug 2024 in math.OC

Abstract: We extend traditional complexity analyses of trust-region methods for unconstrained, possibly nonconvex, optimization. Whereas most complexity analyses assume uniform boundedness of the model Hessians, we work with potentially unbounded model Hessians. Boundedness is not guaranteed in practical implementations, in particular ones based on quasi-Newton updates such as PSB, BFGS and SR1. Our analysis is conducted for a family of trust-region methods that includes most known methods as special cases. We examine two regimes of Hessian growth: one bounded by a power of the number of successful iterations, and one bounded by a power of the number of iterations. This allows us to formalize and address the intuition of Powell [IMA J. Numer. Ana. 30(1):289-301,2010], who studied convergence under a special case of our assumptions, but whose proof contained complexity arguments. Specifically, for (0 \leq p < 1), we establish sharp (O([(1-p)\epsilon{-2}]{1/(1-p)})) evaluation complexity to find an (\epsilon)-stationary point when model Hessians are (O(|\mathcal{S}{k-1}|p)), where (|\mathcal{S}{k-1}|) is the number of iterations where the step was accepted, up to iteration (k-1). For (p = 1), which is the case studied by Powell, we establish a sharp (O(\exp(c_1\epsilon{-2}))) evaluation complexity for a certain constant (c_1 > 0). We establish similar sharp bounds when model Hessians are (O(kp)), where (k) is the iteration counter, for (0 \leq p < 1). When (p = 1), the complexity bound depends on the parameters of the family, but reduces to (O((1 - \log(\epsilon))\exp(c_2\epsilon{-2}))) for a certain constant (c_2 > 0) for the special case of the standard trust-region method. As special cases, we derive novel complexity bounds for (strongly) convex objectives under the same growth assumptions.

Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com