Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Higher-Order Corrections to Optimisers based on Newton's Method (2307.03820v2)

Published 7 Jul 2023 in math.NA and cs.NA

Abstract: The Newton, Gauss--Newton and Levenberg--Marquardt methods all use the first derivative of a vector function (the Jacobian) to minimise its sum of squares. When the Jacobian matrix is ill-conditioned, the function varies much faster in some directions than others and the space of possible improvement in sum of squares becomes a long narrow ellipsoid in the linear model. This means that even a small amount of nonlinearity in the problem parameters can cause a proposed point far down the long axis of the ellipsoid to fall outside of the actual curved valley of improved values, even though it is quite nearby. This paper presents a differential equation that `follows' these valleys, based on the technique of geodesic acceleration, which itself provides a 2$\mathrm{nd}$ order improvement to the Levenberg--Marquardt iteration step. Higher derivatives of this equation are computed that allow $n\mathrm{th}$ order improvements to the optimisation methods to be derived. These higher-order accelerated methods up to 4$\mathrm{th}$ order are tested numerically and shown to provide substantial reduction of both number of steps and computation time.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (13)
  1. “A Method for the Solution of Certain Non-Linear Problems in Least Squares”, Kenneth Levenberg, Quarterly of Applied Mathematics 2 (2), pp.164–168 (1944) doi:10.1090/qam/10666
  2. “An Algorithm for Least-Squares Estimation of Nonlinear Parameters”, Donald Marquardt, SIAM Journal on Applied Mathematics 11 (2), pp.431–441 (1963) doi:10.1137/0111030
  3. “Why are Nonlinear Fits to Data so Challenging?”, Mark K. Transtrum, Benjamin B. Machta and James P. Sethna, Phys. Rev. Lett. 104, 060201 (2010) doi:10.1103/PhysRevLett.104.060201
  4. “Geometry of nonlinear least squares with applications to sloppy models and optimization”, Mark K. Transtrum, Benjamin B. Machta, and James P. Sethna, Phys. Rev. E 83, 036701 (2011) doi:10.1103/PhysRevE.83.036701
  5. “Improvements to the Levenberg-Marquardt algorithm for nonlinear least-squares minimization”, Mark K. Transtrum and James P. Sethna, arXiv:1201.5885v1 [physics.data-an] (2012).
  6. “Geodesic acceleration and the small-curvature approximation for nonlinear least squares”, Mark K. Transtrum and James P. Sethna, arXiv:1207.4999v1 [math.OC] (2012).
  7. “Du calcul des derivations” [On the calculus of derivatives] (in French), L.F.A. Arbogast, Strasbourg: Levrault, pp. xxiii+404 (1800). Entirely freely available from Google books https://books.google.com/books?id=YoPq8uCy5Y8C
  8. “Sullo sviluppo delle funzioni” [On the development of the functions] (in Italian), F. Faà di Bruno, Annali di Scienze Matematiche e Fisiche, 6: 479–480, LCCN 06036680 (1855). Entirely freely available from Google books https://books.google.com/books?id=ddE3AAAAMAAJ&pg=PA479
  9. “Note sur une nouvelle formule de calcul differentiel” [On a new formula of differential calculus] (in French), F. Faà di Bruno, The Quarterly Journal of Pure and Applied Mathematics, 1: 359–360 (1857). Entirely freely available from Google books https://books.google.com/books?id=7BELAAAAYAAJ&pg=PA359
  10. https://stephenbrooks.org/ap/src/optimtest.zip contains the source code for the simple optimisation problem, and https://stephenbrooks.org/ap/src/tracker.zip the complex optimisation.
  11. “Bounded Approximate Solutions of Linear Systems using SVD”, Stephen Brooks, tech note BNL-223624-2022-TECH, available from https://technotes.bnl.gov/ or https://stephenbrooks.org/ap/report/2015-3/svdboundedsolve.pdf (2015).
  12. “A Class of Methods for Solving Nonlinear Simultaneous Equations”, C.G. Broyden, Mathematics of Computation 19, pp.577–593 (1965). doi:10.1090/S0025-5718-1965-0198670-6
  13. “Ultra-Low Emittance Bunches from Laser Cooled Ion Traps for Intense Focal Points”, Stephen Brooks, Proc. HB2023. doi:10.18429/JACoW-HB2023-TUC3I1
Citations (1)

Summary

We haven't generated a summary for this paper yet.