Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 23 tok/s Pro
GPT-5 High 18 tok/s Pro
GPT-4o 86 tok/s Pro
Kimi K2 194 tok/s Pro
GPT OSS 120B 432 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Full Gauss-Newton Method for Nonlinear Least Squares

Updated 15 October 2025
  • The full Gauss-Newton method is an iterative algorithm for solving nonlinear least squares problems using only first-order derivatives, which simplifies computation compared to Newton's method.
  • It employs a majorant function to rigorously establish local convergence rates—quadratic with zero residuals and linear with small residuals—by bounding the nonlinear behavior of the Jacobian.
  • Optimal convergence and uniqueness radii are derived under unified theoretical conditions, guiding effective initialization and adaptive termination in practical nonlinear optimization.

The full Gauss-Newton method is a fundamental iterative algorithm for solving nonlinear least squares problems of the form minxF(x)2\min_x \|F(x)\|^2, where F:ΩRnRmF: \Omega \subset \mathbb{R}^n \to \mathbb{R}^m is a continuously differentiable map. Distinct from Newton's method, which requires computation of second derivatives, the full Gauss-Newton method leverages only first-order derivative information—specifically, the Jacobian—thus simplifying the local quadratic model of the least-squares objective. The method's update,

xk+1=xk[F(xk)F(xk)]1F(xk)F(xk),x_{k+1} = x_k - \left[ F'(x_k)^\top F'(x_k) \right]^{-1} F'(x_k)^\top F(x_k),

defines a sequence that approximates the solution xx^*, provided certain regularity and proximity conditions are met. The local convergence characteristics of the full Gauss-Newton method, including convergence rate, radius of convergence, and uniqueness domain, can be rigorously elucidated via the introduction of a majorant function—a scalar function that upper bounds the nonlinear behavior of the Jacobian. This approach both unifies and generalizes classical local convergence results, including those based on Lipschitz continuity and analytic (Smale-type) conditions (Ferreira et al., 2010).

1. Local Convergence Under Majorant Conditions

A central insight is that global assumptions (e.g., global Lipschitz continuity of the Jacobian FF') are often unnecessarily restrictive for local convergence analysis. Instead, the method's behavior near the solution can be characterized using a majorant function f:[0,R)Rf:[0, R) \to \mathbb{R} with

f(0)=0,f(0)=1,f(0) = 0, \quad f'(0) = -1,

and satisfying a differential bounding condition: F(x)F(x+τ(xx))f(xx)f(τxx),τ[0,1]\|F'(x) - F'(x^* + \tau(x - x^*))\| \leq f'(\|x - x^*\|) - f'(\tau\|x - x^*\|), \quad \forall \tau \in [0, 1] for all xx in a neighborhood of xx^*. This condition generalizes the classical Lipschitz assumption and allows for sharper, more flexible local analysis.

The main result (Theorem 7) states that, assuming F(x)F'(x^*) has full rank, the sequence {xk}\{x_k\} defined by the full Gauss-Newton iteration remains within a ball B(x,r)B(x^*,r) and converges to xx^*. The estimate

xk+1x(B[f(xkx)xkxf(xkx)]+)xkx\|x_{k+1} - x^*\| \leq \left( B \left[ f'(\|x_k - x^*\|) \|x_k - x^*\| - f(\|x_k - x^*\|) \right] + \cdots \right) \|x_k - x^*\|

captures both the nonlinear behavior of FF (via ff) and the local residual c=F(x)c = \|F(x^*)\|. When c=0c = 0 the method achieves Q-quadratic convergence; for small cc the rate is Q-linear; and for large cc convergence may fail.

2. Optimal Radius and Uniqueness Region

The majorant condition enables precise quantification of the optimal convergence radius. Constants are introduced:

  • κ>0\kappa > 0 such that B(x,κ)ΩB(x^*, \kappa) \subset \Omega,
  • v=sup{t[0,R):B[f(t)+1]<1}v = \sup \{ t \in [0, R): B [ f'(t) + 1 ] < 1 \},
  • p=sup{t(0,v):[B(tf(t)f(t))+(2)c(f(t)+1)t]/(1B(f(t)+1))<1}p = \sup \{ t \in (0, v): [B(t f'(t) - f(t)) + (\sqrt{2})c (f'(t) + 1) t] / (1 - B(f'(t) + 1)) < 1 \}, and the convergence ball is r=min{κ,p}r = \min\{\kappa, p\}.

It is shown (Lemma 16) that, under a certain boundary condition, r=pr = p is sharp—providing the largest possible convergence radius under the majorant structure. In situations where the uniqueness of solution is required—from the additional bound (2)B0(\sqrt{2}) B_0, with B0=[F(x)F(x)]1B_0 = \|[F'(x^*)^\top F'(x^*)]^{-1}\|—an even smaller radius oo is defined, ensuring uniqueness of the solution to F(x)=0F(x) = 0 in B(x,o)B(x^*, o).

3. Unified Framework for Convergence Analyses

The majorant function approach subsumes previous, apparently disparate frameworks:

  • Classical local convergence under a Lipschitz condition on FF' is recovered by selecting ff quadratic,
  • Smale's point estimate theory for analytic mappings is recovered by taking appropriate majorants,
  • Both yield optimal radii and uniqueness results under the same abstract machinery.

This unification demonstrates that the full Gauss-Newton method's local convergence is not bounded to the classical linear bounds but is adaptable via the choice of an appropriate majorant.

4. Convergence Sensitivity and Rate

A key feature revealed by the analysis is the sensitivity of the convergence rate to both the behavior of FF and the residual c=F(x)c = \|F(x^*)\|. The bound for xk+1x\|x_{k+1} - x^*\| displays explicit dependence on cc:

  • Zero-residual regime: c=0c = 0, Q-quadratic convergence;
  • Small residual regime: cc small but nonzero; Q-linear convergence, rate deteriorating smoothly as cc increases;
  • Large residual regime: convergence may not be assured.

This delineation clarifies practical expectations on convergence based on proximity and the nature of the target solution (exact or inexact).

5. Implications for Method Design and Termination

The optimal convergence radius informs how close the initial iterate x0x_0 must lie to xx^* for guaranteed convergence. Practically, this determines initialization strategies and the reliability of the full Gauss-Newton approach in the presence of pronounced nonlinearity. Furthermore, the explicit error estimates can guide adaptive termination criteria and step-size control, especially in ill-conditioned or near-degenerate cases.

6. Broader Impact and Extension to Generalized Gauss-Newton Methods

The majorant framework directly inspires generalizations to composite and constrained settings. For example, proximal or projected variants of Gauss-Newton inherit analogous convergence properties when a suitable majorant condition controls the Jacobian's behavior in the regularized or projected domain. The results also communicate that strong Lipschitz conditions may be replaced with more tractable, locally-verifiable majorant constructions, broadening the full Gauss-Newton method's effective domain of applicability.

7. Summary Table: Theoretical Guarantees under Majorant Conditions

Result Property Majorant Condition (f) Residual Regime Convergence Rate Guarantee Domain
Local Existence/Convergence f(0)=0,f(0)=1f(0)=0,\, f'(0)=-1; (2) c=0c=0 Quadratic B(x,r)B(x^*, r)
Local Existence/Convergence General cc cc small Linear B(x,r)B(x^*, r)
Optimality of Radius (see Lemma 16) any -- B(x,p)B(x^*, p)
Uniqueness of Solution Additional bound (2)B0(\sqrt{2})B_0 -- -- B(x,o)B(x^*, o)
Unification of Theories Choice of ff -- -- Both Lipschitz/analytic

These results provide a comprehensive local theory, supplanting and enhancing classical convergence analyses for nonlinear least squares problems.


The majorant-based local convergence framework developed in (Ferreira et al., 2010) provides the mathematical infrastructure for rigorous and generalizable guarantees for the full Gauss-Newton method in nonlinear least-squares settings. It enables practical assessment of convergence/uniqueness domains and convergence rates, offers a unified view of previously diverse results, and lays the foundation for further development of Newton-type and Gauss-Newton-type methods in broader nonlinear and composite settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Full Gauss-Newton Method.