Normalized Weighted Least Squares (NWLS)
- NWLS is a function approximation strategy that uses Christoffel weights to stabilize least squares systems and achieve near-optimal recovery from finite samples.
- It employs a normalized weighted design matrix ensuring uniform row norms, minimal variance, and robust error bounds even in high-dimensional or irregular domains.
- NWLS is applied in uncertainty quantification, polynomial approximation, and numerical solutions for parametric and stochastic PDEs, offering proven stability and efficiency.
Normalised Weighted Least Squares (NWLS) is a methodology for function approximation in finite-dimensional spaces, achieving stable and near-optimal recovery from finite, possibly noisy samples. It leverages the Christoffel function and related sampling measures to guarantee well-conditioned least-squares systems and robust error bounds even in high-dimensional, irregular, or unbounded domains. The foundational principle is the normalization of the weighted design matrix, enabling uniform row norms and minimal variance in the empirical projection operator. NWLS underpins modern approaches in uncertainty quantification, polynomial approximation, and numerical solutions of parametric and stochastic PDEs (Cohen et al., 2016, Migliorati, 2019, Narayan et al., 2014).
1. Mathematical Formulation and Theoretical Framework
NWLS operates in a probability space where is a probability measure on . Let be an -dimensional space with an -orthonormal basis . The Christoffel function is defined as
NWLS specifies the weight and sampling measure as
Given independent samples , the weighted least-squares estimator minimizes the weighted empirical risk:
In matrix terms, with Gram matrix and right-hand side , the coefficients solve (Cohen et al., 2016, Narayan et al., 2014).
2. Christoffel Function and Sampling Measures
The Christoffel function plays a central role in sample weighting and measure design. Its reciprocal, normalized as , ensures that the weighted design matrix has row norms equal to unity. Under the sampling measure , the Christoffel-weighted sum is constant, minimizing the supremum and regularizing the least-squares system. For general bounded domains, the approximation space (with orthonormal basis ) yields , normalized via and corresponding sampling measure (Migliorati, 2019).
A principal innovation is the prescription to sample from the measure induced by (or the pluripotential equilibrium measure in some settings), which yields stability and accuracy guarantees unattainable with naive Monte Carlo sampling from the orthogonality measure alone (Narayan et al., 2014).
3. Algorithmic Procedures
NWLS methodology can be summarized by the following steps:
- Basis Construction: Identify an -orthonormal basis (analytically or via discrete surrogate orthogonalization when unavailable in closed form, as for irregular domains) (Migliorati, 2019).
- Christoffel Function Evaluation: Compute for all evaluation points.
- Sampling Measure: Draw sample points i.i.d. from or, for irregular domains, from an empirical surrogate measure.
- Weight Calculation: Assign weights .
- Weighted Gram Matrix Assembly: Form .
- Empirical Right-hand Side: Assemble .
- Solve Normal Equations: Compute coefficients via .
- Estimator Construction: Return . In practice, truncation is used to enforce bounds on .
For irregular, non-tensor-product, or empirical domains, an additional Stage A constructs a discrete near-orthonormal basis via QR factorization of a Vandermonde-like matrix, at cost with scaling with the Christoffel function supremum (Migliorati, 2019).
4. Theoretical Guarantees: Stability and Error Bounds
NWLS provides high-probability stability and quasi-optimal error guarantees:
- Condition Number Bound: With probability at least , if , (Cohen et al., 2016).
- Error Convergence: The mean-square error of the clipped estimator satisfies
- Sample Complexity: In both regular and irregular domains, stable estimation requires samples (or for the number of samples fitting an -dimensional space), a substantial efficiency improvement over classical unweighted least squares. The key analysis for irregular domains also shows that, provided or , overall computational cost remains tractable up to in the thousands (Migliorati, 2019, Narayan et al., 2014).
- Bias Control: Exact sampling from the Christoffel-induced measure yields clean minimax-type error bounds, while sampling from approximate equilibrium measures risks persistent bias terms (Cohen et al., 2016).
5. Sampling Algorithms and Implementation in Multivariate and Irregular Domains
For multivariate problems with non-tensor-product measures , NWLS employs sequential conditional sampling:
- Each is drawn from a marginal depending on the basis; subsequent coordinates are sampled from conditional densities constructed from basis evaluations of prior coordinates.
- When the Christoffel function or equilibrium measure is defined only empirically (as for irregular domains), surrogate bases are constructed via QR-decomposition of evaluation matrices built on random samples (Migliorati, 2019).
- Univariate sampling within these algorithms may utilize rejection sampling (envelope method) or inverse transform sampling, both connected directly to the construction of the Christoffel-weighted density or its cumulative (Cohen et al., 2016).
This allows NWLS to be applied to domains with high or infinite dimensions and to polynomial spaces defined on unbounded supports (e.g., multivariate Hermite polynomials under Gaussian measures) (Cohen et al., 2016).
6. Connections, Comparison with Alternative Methods, and Extensions
NWLS generalizes and improves upon standard least-squares and Monte Carlo approaches:
- In standard least-squares, sampling from the orthogonality measure may yield extremely ill-conditioned Gram matrices when the Christoffel function spikes, resulting in poor accuracy unless a prohibitively large number of samples is used (Narayan et al., 2014).
- Sampling from the equilibrum measure or the Christoffel-induced measure renders the normalized rows of the design matrix, directly controlling the spectral properties of the Gram matrix and enabling efficient sample complexity independent of the details of or (Cohen et al., 2016).
- Recent alternative methods (e.g., Jakeman–Narayan–Zhou) propose sampling from fixed equilibrium measures with Christoffel weights. NWLS analysis demonstrates that only exact pairing of measure and weight achieves unbiased minimax error results (Cohen et al., 2016).
- NWLS extends to parametric and stochastic PDEs with infinite-dimensional parameter spaces by truncation to finite downward-closed index sets of cardinality , with no modification to the overall methodology (Cohen et al., 2016).
A summary table of methodological positions:
| Approach | Sampling Measure | Weights | Error/Condition Bounds |
|---|---|---|---|
| Standard Least Squares | Orthogonality measure | Uniform | Potentially poor |
| NWLS (Christoffel) | Christoffel measure | Uniform, minimax | |
| Equilibrium measure + Christoffel | Equilibrium measure | May introduce bias |
7. Computational Complexity and Practical Aspects
The dominant computational cost in NWLS arises in basis orthogonalization (for irregular domains) and assembly of weighted Gram matrices:
- For regular domains with analytic bases, the cost is for Gram assembly and for solving normal equations.
- For domains without analytic bases, QR factorization of a matrix is , with scaling with the Christoffel supremum .
- Evaluating basis functions at any point costs , and total costs remain provided grows mildly with (Migliorati, 2019).
Implementation is fully practical up to moderate or high dimensions ( in the thousands), with complexity and stability tied directly to properties of the Christoffel function for the chosen approximation space and domain.
References:
- [Optimal weighted least-squares methods, (Cohen et al., 2016)]
- [Multivariate approximation of functions on irregular domains by weighted least-squares methods, (Migliorati, 2019)]
- [A Christoffel function weighted least squares algorithm for collocation approximations, (Narayan et al., 2014)]