Identifiable Convex-Concave NLS

Updated 7 July 2025
  • ICCNLS is a nonparametric regression method that uniquely decomposes functions into convex and concave components for clear interpretability.
  • It employs orthogonality constraints and regularization to resolve affine non-identifiability, ensuring accurate and stable estimation.
  • The approach supports scalable optimization and has practical applications in forecasting, benchmarking, and policy evaluation across diverse fields.

Identifiable Convex-Concave Nonparametric Least Squares (ICCNLS) is a nonparametric regression methodology designed to estimate functions that possess both convex and concave structural components. ICCNLS decomposes a target function into additive, shape-constrained subcomponents and implements statistical and algorithmic mechanisms that guarantee identifiability of this decomposition. The theoretical and practical frameworks are motivated by both foundational and recent developments in shape-constrained regression, Bayesian nonparametrics, high-dimensional convex optimization, and difference-of-convex programming. ICCNLS further introduces regularization and inference techniques to ensure interpretability, statistical reliability, and scalability for real-world datasets.

1. Theoretical Foundations and Model Representation

ICCNLS formalizes regression problems where the response function $f(x)$ is assumed to admit an additive decomposition
$$f(x) = g_\mathrm{c}(x) + g_\mathrm{v}(x),$$
where $g_\mathrm{c}(\cdot)$ is concave and $g_\mathrm{v}(\cdot)$ is convex. This structure enables the modeling of functions with changing curvature, such as those encountered in economic production, pricing, and resource systems, where distinct regimes of returns or costs are convex or concave in different domains (2506.18078).

Both $g_\mathrm{c}$ and $g_\mathrm{v}$ are nonparametric and typically represented as lower and upper envelopes of subgradient-constrained affine functions. For a sample $\{(x_i, y_i)\}_{i=1}^n$ with $x_i \in \mathbb{R}^d$, the localization at each observation takes the form
$$g_\mathrm{c}(x_i) = a^c_i + (B^c_i)^\top x_i, \qquad g_\mathrm{v}(x_i) = a^v_i + (B^v_i)^\top x_i$$
for per-observation intercepts $a^c_i, a^v_i$ and gradient (subgradient) vectors $B^c_i, B^v_i \in \mathbb{R}^d$, subject to pairwise inequalities that guarantee concavity or convexity (2506.18078, 1509.08165, 1109.0322).
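As a concrete illustration, a function of this form can be evaluated as a pointwise min of affine pieces (the concave envelope) plus a pointwise max of affine pieces (the convex envelope). The sketch below is a minimal, hypothetical helper assuming this max-/min-affine parameterization; names and values are illustrative, not from the paper.

```python
import numpy as np

def g_concave(x, a_c, B_c):
    """Concave component: lower envelope (pointwise min) of the affine
    pieces a_c[i] + B_c[i] @ x. A min of affine functions is concave."""
    return np.min(a_c + B_c @ x)

def g_convex(x, a_v, B_v):
    """Convex component: upper envelope (pointwise max) of affine pieces.
    A max of affine functions is convex."""
    return np.max(a_v + B_v @ x)

def f_hat(x, a_c, B_c, a_v, B_v):
    """Fitted response: sum of the concave and convex envelopes."""
    return g_concave(x, a_c, B_c) + g_convex(x, a_v, B_v)

# Illustrative evaluation with 5 affine pieces in d = 2 dimensions.
rng = np.random.default_rng(0)
a_c, B_c = rng.normal(size=5), rng.normal(size=(5, 2))
a_v, B_v = rng.normal(size=5), rng.normal(size=(5, 2))
print(f_hat(np.array([0.3, -1.2]), a_c, B_c, a_v, B_v))
```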

2. Identifiability and Orthogonality Constraints

A central challenge in convex-concave decomposition is affine non-identifiability: if $f(x) = g_\mathrm{c}(x) + g_\mathrm{v}(x)$, then for any affine function $L(x)$,

$$f(x) = [g_\mathrm{c}(x) + L(x)] + [g_\mathrm{v}(x) - L(x)],$$

and since adding an affine term preserves concavity while subtracting one preserves convexity, many valid decompositions exist. ICCNLS resolves this ambiguity by imposing global orthogonality constraints on the residuals $\epsilon_i = y_i - f(x_i)$:
$$\sum_{i=1}^n \epsilon_i = 0, \qquad \sum_{i=1}^n \epsilon_i x_{ij} = 0 \quad \forall j = 1, \ldots, d.$$
These constraints force the residuals to be orthogonal to the constant function and to every input coordinate, centering any affine perturbation and rendering the decomposition unique up to an additive constant (2506.18078). Proposition-level results show that, modulo a constant, any pair of valid decompositions must coincide, facilitating interpretation of the individual convex and concave effects.
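The two constraint families are just linear equalities in the residual vector: it must be orthogonal to the intercept column and to each column of the design matrix. A minimal check of this condition on fitted residuals (a hypothetical utility, not part of any published implementation) might look like:

```python
import numpy as np

def satisfies_orthogonality(X, residuals, tol=1e-8):
    """Check the ICCNLS identifiability conditions on fitted residuals:
    sum_i eps_i = 0 and sum_i eps_i * x_ij = 0 for every coordinate j.
    Equivalently, the residual vector is orthogonal to the column space
    of the design matrix augmented with an intercept column."""
    zero_mean = abs(residuals.sum()) < tol
    zero_corr = np.all(np.abs(X.T @ residuals) < tol)
    return zero_mean and zero_corr
```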

3. Optimization and Algorithmic Approaches

ICCNLS is formulated as a nonlinear least squares problem with shape and orthogonality constraints, and typically incorporates regularization:
$$\min_{\theta} \; \sum_{i=1}^n \left[ y_i - \left( a^c_i + (B^c_i)^\top x_i + a^v_i + (B^v_i)^\top x_i \right) \right]^2 + \lambda R(\theta)$$
subject to the inequalities defining convexity of $g_\mathrm{v}$, concavity of $g_\mathrm{c}$, and the orthogonality constraints on the residuals (2506.18078). The regularization term $R(\theta)$ promotes parsimony and well-conditioned estimates (see Section 4).
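Below is a minimal sketch of this program using cvxpy, assuming the standard pairwise (CNLS-style) formulation of the shape constraints and an L2 regularizer; `iccnls_fit` and all variable names are illustrative, not the paper's implementation.

```python
import numpy as np
import cvxpy as cp

def iccnls_fit(X, y, lam=0.1):
    """Sketch of the ICCNLS quadratic program: one affine piece per
    observation for each component, pairwise shape constraints, residual
    orthogonality, and an L2 penalty on the subgradients. Written for
    clarity, not speed: the double loop emits the O(n^2) constraints
    discussed in Section 6."""
    n, d = X.shape
    ac, av = cp.Variable(n), cp.Variable(n)            # intercepts a^c_i, a^v_i
    Bc, Bv = cp.Variable((n, d)), cp.Variable((n, d))  # subgradients B^c_i, B^v_i
    gc = ac + cp.sum(cp.multiply(Bc, X), axis=1)       # g_c(x_i)
    gv = av + cp.sum(cp.multiply(Bv, X), axis=1)       # g_v(x_i)
    eps = y - (gc + gv)                                # residuals

    cons = [cp.sum(eps) == 0, X.T @ eps == 0]          # orthogonality constraints
    for i in range(n):
        for j in range(n):
            if i != j:
                # Concavity: every affine piece of g_c majorizes it elsewhere.
                cons.append(gc[i] <= ac[j] + Bc[j] @ X[i])
                # Convexity: every affine piece of g_v minorizes it elsewhere.
                cons.append(gv[i] >= av[j] + Bv[j] @ X[i])

    obj = cp.Minimize(cp.sum_squares(eps)
                      + lam * (cp.sum_squares(Bc) + cp.sum_squares(Bv)))
    cp.Problem(obj, cons).solve()
    return ac.value, Bc.value, av.value, Bv.value

# Tiny synthetic example: concave -|x|^2 plus convex |x_1|, with noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(25, 2))
y = -np.sum(X**2, axis=1) + np.abs(X[:, 0]) + 0.1 * rng.normal(size=25)
ac, Bc, av, Bv = iccnls_fit(X, y, lam=0.5)
```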

Algorithmic strategies depend on problem scale and structure. For moderate $n$ and $d$, the problem is a quadratic program with $O(n^2)$ linear constraints, often addressed by augmented Lagrangian/ADMM methods (1509.08165), penalized quadratic programs (1608.03393), or specialized convex-concave procedures for DC decompositions (2107.05682). For large-scale Bayesian variants, stochastic search strategies such as reversible jump MCMC (1109.0322) or block coordinate methods exploiting problem sparsity are effective.

4. Regularization and Structural Sparsity

ICCNLS introduces regularization directly on the subgradients defining the local hyperplanes. Regularization types include:

  • L2 (Tikhonov): adds a squared penalty on each subgradient, promoting smoothness and numerical stability:

$$R_{\mathrm{L2}}(\theta) = \sum_{i=1}^n \left( \Vert B^c_i \Vert_2^2 + \Vert B^v_i \Vert_2^2 \right)$$

  • L1 (sparsity-promoting): encourages sparsity in the subgradients, leading to interpretable models with few active variables:

$$R_{\mathrm{L1}}(\theta) = \sum_{i=1}^n \left( \Vert B^c_i \Vert_1 + \Vert B^v_i \Vert_1 \right)$$

  • Elastic net: combines the L1 and L2 penalties with mixing parameter $\alpha$:

$$R_{\mathrm{EN}}(\theta) = \sum_{i=1}^n \left[ \alpha \left( \Vert B^c_i \Vert_1 + \Vert B^v_i \Vert_1 \right) + (1-\alpha) \left( \Vert B^c_i \Vert_2^2 + \Vert B^v_i \Vert_2^2 \right) \right]$$

Careful calibration of these penalties strikes a balance between overfitting and underfitting, reducing the effective number of hyperplanes and improving generalization (2506.18078, 1509.08165, 1608.03393).
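Continuing the cvxpy sketch from Section 3, the three penalties differ only in the regularizer added to the objective. The helper below (`penalty`, a hypothetical name) shows the variants side by side:

```python
import cvxpy as cp

def penalty(Bc, Bv, kind="l2", alpha=0.5):
    """Regularizers on the subgradient matrices (cvxpy variables):
    'l2' is Tikhonov, 'l1' promotes sparsity, 'en' is the elastic net
    with mixing parameter alpha."""
    l1 = cp.sum(cp.abs(Bc)) + cp.sum(cp.abs(Bv))
    l2 = cp.sum_squares(Bc) + cp.sum_squares(Bv)
    if kind == "l1":
        return l1
    if kind == "en":
        return alpha * l1 + (1 - alpha) * l2
    return l2

# e.g. obj = cp.Minimize(cp.sum_squares(eps) + lam * penalty(Bc, Bv, "en", 0.7))
```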

5. Statistical Properties and Inference

Estimation consistency and convergence rates for ICCNLS are connected to results for convex (or concave) regression, shape-restricted least squares, and single-index models. For fixed $d$, typical risk bounds of order $\mathcal{O}(n^{-4/(d+4)})$ are achievable under regularization and favorable design (1509.08165, 1708.00145). In higher-dimensional settings, minimax rates may deteriorate, and ICCNLS can be integrated with dimension-adaptive techniques or Bayesian priors having Kullback-Leibler support over convex functions (1109.0322, 2006.02044).
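To make the dimension dependence concrete, the exponent $-4/(d+4)$ can be instantiated for a few values of $d$:

```latex
% The shape-constrained risk bound for a few dimensions:
% the exponent -4/(d+4) decays toward 0 as d grows.
\mathcal{O}\!\left(n^{-4/(d+4)}\right) =
\begin{cases}
  \mathcal{O}(n^{-4/5}), & d = 1,\\
  \mathcal{O}(n^{-1/2}), & d = 4,\\
  \mathcal{O}(n^{-1/4}), & d = 12.
\end{cases}
```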

For inference, pivotal limit theory and locally normalized residuals yield tuning-free confidence intervals for function values and derivatives, exploiting the piecewise affine structure of the estimators (2006.10264, 1805.09873). Likelihood ratio tests and asymptotically pivotal statistics have been developed for related shape-constrained models, often yielding intervals with optimal or near-optimal coverage properties.

6. Practical Implementation and Empirical Evaluation

Practical ICCNLS implementations are validated on synthetic datasets with known convex and concave regimes, as well as on real-world data such as healthcare pricing. Empirical studies show:

  • Without regularization, ICCNLS selects nearly $n$ distinct hyperplanes, leading to overfitting.
  • Moderate $\lambda$ and an appropriate $\alpha$ yield substantial reductions in model complexity (number of distinct hyperplanes) with little or no sacrifice in predictive accuracy (RMSE, MAE) (2506.18078).
  • On real datasets, the model provides interpretable decompositions corresponding to economically or scientifically meaningful regions (e.g., economies and diseconomies of scale, energy price effects).

Computationally, the dominant cost arises from handling the $O(n^2)$ pairwise constraints; for $n = 10^3$ observations this already means on the order of $10^6$ linear inequalities. ADMM-type algorithms, quadratic penalty reformulations, and primal-dual schemes mitigate the scaling challenge, supporting moderate-to-large problems (1509.08165, 1608.03393, 2305.15653, 2305.17340).

7. Applications and Extensions

ICCNLS is suited for tasks where local curvature features are key:

  • Forecasting: Captures demand or pricing responses exhibiting both acceleration and saturation.
  • Benchmarking: Decomposes production or cost data into regimes supporting performance evaluation.
  • Policy Evaluation: Quantifies the local effect of interventions, separating convex and concave influences.

Extensions include single-index regression with convex-concave link functions (1708.00145), multidimensional penalty adaptations, and saddle-point optimization strategies for identifiability-regularized large-scale regression (2305.15653, 2305.17340).

Summary Table: Core Components of ICCNLS

| Component | Description | Key References |
|---|---|---|
| Decomposition | $f(x) = g_\mathrm{c}(x) + g_\mathrm{v}(x)$; concave + convex parts | 2506.18078, 1109.0322 |
| Identifiability | Orthogonality of residuals to constants and covariates | 2506.18078 |
| Regularization | L1/L2/elastic net on subgradients | 2506.18078, 1509.08165 |
| Optimization | Quadratic programming, ADMM, CCP for DC problems | 1509.08165, 2107.05682 |
| Statistical rates | Minimax: $\mathcal{O}(n^{-4/(d+4)})$; adaptivity for low $d$ | 1509.08165, 1708.00145 |
| Inference | Pivotally normalized confidence intervals | 2006.10264, 1805.09873 |
| Applications | Forecasting, benchmarking, policy evaluation | 2506.18078 |

ICCNLS unifies theory and computation to address the challenges of shape-constrained regression with changing curvature, supporting both statistical rigor and model interpretability in diverse data analysis contexts.