Papers
Topics
Authors
Recent
Search
2000 character limit reached

Neuron Interpolation in Neural Networks

Updated 25 March 2026
  • Neuron interpolation is a set of methods that enable neural networks to exactly match specified outputs and align internal representations.
  • It extends classical interpolation schemes with neural operators such as Taylor-accelerated, RBF-based, and tensor decomposition techniques.
  • The approach underpins improvements in function reconstruction, model fusion, and generalization with rigorously established theoretical bounds.

Neuron interpolation refers to a spectrum of theoretical, algorithmic, and architectural constructs in which neural network models achieve or exploit interpolation properties at the level of units, representations, or function approximations. This paradigm encompasses classical interpolation schemes interpreted through the neural-network lens, theoretical results on memorization and universal approximation, neuron-centric fusion and alignment protocols, and advanced operator constructions attaining high-order or structure-preserving interpolation across Euclidean and non-Euclidean domains. Neuron interpolation plays a foundational role in contemporary machine learning theory, scientific computing, and multi-model systems, undergirding advances in representation fusion, function reconstruction, mesh-based simulations, and generalization theory.

1. Definition and Theoretical Landscape

Neuron interpolation formalizes the ability of a neural network, or a set of its constituent neurons, to exactly fit prescribed values at specified sites—be these spatial sample points, activation patterns, or model checkpoints. At the function-approximation level, this requires the network N\mathcal{N} with sufficient capacity to construct fθf_\theta such that fθ(xi)=yif_\theta(x_i) = y_i for each (xi,yi)(x_i, y_i) training pair, or (in a structural context) align or interpolate internal activations across models or sub-networks.

Theoretical bounds delineate the sample complexity and network width/depth necessary for interpolation. For instance, deep feedforward networks with real-analytic, nonpolynomial activations attain the optimal Θ(nd)\Theta(\sqrt{nd'}) neuron count for nn generic data points in Rd×Rd\mathbb{R}^d \times \mathbb{R}^{d'}, independent of the activation’s specific shape as long as it avoids piecewise polynomial restriction (Madden, 2024). In contrast, for ReLU (piecewise-linear) networks, tight lower and upper bounds demonstrate a phase transition: if the minimal pointwise separation δ(N)\delta(N) between nodes decays exponentially in NN (δ<exp(cN)\delta < \exp(-cN)), a linear number of parameters Ω(N)\Omega(N) is both necessary and sufficient to guarantee interpolation of arbitrary labels, reflecting an intrinsic combinatorial bottleneck (Siegel, 2023).

2. Neural Network Interpolation Operators

Neural network operators for interpolation generalize classical linear interpolants (Lagrange, piecewise polynomial, splines) by embedding their logic into neural architectures. A canonical case is the Taylor-accelerated neural network interpolation operator T~n,r\widetilde{T}_{n,r}, defined over quasi-uniform grids as

T~n,r(g;y)=k=0nσ(2md1(yzk))Pr(g;y,zk)k=0nσ(2md1(yzk))\widetilde{T}_{n,r}(g; y) = \frac{\sum_{k=0}^{n} \sigma\left(\frac{2m}{d_1}(y - z_k)\right) P_r(g; y, z_k)} {\sum_{k=0}^{n} \sigma\left(\frac{2m}{d_1}(y - z_k)\right)}

where Pr(g;y,zk)P_r(g; y, z_k) is the degree-rr Taylor polynomial centered at zkz_k, and the sigmoidal σ\sigma enforces localization. This operator exactly interpolates at the nodes, reproduces polynomials up to degree rr, and attains an O(n(r+1))O(n^{-(r+1)}) error rate for Cr+1C^{r+1} functions, outperforming Lagrange-type NNOs—especially on nonuniform meshes (Saini, 5 Feb 2026).

Boundary interpolation on non-rectangular domains is addressed via Boolean-sum neural network operators Bn1,n2,ξ\mathcal{B}_{n_1, n_2, \xi}, which combine univariate interpolants in each coordinate direction and their tensor product, guaranteeing exact reproduction on all triangle edges and convergence in mixed moduli of continuity as neuron counts increase (Bhat et al., 2024).

3. Algorithmic Approaches: ELM, RBF-ELM, and Tensor Decompositions

Algorithmic neuron interpolation is achieved predominantly by two classes of constructions: overparameterized shallow architectures and tensorized multilinear networks.

  • Extreme Learning Machines (ELM): In single-layer (shallow) networks, neuron weights and biases are randomized and fixed, and only the output weights are learned by solving a linear system to impose interpolation constraints—either square systems Sw=ySw = y for equal nodes/neuron regime or least squares for overparameterized settings. This supports guaranteed exact interpolation with high numerical stability, and avoids the Runge phenomenon observed in high-degree polynomial interpolation, achieving Chebyshev-optimal convergence rates even for equispaced or random nodes (Auricchio et al., 2023).
  • RBF-augmented ELM: By replacing generic activations with localized radial basis functions, one obtains mesh- and topology-agnostic interpolators with vastly increased robustness, reaching machine-precision errors (101010^{-10} to 101410^{-14}) on unstructured and adaptive meshes with computational cost dominated by a single linear solve (Hao et al., 16 Nov 2025).
  • Tensor Neural Network (TNN) Interpolation: For high-dimensional inputs, tensor-product neural architectures decompose the function into products of univariate subnetworks, parameterized as Ψ(x;Θ)=j=1pcji=1dϕ^i,j(xi;θi)\Psi(x; \Theta) = \sum_{j=1}^p c_j \prod_{i=1}^d \widehat{\phi}_{i,j}(x_i; \theta_i). This structure reduces theoretical and numerical complexity of high-dimensional interpolation and allows efficient quadrature and PDE solvers in moderate ranks pp, circumventing the curse of dimensionality present in dense mesh-based interpolators (Li et al., 2024). Similar tensor-decomposition and low-rank expansions are exploited in interpolating neural network (INN) models used in scientific computing and simulation—drastically reducing parameter and computational counts compared to full MLPs while preserving spline-type convergence guarantees (Park et al., 2024).

4. Interpolation in Representation Fusion and Model Alignment

Neuron interpolation serves as the mathematical core of model fusion, representation alignment, and layerwise matching protocols.

  • Model Fusion via Neuron Interpolation: When merging several pretrained models with potentially misaligned internal representations, neuron interpolation proceeds by clustering neurons (via weighted K-means or Hungarian assignment), constructing target cluster centers (weighted averages of activations), and fitting the fused model so its neuron activations replicate these interpolation targets. The approach generalizes to any architecture (MLP, CNN, Transformer) and supports arbitrary layer widths. Empirically this yields superior “zero-shot” performance in non-IID or sharded data settings, outperforming ensembling, OT, or “re-basin” methods (Luenam et al., 18 Jun 2025).
  • Permutation-Invariant Linear Paths and REPAIR: Interpolating network weights between two independently trained solutions generally requires solving an alignment problem due to permutation invariance of neurons. Even after alignment, naive linear interpolation often exhibits “variance collapse,” resulting in degenerate activations and poor performance. The REPAIR scheme rescales preactivations at every channel to match the convex combination of endpoint statistics, eliminating variance collapse and realizing near-zero loss barriers along alignment-corrected interpolation paths. This enables empirical validation of the conjecture that SGD solutions can be linearly connected (after proper re-normalization), for deep and wide networks across architectures and normalization schemes (Jordan et al., 2022).

5. Geometric, Structural, and Generalization Perspectives

Neuron interpolation is tightly linked to the geometric configuration of data and the architecture of representations.

  • Intrinsic vs. Ambient Dimension: Neural networks’ generalization often occurs in a latent space of far lower effective dimension than the input or original feature space. In this space, most test samples lie in the convex hull of training activations—a regime dominated by interpolation rather than extrapolation, defying the “curse of dimensionality” argument. Statistical proximity (nearest-neighbor distances) better predicts classification accuracy than convex-hull membership, linking successful generalization to coverage and local interpolation (Bonnasse-Gahot, 2022).
  • Sample Separation and Memorization Barriers: There exists a geometric phase transition in minimal model size as a function of point separation. For well-separated points (δNα\delta \gtrsim N^{-\alpha}), the interpolation task can be solved with sublinear parameter counts (e.g., O(NpolylogN)O(\sqrt{N} poly\log N) for certain cases), but as δ\delta falls below exponential thresholds the minimal model size jumps to Ω(N)\Omega(N) (Siegel, 2023).

6. Advanced Extensions and Specialized Frameworks

Neuron interpolation encompasses a diverse range of specialized formulations:

  • Spline-based and Localized Activations: Construction of neuron-specific, adaptive activations via cubic spline interpolation enables local, data-dependent tuning of functional shape—optimizing the representation capacity and regularity properties of the network. The training procedure, based on local cardinal spline coefficients, supports stable learning dynamics, explicit shape regularization, and potential for non-standard activation morphologies (Scardapane et al., 2016).
  • Neural Operator-based Interpolation: In boundary value problems, operator-inspired neural network interpolants such as generalized Boolean-sum constructions achieve exact satisfaction of boundary constraints and rapid convergence in the interior, with explicit weights and biases determined by geometric and neuron-count parameters—eliminating the need for data-driven training (Bhat et al., 2024).
  • Continuous Coordinate-based Interpolation: In graphics, coordinate-conditioned networks represent mappings from continuous geometry or spatio-temporal coordinates to high-resolution representations (e.g., neural view interpolation in light-field rendering), demonstrating the inherent capacity for continuous, differentiable interpolation across sparse or sampled data while encoding scene or physical priors (Bemana et al., 2019).

7. Summary Table: Illustrative Neuron Interpolation Paradigms

Paradigm Key Principle Reference
Taylor-accelerated NNO Local Taylor + sigmoidal basis (Saini, 5 Feb 2026)
Model fusion via neuron alignment K-means/Hungarian interpolation of neurons (Luenam et al., 18 Jun 2025)
RBF-ELM mesh transfer Localized RBF activation, closed-form solve (Hao et al., 16 Nov 2025)
TNN/INN tensor-product methods Tensorized product of univariate subnetworks (Li et al., 2024, Park et al., 2024)
Layerwise REPAIR interpolation Alignment, mean/variance-corrected rescaling (Jordan et al., 2022)
Spline-based per-neuron AFs B-spline parameterization, local adaptivity (Scardapane et al., 2016)

8. Open Problems and Future Directions

Several structural and theoretical challenges remain outstanding:

  • Rigorous interpolation guarantees for ReLU (and other piecewise-polynomial activations) in the non-generic or worst-case data regime; closing the gap between lower and upper bounds for practical datasets (Madden, 2024).
  • Incorporation of conservation laws and physics-informed priors in neuron-interpolation frameworks for scientific computing (Hao et al., 16 Nov 2025).
  • Characterization of the trade-off between minimal-interpolant capacity and generalization under stochastic training, implicit regularization, or data-dependent activation adaptation (Bonnasse-Gahot, 2022, Scardapane et al., 2016).
  • Interpolatory operator constructions for unstructured domains, manifolds, or graphs with controlled regularity and fast evaluation, including high-dimension scaling.

Neuron interpolation, therefore, constitutes a multifaceted and unifying construct encompassing the algebraic, geometric, and statistical apparatus underlying the approximation, memorization, and fusion capabilities of neural network models. It bridges classical numerical analysis, modern deep learning theory, and real-world computational science (Saini, 5 Feb 2026, Jordan et al., 2022, Luenam et al., 18 Jun 2025, Hao et al., 16 Nov 2025, Siegel, 2023, Madden, 2024, Li et al., 2024, Park et al., 2024, Auricchio et al., 2023, Bhat et al., 2024, Scardapane et al., 2016, Bemana et al., 2019, Bonnasse-Gahot, 2022).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Neuron Interpolation.