Hybrid Parallel KAN/MLP PINNs

Updated 16 February 2026

The paper demonstrates that HPKM-PINN significantly reduces L2 error—up to two orders of magnitude—compared to standard PINN or KAN-only models.
It employs a parallel hybrid strategy that blends KAN’s high-frequency resolution with MLP’s nonlinear expressivity via adaptive convex fusion.
Domain decomposition and adaptive weighting enhance scalability and robustness, making HPKM-PINN ideal for multiscale and high-dimensional PDE challenges.

The Hybrid Parallel Kolmogorov–Arnold/MLP Physics-Informed Neural Network (HPKM-PINN) is a neural architecture that combines the explicit functional representation capabilities of Kolmogorov–Arnold Networks (KANs) with the spectral and nonlinear expressivity of standard Multi-Layer Perceptrons (MLPs) within the Physics-Informed Neural Networks (PINN) framework. Designed for the efficient and robust solution of partial differential equations (PDEs) and multi-frequency regression tasks, HPKM-PINN utilizes a parallel hybrid strategy and, in recent extensions, adaptive convex weighting and overlapping domain decomposition for improved accuracy and scalability (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).

1. Architecture and Mathematical Formalism

The canonical HPKM-PINN architecture consists of two parallel branches: a KAN branch and an MLP branch, both operating on the same input features (e.g., spatial and temporal coordinates).

Kolmogorov–Arnold Network (KAN) Branch: Implements the Kolmogorov–Arnold representation theorem, which ensures that any continuous multivariate function on a compact domain can be decomposed as

$f(x_1,\dots,x_n) = \sum_{q=1}^{2n+1} \Phi_q\left( \sum_{p=1}^n \phi_{q,p}(x_p) \right),$

with trainable univariate activations $\phi_{q,p}$ and $\Phi_q$ . In practical implementations, each $\phi$ and $\Phi$ is parameterized via basis expansions, for instance, employing B-splines and smooth nonlinearities. Alternative parameterizations, such as truncated Fourier series (Fourier-KAN), are employed in high-frequency settings (Huang et al., 14 Nov 2025).

Multi-Layer Perceptron (MLP) Branch: Constitutes a conventional feedforward architecture,

$u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$

with widths and depths tailored to the problem’s dimension and complexity, typically using activation functions such as $\tanh$ or ReLU.

Hybrid Parallel Fusion: The outputs of the KAN and MLP branches, $u_{\mathrm{KAN}}(x)$ and $u_{\mathrm{MLP}}(x)$ , are fused via a convex combination governed by a scalar parameter $\xi$ :

$\phi_{q,p}$ 0

Enhanced variants introduce a trainable scalar $\phi_{q,p}$ 1 and a monotonic mapping $\phi_{q,p}$ 2 (e.g., sigmoid or tanh) for adaptive fusion during training:

$\phi_{q,p}$ 3

This permits the weighting between KAN and MLP to adapt dynamically, yielding better balancing of frequency components and feature complexity (Huang et al., 14 Nov 2025).

2. Loss Construction and Optimization Protocols

HPKM-PINN adheres to the standard PINN composite loss paradigm for training:

PDE Loss (Collocation/Residual Term): For a PDE of the form $\phi_{q,p}$ 4 with initial/boundary conditions, the residual at each collocation point is defined as

$\phi_{q,p}$ 5

with pointwise losses

$\phi_{q,p}$ 6

Initial and Boundary Conditions: Losses are imposed by

$\phi_{q,p}$ 7

Total Loss Function:

$\phi_{q,p}$ 8

with $\phi_{q,p}$ 9, $\Phi_q$ 0, and $\Phi_q$ 1 typically assigned weight 1, or tuned via grid search.

Optimization: Training is conducted using Adam, with learning rates typically $\Phi_q$ 2 for regression tasks and $\Phi_q$ 3 for PDE-solving, sometimes incorporating learning-rate scheduling. Collocation points are sampled using Sobol or Latin-Hypercube sequences to ensure robust residual evaluation (Xu et al., 30 Mar 2025).
Domain Decomposition Extension: The modified HPKM-PINN introduces overlapping domain decomposition, partitioning the domain $\Phi_q$ 4 into overlapping subdomains $\Phi_q$ 5 and assembling the global solution via partition-of-unity weighting. Each subdomain is equipped with a local MHPKM network and its own trainable fusion coefficient $\Phi_q$ 6. The global output is assembled as

$\Phi_q$ 7

where $\Phi_q$ 8 forms a partition of unity (Huang et al., 14 Nov 2025).

3. Benchmark Problems and Empirical Results

HPKM-PINN and its variants have been evaluated on a suite of canonical PDEs and mixed-frequency regression benchmarks:

Function Fitting (Mixed High–Low Frequency): For $\Phi_q$ 9 on $\phi$ 0 with high- and low-frequency components, HPKM-PINN (optimal $\phi$ 1) achieved $\phi$ 2 error of $\phi$ 3, outperforming pure MLP ( $\phi$ 4) and KAN ( $\phi$ 5).
1D Poisson Equation: For $\phi$ $ϕ$ 6 with exact $\phi$ $ϕ$ 7,
- PINN ( $\phi$ 8): $\phi$ 9,
- PIKAN ( $\Phi$ 0): $\Phi$ 1,
- HPKM ( $\Phi$ 2): $\Phi$ 3.
1D Advection, Convection–Diffusion: For $\Phi$ 4 and $\Phi$ 5, respectively, HPKM consistently found lower errors and faster loss convergence at problem-specific $\Phi$ 6 (typically $\Phi$ 7– $\Phi$ 8).
Helmholtz Equation (2D): For $\Phi$ 9 with $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 0 Dirichlet boundary, HPKM with $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 1 attained lowest $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 2 error ( $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 3) among compared baselines.
High-Frequency, High-Dimensional, and Multiscale Benchmarks: The modified HPKM-PINN with overlapping domain decomposition demonstrated marked improvements in high-frequency and high-dimensional settings. For $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 4D Helmholtz with $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 5, the method yielded $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 6, compared to failure of MLP (error $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 7) and moderate accuracy of KAN ( $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 8). For $u_{\mathrm{MLP}}(x;\theta) = \Phi^L \circ \cdots \circ \Phi^1(x), \qquad \Phi^{(\ell)}(z) = \sigma(W^{(\ell)}z + b^{(\ell)})$ 9D Poisson, comparable $\tanh$ 0 ( $\tanh$ 1) was achieved with concurrent reduction in per-network parameter count (Huang et al., 14 Nov 2025).

Empirical performance summary

PDE / Task	Model	Param Count	Rel. $\tanh$ 2 Error	Wall Clock (s)
1D Poisson	PINN [1,20,20,1]	481	$\tanh$ 3	88.1
	PIKAN [1,30,30,1]	9600	$\tanh$ 4	696.4
	HPKM-PINN	10481	$\tanh$ 5	858.7
1D Advection	PINN [2,20,20,20,1]	921	$\tanh$ 6	51.5
	PIKAN [2,5,5,1]	400	$\tanh$ 7	283.6
	HPKM-PINN	1321	$\tanh$ 8	353.6

These results highlight consistent relative error reductions—up to two orders of magnitude—by hybridizing KAN and MLP outputs.

4. Advantages, Limitations, and Algorithmic Insights

Principal advantages documented include significantly reduced approximation error (up to two orders of magnitude over standard PINN or KAN-only PINN), accelerated convergence to low-loss regimes, and superior robustness against data noise, especially under Gaussian perturbations. HPKM-PINN’s fusion parameter $\tanh$ 9 furnishes a practical mechanism to interpolate between the spectral coverage of MLP (low-frequency, smooth components) and KAN’s sensitivity to high-frequency, nonlinear features, without alteration to overall network topologies. Domain decomposition in the modified formulation further reduces global optimization complexity by distributing frequency content and PDE variability over localized subnets.

Notable limitations are increased parameter count and wall-clock time (approximately double relative to single-branch approaches in standard implementations), the lack of an automated protocol for optimal $u_{\mathrm{KAN}}(x)$ 0 (or $u_{\mathrm{KAN}}(x)$ 1) selection, and scaling challenges for very high-dimensional or large-scale PDEs. Prospective improvements include adaptive or learned branch fusion strategies and structural regularization via pruning or depth adjustment (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).

5. Overlapping Domain Decomposition and Adaptive Weighting

The extension of HPKM-PINN to include overlapping domain decomposition targets the curse of dimensionality, multiscale features, and high-frequency regimes often present in complex PDE systems (Huang et al., 14 Nov 2025). The algorithm consists of dividing the global domain $u_{\mathrm{KAN}}(x)$ 2 into overlapping subdomains $u_{\mathrm{KAN}}(x)$ 3, each equipped with a local HPKM architecture with an independently trained fusion weight. Transition and window functions $u_{\mathrm{KAN}}(x)$ 4 construct a partition of unity, ensuring global prediction smoothness and continuity. The convex weight between KAN/MLP local predictors is made learnable, with $u_{\mathrm{KAN}}(x)$ 5 (e.g., sigmoid mappings) steering representation capacity as necessitated by local frequency or regularity.

Empirical findings from benchmarks such as high-frequency 2D Helmholtz, 2D/5D Poisson, and nonlinear reaction–diffusion equations demonstrate that MHPKM-PINN consistently attains lower $u_{\mathrm{KAN}}(x)$ 6 errors versus MLP-only and KAN-only baselines—with robustness to increasing frequency and dimensional complexity unattainable via other architectures at equivalent parameterization.

6. Context and Research Trajectory

HPKM-PINN and its domain-decomposed variant address key challenges that have limited the standard PINN paradigm, notably spectral bias toward low frequencies (MLP), and representational rigidity or parameter inefficiency (KAN). By adaptively blending branches—either via hand-tuned or trainable weights—and employing parallelism, hybridization, and domain partitioning, these methods furnish a systematic improvement in the modeling and computational resolution of challenging multiscale PDEs and oscillatory functional regression tasks. Open research directions include automated or regionally adaptive fusion mechanisms, scaling strategies for extreme dimensions, and further exploration of model pruning and depth-heterogeneity across subdomains (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).

7. Summary Table: Key HPKM-PINN Features and Comparisons

Feature	HPKM-PINN	PINN (MLP)	PIKAN (KAN)
Branch types	KAN + MLP (parallel)	MLP	KAN
Fusion weight	$u_{\mathrm{KAN}}(x)$ 7 (fixed or learned)	---	---
Domain decomposition	Optional (modified model only)	No	No
Frequency handling	Adaptive (high/low)	Low freq. bias	High freq. bias
Typical error reduction	Up to $u_{\mathrm{KAN}}(x)$ 8	---	---
Parameter cost	$u_{\mathrm{KAN}}(x)$ 9 PINN/PIKAN	Baseline	Baseline
Robustness to noise	High	Moderate	Moderate-to-high

HPKM-PINN constitutes a versatile, modular enhancement within the PINN ecosystem, enabling tunable and efficient resolution of multi-frequency, high-dimensional, and noise-challenged PDEs by leveraging explicit feature fusion and, where needed, spatial domain partitioning (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).

Markdown Report Issue Upgrade to Chat

References (2)

Enhancing Physics-Informed Neural Networks with a Hybrid Parallel Kolmogorov-Arnold and MLP Architecture (2025)

The modified Physics-Informed Hybrid Parallel Kolmogorov--Arnold and Multilayer Perceptron Architecture with domain decomposition (2025)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Hybrid Parallel Kolmogorov-Arnold/MLP PINNs (HPKM-PINN).