Hybrid Parallel KAN/MLP PINNs
- The paper demonstrates that HPKM-PINN significantly reduces L2 error—up to two orders of magnitude—compared to standard PINN or KAN-only models.
- It employs a parallel hybrid strategy that blends KAN’s high-frequency resolution with MLP’s nonlinear expressivity via adaptive convex fusion.
- Domain decomposition and adaptive weighting enhance scalability and robustness, making HPKM-PINN ideal for multiscale and high-dimensional PDE challenges.
The Hybrid Parallel Kolmogorov–Arnold/MLP Physics-Informed Neural Network (HPKM-PINN) is a neural architecture that combines the explicit functional representation capabilities of Kolmogorov–Arnold Networks (KANs) with the spectral and nonlinear expressivity of standard Multi-Layer Perceptrons (MLPs) within the Physics-Informed Neural Networks (PINN) framework. Designed for the efficient and robust solution of partial differential equations (PDEs) and multi-frequency regression tasks, HPKM-PINN utilizes a parallel hybrid strategy and, in recent extensions, adaptive convex weighting and overlapping domain decomposition for improved accuracy and scalability (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).
1. Architecture and Mathematical Formalism
The canonical HPKM-PINN architecture consists of two parallel branches: a KAN branch and an MLP branch, both operating on the same input features (e.g., spatial and temporal coordinates).
- Kolmogorov–Arnold Network (KAN) Branch: Implements the Kolmogorov–Arnold representation theorem, which ensures that any continuous multivariate function on a compact domain can be decomposed as
with trainable univariate activations and . In practical implementations, each and is parameterized via basis expansions, for instance, employing B-splines and smooth nonlinearities. Alternative parameterizations, such as truncated Fourier series (Fourier-KAN), are employed in high-frequency settings (Huang et al., 14 Nov 2025).
- Multi-Layer Perceptron (MLP) Branch: Constitutes a conventional feedforward architecture,
with widths and depths tailored to the problem’s dimension and complexity, typically using activation functions such as or ReLU.
- Hybrid Parallel Fusion: The outputs of the KAN and MLP branches, and , are fused via a convex combination governed by a scalar parameter :
Enhanced variants introduce a trainable scalar and a monotonic mapping (e.g., sigmoid or tanh) for adaptive fusion during training:
This permits the weighting between KAN and MLP to adapt dynamically, yielding better balancing of frequency components and feature complexity (Huang et al., 14 Nov 2025).
2. Loss Construction and Optimization Protocols
HPKM-PINN adheres to the standard PINN composite loss paradigm for training:
- PDE Loss (Collocation/Residual Term): For a PDE of the form with initial/boundary conditions, the residual at each collocation point is defined as
with pointwise losses
- Initial and Boundary Conditions: Losses are imposed by
- Total Loss Function:
with , , and typically assigned weight 1, or tuned via grid search.
- Optimization: Training is conducted using Adam, with learning rates typically for regression tasks and for PDE-solving, sometimes incorporating learning-rate scheduling. Collocation points are sampled using Sobol or Latin-Hypercube sequences to ensure robust residual evaluation (Xu et al., 30 Mar 2025).
- Domain Decomposition Extension: The modified HPKM-PINN introduces overlapping domain decomposition, partitioning the domain into overlapping subdomains and assembling the global solution via partition-of-unity weighting. Each subdomain is equipped with a local MHPKM network and its own trainable fusion coefficient . The global output is assembled as
where forms a partition of unity (Huang et al., 14 Nov 2025).
3. Benchmark Problems and Empirical Results
HPKM-PINN and its variants have been evaluated on a suite of canonical PDEs and mixed-frequency regression benchmarks:
- Function Fitting (Mixed High–Low Frequency): For on with high- and low-frequency components, HPKM-PINN (optimal ) achieved error of , outperforming pure MLP () and KAN ().
- 1D Poisson Equation: For with exact ,
- PINN (): ,
- PIKAN (): ,
- HPKM (): .
- 1D Advection, Convection–Diffusion: For and , respectively, HPKM consistently found lower errors and faster loss convergence at problem-specific (typically $0.2$–$0.7$).
- Helmholtz Equation (2D): For with Dirichlet boundary, HPKM with attained lowest error ($0.23$) among compared baselines.
- High-Frequency, High-Dimensional, and Multiscale Benchmarks: The modified HPKM-PINN with overlapping domain decomposition demonstrated marked improvements in high-frequency and high-dimensional settings. For $2$D Helmholtz with , the method yielded , compared to failure of MLP (error $2.71$) and moderate accuracy of KAN (). For $5$D Poisson, comparable () was achieved with concurrent reduction in per-network parameter count (Huang et al., 14 Nov 2025).
Empirical performance summary
| PDE / Task | Model | Param Count | Rel. Error | Wall Clock (s) |
|---|---|---|---|---|
| 1D Poisson | PINN [1,20,20,1] | 481 | 88.1 | |
| PIKAN [1,30,30,1] | 9600 | 696.4 | ||
| HPKM-PINN | 10481 | 858.7 | ||
| 1D Advection | PINN [2,20,20,20,1] | 921 | 51.5 | |
| PIKAN [2,5,5,1] | 400 | 283.6 | ||
| HPKM-PINN | 1321 | 353.6 |
These results highlight consistent relative error reductions—up to two orders of magnitude—by hybridizing KAN and MLP outputs.
4. Advantages, Limitations, and Algorithmic Insights
Principal advantages documented include significantly reduced approximation error (up to two orders of magnitude over standard PINN or KAN-only PINN), accelerated convergence to low-loss regimes, and superior robustness against data noise, especially under Gaussian perturbations. HPKM-PINN’s fusion parameter furnishes a practical mechanism to interpolate between the spectral coverage of MLP (low-frequency, smooth components) and KAN’s sensitivity to high-frequency, nonlinear features, without alteration to overall network topologies. Domain decomposition in the modified formulation further reduces global optimization complexity by distributing frequency content and PDE variability over localized subnets.
Notable limitations are increased parameter count and wall-clock time (approximately double relative to single-branch approaches in standard implementations), the lack of an automated protocol for optimal (or ) selection, and scaling challenges for very high-dimensional or large-scale PDEs. Prospective improvements include adaptive or learned branch fusion strategies and structural regularization via pruning or depth adjustment (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).
5. Overlapping Domain Decomposition and Adaptive Weighting
The extension of HPKM-PINN to include overlapping domain decomposition targets the curse of dimensionality, multiscale features, and high-frequency regimes often present in complex PDE systems (Huang et al., 14 Nov 2025). The algorithm consists of dividing the global domain into overlapping subdomains , each equipped with a local HPKM architecture with an independently trained fusion weight. Transition and window functions construct a partition of unity, ensuring global prediction smoothness and continuity. The convex weight between KAN/MLP local predictors is made learnable, with (e.g., sigmoid mappings) steering representation capacity as necessitated by local frequency or regularity.
Empirical findings from benchmarks such as high-frequency 2D Helmholtz, 2D/5D Poisson, and nonlinear reaction–diffusion equations demonstrate that MHPKM-PINN consistently attains lower errors versus MLP-only and KAN-only baselines—with robustness to increasing frequency and dimensional complexity unattainable via other architectures at equivalent parameterization.
6. Context and Research Trajectory
HPKM-PINN and its domain-decomposed variant address key challenges that have limited the standard PINN paradigm, notably spectral bias toward low frequencies (MLP), and representational rigidity or parameter inefficiency (KAN). By adaptively blending branches—either via hand-tuned or trainable weights—and employing parallelism, hybridization, and domain partitioning, these methods furnish a systematic improvement in the modeling and computational resolution of challenging multiscale PDEs and oscillatory functional regression tasks. Open research directions include automated or regionally adaptive fusion mechanisms, scaling strategies for extreme dimensions, and further exploration of model pruning and depth-heterogeneity across subdomains (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).
7. Summary Table: Key HPKM-PINN Features and Comparisons
| Feature | HPKM-PINN | PINN (MLP) | PIKAN (KAN) |
|---|---|---|---|
| Branch types | KAN + MLP (parallel) | MLP | KAN |
| Fusion weight | (fixed or learned) | --- | --- |
| Domain decomposition | Optional (modified model only) | No | No |
| Frequency handling | Adaptive (high/low) | Low freq. bias | High freq. bias |
| Typical error reduction | Up to | --- | --- |
| Parameter cost | PINN/PIKAN | Baseline | Baseline |
| Robustness to noise | High | Moderate | Moderate-to-high |
HPKM-PINN constitutes a versatile, modular enhancement within the PINN ecosystem, enabling tunable and efficient resolution of multi-frequency, high-dimensional, and noise-challenged PDEs by leveraging explicit feature fusion and, where needed, spatial domain partitioning (Xu et al., 30 Mar 2025, Huang et al., 14 Nov 2025).