PO-CKAN: Physics-Informed Operator Learning
- PO-CKAN is a neural operator learning framework that integrates a DeepONet branch–trunk architecture with chunkwise rational KAN modules to efficiently approximate parametric PDE solutions.
- Its chunkwise rational activations reduce parameter count and FLOPs while maintaining full connectivity, leading to improved shock capturing and detailed geometry resolution.
- PO-CKAN employs composite PINN-style loss functions to enforce physical constraints, achieving enhanced physical fidelity and sharper convergence across diverse PDE benchmarks.
PO-CKAN (Physics-Informed Deep Operator Kolmogorov–Arnold Network with Chunk Rational Structure) is a neural operator learning framework that integrates DeepONet-style architecture with chunkwise rational Kolmogorov–Arnold Network (CKAN) modules and enforces physics-informed constraints. It is specifically engineered to approximate solution operators for parametric families of partial differential equations (PDEs) and achieves substantial improvements in expressivity, efficiency, and physical fidelity compared to standard MLP or KAN-based approaches (Wu et al., 9 Oct 2025).
1. Architecture: DeepONet with CKAN Sub-networks
PO-CKAN is built upon the DeepONet “branch–trunk” paradigm for operator learning:
- Branch Network (Bₜ): Encodes the input function (e.g., initial/boundary condition, discretized over sensors) into a feature vector .
- Trunk Network (Tₜ): Maps a spatio-temporal coordinate to basis functions .
- Operator Output: The solution at coordinate is obtained as
(see Eq. (1) in (Wu et al., 9 Oct 2025)).
Uniquely, both branch and trunk networks are instantiated as chunkwise rational KANs (CKANs), rather than standard MLPs or vanilla KANs.
CKAN Layer
CKAN addresses the quadratic parameter bottleneck of vanilla KANs by:
- Rational Activations: Each edge activation is parameterized as
where are learnable, degree is even, and ensures no poles (Eq. (4)).
- Chunkwise Sharing: Inputs/outputs are partitioned into chunks. Within each chunk, edges share the same rational base function but maintain individual scalar weights . This reduces parameter count and FLOPs from to for the rational basis, while preserving full connectivity.
Parameter and FLOP comparison for a single layer:
| Model | Parameters | FLOPs |
|---|---|---|
| MLP | — | |
| KAN | ||
| CKAN |
This organization delivers the expressivity of KANs with a tractable memory and computational profile (Wu et al., 9 Oct 2025).
2. Physics-Informed Losses and Constraints
PO-CKAN enforces the underlying physics of the target PDE via a composite, PINN-style loss:
with data, initial-condition, boundary-condition, and PDE residual terms respectively.
- Data Loss:
- Initial Condition Loss:
- Boundary Condition Loss (Dirichlet):
- PDE Residual Loss:
where is the PDE residual; all required derivatives are computed via automatic differentiation. The hyperparameters are chosen per problem (Wu et al., 9 Oct 2025).
3. Training Protocol and Benchmark Problems
All benchmarks employ the Adam optimizer and train solely with physics-informed losses (no paired input–solution data beyond IC/BC). Three canonical testbeds illustrate the method:
- Burgers’ Equation (1D): For , with sampled from a Gaussian random field, , cases (101 time-snapshots each). Network: 4 CKAN layers (), , 100 units/layer. Baseline: 4100 MLP (PI-DeepONet).
- Eikonal Equation (2D): Domain , random circle boundaries. Network: 4 CKAN layers (), , 50 units/layer.
- Diffusion–Reaction: . IC/BC homogeneous, 5 layers of 50 units (CKAN , rational units).
No ground-truth data (beyond required IC/BC) is used—only the PINN composite loss guides learning (Wu et al., 9 Oct 2025).
4. Quantitative Performance and Expressivity
Across all benchmarks, PO-CKAN demonstrates marked improvements over PI-DeepONet and baseline PINN variants.
| Problem / Metric | PI-DeepONet Error | PO-CKAN Error | Improvement |
|---|---|---|---|
| Burgers' () | ~48% reduction | ||
| Eikonal (2D) | > 20 lower | ||
| Diffusion-Reaction | >50% reduction | ||
| Fractional PDE | ~80% reduction |
Test-loss for Eikonal converges two orders of magnitude lower; max absolute error (0.016 vs. 2.5) is similarly improved. Results generalize across parametric variations, input regularities, and geometric complexity (Wu et al., 9 Oct 2025).
PO-CKAN’s chunkwise rational structure yields:
- Substantial parameter reduction compared to vanilla KANs ( rational base functions vs ).
- fewer FLOPs compared to B-spline KANs.
- Enhanced convergence and representational capacity, evident in sharper shock capturing (Burgers’) and finer geometric detail (Eikonal).
- Consistent outperformance over deep MLP or standard operator network baselines.
5. Advantages, Limitations, and Research Directions
Advantages
- Parameter/FLOP Efficiency: CKAN’s chunkwise rational activations enable full connectivity with tractable scaling, supporting larger, deeper, or more expressive models.
- Physical Consistency: Integrated PINN losses guarantee that solutions respect the underlying PDE constraints (no ground-truth data required except at boundaries/initialization).
- Generalization: PO-CKAN is effective across diverse PDE families, including parametric, nonlinear, and fractional-order equations.
Limitations
- Adaptive Complexity: Fixed chunking and rational order may limit local adaptivity. Adaptive strategies for chunk granularity or rational degree are needed for sharp local features or heterogeneous domains.
- Geometry: Extension to arbitrary or highly complex domains requires additional machinery (e.g., domain decomposition, meshless collocation, XPINNs).
- Uncertainty Quantification: No native quantification of prediction uncertainty; prospective Bayesian or ensemble PINN extensions would address this.
- Scalability: While chunked, the architecture still entails nontrivial computational cost for very high-dimensional problems; exploiting chunk-based parallelism (e.g., on multi-GPU/TPU) can further scale inference in large domains.
Future Research Directions
- Adaptive CKANs with local refinement.
- Integration with meshless or domain-decomposition methods for complex domains.
- Bayesian extensions for uncertainty quantification in sparse/noisy data regimes.
- Hardware acceleration leveraging CKAN’s chunk structure for large-scale, real-time operator learning (Wu et al., 9 Oct 2025).
6. Related Developments and Broader Context
The PO-CKAN framework extends and complements advances in physics-informed neural operator learning. AC-PKAN incorporates attention and Chebyshev polynomial bases to address expressivity and rank-collapse syndrome, using wavelet-activated MLPs with internal and external attention (Residue-Gradient Attention). This preserves a full-rank Jacobian and guarantees universal PDE approximation, albeit at higher computational cost than plain MLPs (Zhang et al., 13 May 2025).
A plausible implication is that PO-CKAN’s chunkwise rational activation design, when combined with advanced weighting schemes or alternative polynomial bases (e.g., orthogonal Chebyshev/Jacobi systems), could further enhance accuracy, stability, and scalability. This suggests a promising deployment template for operator learning in data-sparse regimes and complex, real-world engineering workflows (Zhang et al., 13 May 2025).