PO-CKAN: Physics-Informed Operator Learning

Updated 6 March 2026

PO-CKAN is a neural operator learning framework that integrates a DeepONet branch–trunk architecture with chunkwise rational KAN modules to efficiently approximate parametric PDE solutions.
Its chunkwise rational activations reduce parameter count and FLOPs while maintaining full connectivity, leading to improved shock capturing and detailed geometry resolution.
PO-CKAN employs composite PINN-style loss functions to enforce physical constraints, achieving enhanced physical fidelity and sharper convergence across diverse PDE benchmarks.

PO-CKAN (Physics-Informed Deep Operator Kolmogorov–Arnold Network with Chunk Rational Structure) is a neural operator learning framework that integrates DeepONet-style architecture with chunkwise rational Kolmogorov–Arnold Network (CKAN) modules and enforces physics-informed constraints. It is specifically engineered to approximate solution operators for parametric families of partial differential equations (PDEs) and achieves substantial improvements in expressivity, efficiency, and physical fidelity compared to standard MLP or KAN-based approaches (Wu et al., 9 Oct 2025).

1. Architecture: DeepONet with CKAN Sub-networks

PO-CKAN is built upon the DeepONet “branch–trunk” paradigm for operator learning:

Branch Network (Bₜ): Encodes the input function $u(y)$ (e.g., initial/boundary condition, discretized over $p$ sensors) into a feature vector $b = (b_1, ..., b_p) \in \mathbb{R}^p$ .
Trunk Network (Tₜ): Maps a spatio-temporal coordinate $x$ to $p$ basis functions $t(x) = (t_1(x), ..., t_p(x)) \in \mathbb{R}^p$ .
Operator Output: The solution at coordinate $x$ is obtained as

$s(x) = G(u)(x) \approx \sum_{k=1}^p b_k t_k(x)$

(see Eq. (1) in (Wu et al., 9 Oct 2025)).

Uniquely, both branch and trunk networks are instantiated as chunkwise rational KANs (CKANs), rather than standard MLPs or vanilla KANs.

CKAN Layer

CKAN addresses the quadratic parameter bottleneck of vanilla KANs by:

Rational Activations: Each edge activation is parameterized as

$\varphi(x) = w F(x), \quad F(x) = a x + b + \sum_{k=1}^{n/2} \frac{c_k x + d_k}{(x-e_k)^2 + f_k^2 + \epsilon}$

where $\{a, b, c_k, d_k, e_k, f_k\}$ are learnable, degree $n$ is even, and $\epsilon > 0$ ensures no poles (Eq. (4)).

Chunkwise Sharing: Inputs/outputs are partitioned into $c \times c$ chunks. Within each chunk, edges share the same rational base function $F_{m,n}(\cdot)$ but maintain individual scalar weights $w_{ij}$ . This reduces parameter count and FLOPs from $O(d_\text{in}\cdot d_\text{out})$ to $O(c^2)$ for the rational basis, while preserving full connectivity.

Parameter and FLOP comparison for a single layer:

Model	Parameters	FLOPs
MLP	$d_\text{in} \cdot d_\text{out} + d_\text{out}$	—
KAN	$\sim d_\text{in} \cdot d_\text{out} \cdot (G+K+3) + d_\text{out}$	$\sim O(d_\text{in} \cdot d_\text{out} \cdot K \cdot G)$
CKAN	$d_\text{in} \cdot d_\text{out} + d_\text{out} + (2n+2)c^2$	$(4.5 n+1) d_\text{in} c + 2(d_\text{in} d_\text{out})$

This organization delivers the expressivity of KANs with a tractable memory and computational profile (Wu et al., 9 Oct 2025).

2. Physics-Informed Losses and Constraints

PO-CKAN enforces the underlying physics of the target PDE via a composite, PINN-style loss:

$\mathcal{L}(\theta) = \lambda_\text{data} \mathcal{L}_\text{data} + \lambda_\text{ic} \mathcal{L}_\text{ic} + \lambda_\text{bc} \mathcal{L}_\text{bc} + \lambda_r \mathcal{L}_r$

with data, initial-condition, boundary-condition, and PDE residual terms respectively.

Data Loss:

$\mathcal{L}_\text{data} = \frac{1}{N_d} \sum_{i=1}^{N_d} \left\| G_\theta(u_i)(y_i) - s_i(y_i) \right\|_2^2$

Initial Condition Loss:

$\mathcal{L}_\text{ic} = \frac{1}{N_\text{ic}} \sum_{j=1}^{N_\text{ic}} \left\| G_\theta(u_j)(y_j^\text{ic,0}) - s_0(y_j^\text{ic}) \right\|_2^2$

Boundary Condition Loss (Dirichlet):

$\mathcal{L}_\text{bc} = \frac{1}{N_\text{bc}} \sum_{j=1}^{N_\text{bc}} \left\| G_\theta(u_j)(y_j^\text{bc}) - s_\text{bc}(y_j^\text{bc}) \right\|_2^2$

PDE Residual Loss:

$\mathcal{L}_r = \frac{1}{N_p} \sum_{j=1}^{N_p} \left\| \mathcal{R}(u_j, G_\theta(u_j))(y_j^\text{phys}) \right\|_2^2$

where $\mathcal{R}(u, s)$ is the PDE residual; all required derivatives are computed via automatic differentiation. The hyperparameters $\lambda_*$ are chosen per problem (Wu et al., 9 Oct 2025).

3. Training Protocol and Benchmark Problems

All benchmarks employ the Adam optimizer and train solely with physics-informed losses (no paired input–solution data beyond IC/BC). Three canonical testbeds illustrate the method:

Burgers’ Equation (1D): For $\nu \in \{0.05, 0.03, 0.01\}$ , with $u$ sampled from a Gaussian random field, $N_\text{train} = 1500$ , $N_\text{test} = 500$ cases (101 time-snapshots each). Network: 4 CKAN layers ( $1\times1$ ), $n=4$ , 100 units/layer. Baseline: 4 $\times$ 100 MLP (PI-DeepONet).
Eikonal Equation (2D): Domain $[-2,2]^2$ , random circle boundaries. Network: 4 CKAN layers ( $2\times2$ ), $n=4$ , 50 units/layer.
Diffusion–Reaction: $D = k = 0.01$ . IC/BC homogeneous, 5 layers of 50 units (CKAN $2\times2$ , $n=4$ rational units).

No ground-truth data (beyond required IC/BC) is used—only the PINN composite loss guides learning (Wu et al., 9 Oct 2025).

4. Quantitative Performance and Expressivity

Across all benchmarks, PO-CKAN demonstrates marked improvements over PI-DeepONet and baseline PINN variants.

Problem / Metric	PI-DeepONet Error	PO-CKAN Error	Improvement
Burgers' ( $\nu=0.01$ )	$6.23 \times 10^{-2}$	$3.21 \times 10^{-2}$	~48% reduction
Eikonal (2D)	$> 1 \times 10^{-1}$	$5.10 \times 10^{-3}$	> 20 $\times$ lower
Diffusion-Reaction	$5.19 \times 10^{-3}$	$2.58 \times 10^{-3}$	>50% reduction
Fractional PDE	$1.32 \times 10^{-1}$	$2.54 \times 10^{-2}$	~80% reduction

Test-loss for Eikonal converges two orders of magnitude lower; max absolute error (0.016 vs. 2.5) is similarly improved. Results generalize across parametric variations, input regularities, and geometric complexity (Wu et al., 9 Oct 2025).

PO-CKAN’s chunkwise rational structure yields:

Substantial parameter reduction compared to vanilla KANs ( $O(c^2)$ rational base functions vs $O(d_\text{in} d_\text{out})$ ).
$10\times$ fewer FLOPs compared to B-spline KANs.
Enhanced convergence and representational capacity, evident in sharper shock capturing (Burgers’) and finer geometric detail (Eikonal).
Consistent outperformance over deep MLP or standard operator network baselines.

5. Advantages, Limitations, and Research Directions

Advantages

Parameter/FLOP Efficiency: CKAN’s chunkwise rational activations enable full connectivity with tractable scaling, supporting larger, deeper, or more expressive models.
Physical Consistency: Integrated PINN losses guarantee that solutions respect the underlying PDE constraints (no ground-truth data required except at boundaries/initialization).
Generalization: PO-CKAN is effective across diverse PDE families, including parametric, nonlinear, and fractional-order equations.

Limitations

Adaptive Complexity: Fixed chunking and rational order may limit local adaptivity. Adaptive strategies for chunk granularity or rational degree are needed for sharp local features or heterogeneous domains.
Geometry: Extension to arbitrary or highly complex domains requires additional machinery (e.g., domain decomposition, meshless collocation, XPINNs).
Uncertainty Quantification: No native quantification of prediction uncertainty; prospective Bayesian or ensemble PINN extensions would address this.
Scalability: While chunked, the architecture still entails nontrivial computational cost for very high-dimensional problems; exploiting chunk-based parallelism (e.g., on multi-GPU/TPU) can further scale inference in large domains.

Future Research Directions

Adaptive CKANs with local refinement.
Integration with meshless or domain-decomposition methods for complex domains.
Bayesian extensions for uncertainty quantification in sparse/noisy data regimes.
Hardware acceleration leveraging CKAN’s chunk structure for large-scale, real-time operator learning (Wu et al., 9 Oct 2025).

The PO-CKAN framework extends and complements advances in physics-informed neural operator learning. AC-PKAN incorporates attention and Chebyshev polynomial bases to address expressivity and rank-collapse syndrome, using wavelet-activated MLPs with internal and external attention (Residue-Gradient Attention). This preserves a full-rank Jacobian and guarantees universal PDE approximation, albeit at higher computational cost than plain MLPs (Zhang et al., 13 May 2025).

A plausible implication is that PO-CKAN’s chunkwise rational activation design, when combined with advanced weighting schemes or alternative polynomial bases (e.g., orthogonal Chebyshev/Jacobi systems), could further enhance accuracy, stability, and scalability. This suggests a promising deployment template for operator learning in data-sparse regimes and complex, real-world engineering workflows (Zhang et al., 13 May 2025).

Markdown Report Issue Upgrade to Chat

References (2)

PO-CKAN:Physics Informed Deep Operator Kolmogorov Arnold Networks with Chunk Rational Structure (2025)

AC-PKAN: Attention-Enhanced and Chebyshev Polynomial-Based Physics-Informed Kolmogorov-Arnold Networks (2025)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to PO-CKAN.

PO-CKAN: Physics-Informed Operator Learning

1. Architecture: DeepONet with CKAN Sub-networks

CKAN Layer

2. Physics-Informed Losses and Constraints

3. Training Protocol and Benchmark Problems

4. Quantitative Performance and Expressivity

5. Advantages, Limitations, and Research Directions

Advantages

Limitations

Future Research Directions

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

PO-CKAN: Physics-Informed Operator Learning

1. Architecture: DeepONet with CKAN Sub-networks

CKAN Layer

2. Physics-Informed Losses and Constraints

3. Training Protocol and Benchmark Problems

4. Quantitative Performance and Expressivity

5. Advantages, Limitations, and Research Directions

Advantages

Limitations

Future Research Directions

6. Related Developments and Broader Context

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research