KA-CRNN: Interpretable Kinetic Modeling
- KA-CRNN is a framework that generalizes chemical reaction neural networks by parameterizing kinetic parameters as continuous functions of external variables like SOC or pressure using univariate Kolmogorov–Arnold activations.
- It integrates a CRNN core with KAN layers to capture SOC- or pressure-dependent kinetic behaviors while strictly enforcing Arrhenius and mass action laws.
- Demonstrated applications in Li-ion cathode decomposition and pressure-dependent kinetics show high model fidelity with low mean squared error and enhanced interpretability.
The Kolmogorov–Arnold Chemical Reaction Neural Network (KA-CRNN) framework generalizes classical Chemical Reaction Neural Networks by representing each kinetic parameter as a continuous function of an external variable—typically state of charge (SOC) or pressure—using univariate Kolmogorov–Arnold Network (KAN) activations. This approach enables fully interpretable, assumption-free inference of extended kinetic behaviors (such as SOC- or pressure-dependent rates and enthalpies) from data, while preserving strict adherence to Arrhenius and mass action laws. KA-CRNNs have been demonstrated in the context of learning SOC-dependent thermal decomposition kinetics for Li-ion cathodes and pressure-dependent elementary chemical kinetics, providing accurate, physically constrained, and highly interpretable kinetic models (Koenig et al., 17 Dec 2025, Koenig et al., 10 Nov 2025).
1. Theoretical Foundations
1.1. Kolmogorov–Arnold Representation Theorem
The Kolmogorov–Arnold representation theorem states that any continuous multivariate function can be decomposed as
with and continuous univariate functions. For KA-CRNNs, this principle motivates a neural architecture in which each kinetic parameter—such as pre-exponential factor , activation energy , or enthalpy change —is represented as a learnable continuous function of a single external variable (e.g., SOC or pressure) via a univariate KAN layer.
1.2. Motivation and Distinction from Standard CRNNs
Standard CRNNs fit scalar kinetic parameters (fixed in the external variable), so they cannot represent continuous dependence on SOC, pressure, or other coordinates. In KA-CRNNs, each parameter (e.g., , with denoting SOC or ) is parameterized as a continuous function using learnable basis expansions, enabling the model to mirror gradual or abrupt kinetic regime changes. This yields an interpretable, continuous mapping from operational variables to mechanistic rates (Koenig et al., 17 Dec 2025, Koenig et al., 10 Nov 2025).
2. Network Architecture and Chemical Encoding
2.1. Structural Overview
| Input | Hidden Layers | Output |
|---|---|---|
| External variable (SOC/Pressure) | Bank of KAN activations for each kinetic parameter | ODE-integrated species trajectories |
| Temperature profile | Arrhenius/mass-action feed-forward CRNN core | Model heat release or concentration |
| Species initial mass fractions |
In the thermal decomposition (SOC-dependent) setting (Koenig et al., 17 Dec 2025):
- Input: normalized SOC , imposed , latent .
- Hidden: for each scalar kinetic parameter , a Chebyshev-polynomial KAN computes ; the CRNN core computes .
- Output: species mass fraction trajectories and heat release rate .
Mechanistically, the reaction network is embedded in the architecture:
- R1: (“layered spinel”); R2: (“spinel rock-salt + O”); R3: (O-driven electrolyte oxidation).
- R1 and R2: SOC-dependent kinetics; R3: fixed parameters, rate proportional to O produced in R2.
For pressure-dependent reactions (Koenig et al., 10 Nov 2025), the architecture is analogous, with normalized as input, KAN layers for each parameter, and the reaction network ODE integrated accordingly.
2.2. Physical Encoding and Interpretability
Mechanistic reaction topologies are strictly enforced by the architectural design. Each kinetic parameter has a single interpretable univariate mapping (e.g., , ), which can be visualized or symbolically regressed post hoc.
3. Mathematical Framework
3.1. Rate Law and ODE System
Each elemental reaction is governed by:
where is SOC or pressure as dictated by the application. In log space:
For the SOC-dependent Li-ion cathode decomposition:
- , , correspond to layered, spinel, and O mass fractions, respectively.
- The ODE system, \begin{align*} \frac{dc_1}{dt} &= -r_1 \ \frac{dc_2}{dt} &= +r_1 - r_2 \ \frac{dc_3}{dt} &= +\nu(z) r_2 - r_3 \end{align*}
- Total heat release:
3.2. KAN Basis Expansion for Kinetic Parameters
Each SOC- or pressure-dependent parameter is parameterized as:
In practice, is typical for SOC or pressure; the set is learned to fit observed data.
3.3. Loss Function and Regularization
The loss,
includes:
- , which penalizes non-monotonicity in key parameters (e.g., ensures and are non-decreasing in SOC),
- , which enforces ,
- , which prevents pathological parameter values.
This structure is analogous when training on pressure-dependent data, with terms modified for the relevant physical and kinetic constraints.
4. Training Methodology
4.1. Data Preprocessing and Sampling
For SOC-dependent kinetics, experimental DSC scans are collected at multiple linearly spaced SOCs per cathode chemistry; one scan is held out for testing. Heat release is computed as , and SOC is normalized to . For pressure dependence, training sets consist of trajectory data over a grid of temperatures and normalized pressures.
4.2. Neural ODE Integration and Optimization
The network integrates the chemical reaction ODEs (species and heat) using Neural-ODE libraries (e.g., DifferentialEquations.jl). Gradients for all model parameters (KAN weights, initial conditions, etc.) are computed via forward-mode automatic differentiation (e.g., ForwardDiff.jl).
Optimization employs the ADAM algorithm, typically with step size ; training proceeds until plateau, generally – steps. Physics-informed penalties are incorporated to avoid overfitting and to regularize model extrapolation.
4.3. Regularization and Physicality
Regularizers enforce physically plausible monotonicity, parameter range, kinetic order constraints, and smoothness in the KAN representations. For pressure-dependent cases, initialization at the known high-pressure (or high-SOC) limiting parameter values expedites convergence.
5. Interpretability and Model Validation
5.1. Parameter Variation with External Variables
Learned parameter functions capture known or hypothesized physical transitions:
- In Li-ion cathode decomposition (Koenig et al., 17 Dec 2025), , , and the O stoichiometric jump sharply at a “critical SOC,” recapitulating the lattice oxygen release and phase transformation reported in literature.
- The functional dependence is continuous, interpretable, and can be visualized directly.
- In pressure-dependent kinetics (Koenig et al., 10 Nov 2025), learned KAN curves reproduce expected falloff behavior and are directly human-inspectable.
5.2. Mechanistic and Domain Insight
Interpretability is enforced at two levels: (1) strict physical/chemical network encoding in the CRNN ODE, (2) mapping each parameter to an explicitly learnable function of the external variable via a sparse basis. Sudden regime transitions in kinetic parameters align with domain knowledge (e.g., abrupt O release at high SOC, pressure falloff in combustion). Symbolic regression can extract closed-form kinetic laws from learned KANs.
5.3. Generalization Fidelity
KA-CRNNs reproduce all training curves with low mean squared error and correctly interpolate to held-out external variable values (e.g., mid-range SOC or pressure). The continuous basis eliminates artifacts at intermediate values and outperforms piecewise or discrete fits. Quantitative metrics on test sets (e.g., for thermal peak properties) demonstrate model fidelity.
6. Extensions, Limitations, and Applications
6.1. Framework Generality
KA-CRNNs can be immediately extended to:
- Additional external coordinates (multivariate KANs for , , SOC, etc.),
- More complex reaction networks (a KAN per parameter per step, or sharing among reactions),
- Symbolic extraction of interpretable kinetic rate laws from learned representations.
6.2. Limitations and Recommendations
Current deployments focus on univariate external variable dependencies; multidimensional KAN layers would extend capability but increase data and computational demands. For sparse data (Koenig et al., 10 Nov 2025), incorporating analytic priors or hierarchical approaches may be essential. In stiff or highly multi-scale ODE systems, advanced solvers or architectural regularizations are advisable.
6.3. Application Contexts
Recommended applications include:
- Thermal decomposition and abuse prediction for Li-ion batteries across all SOCs (Koenig et al., 17 Dec 2025),
- Pressure- or concentration-dependent kinetics in combustion, catalysis, atmospheric chemistry,
- Enzymatic and materials science scenarios with field-coupled kinetics (Koenig et al., 10 Nov 2025).
7. Comparative and Methodological Significance
KA-CRNNs offer a domain-informed, physics-consistent approach for inferring complex, continuous kinetic parameter dependencies directly from data. They outperform black-box neural networks and standard interpolative models (e.g., PLOG) both in accuracy and interpretability. All kinetic dependencies remain physically explicit and subject to posterior analysis. This suggests KA-CRNNs are a broadly applicable tool for kinetic inference when external variable dependence is critical and interpretability essential (Koenig et al., 17 Dec 2025, Koenig et al., 10 Nov 2025).