Papers
Topics
Authors
Recent
Search
2000 character limit reached

Kolmogorov-Arnold Neuro-Fuzzy Inference

Updated 10 February 2026
  • KANFIS is a neuro-symbolic system that uses additive fuzzy rule superposition to overcome exponential rule complexity.
  • It leverages the Kolmogorov–Arnold representation to achieve linear parameter scaling and explicitly model uncertainty.
  • Empirical results demonstrate that KANFIS outperforms traditional ANFIS and neural baselines while offering interpretable and sparse rule sets.

The Kolmogorov-Arnold Neuro-Fuzzy Inference System (KANFIS) is a neuro-symbolic framework designed to address the challenge of exponential rule complexity in neuro-fuzzy inference by leveraging the Kolmogorov–Arnold additive representation. By unifying interpretable fuzzy reasoning with the additive decomposition of multivariate functions, KANFIS achieves both linear parameter scaling and explicit uncertainty modeling, while maintaining semantically transparent rule sets and competitive empirical performance relative to established neuro-fuzzy and neural baselines (Yong et al., 3 Feb 2026).

1. Mathematical Foundations

KANFIS builds on the classical Kolmogorov–Arnold superposition theorem, which states that any continuous multivariate function f ⁣:[0,1]DRf\colon [0,1]^D \rightarrow \mathbb{R} can be represented as

f(x)=q=02D  ϕq(p=1Dψqp(xp))f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)

with each ψqp\psi_{qp} and ϕq\phi_q a univariate continuous function. This decomposition motivates an alternative to the product-based rule firing employed in conventional Adaptive Neuro-Fuzzy Inference Systems (ANFIS).

In traditional ANFIS, Takagi–Sugeno–Kang (TSK) fuzzy system rules use the firing strength

wj(x)=i=1Dμij(xi),w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),

where μij\mu_{ij} is the membership function for the ii-th feature and jj-th rule. This rule formulation requires MDM^D rules for MM fuzzy sets per input and DD input dimensions, rapidly leading to intractable model sizes in high dimensions.

KANFIS replaces the product-based aggregation with an additive superposition. For RR rules, each rule consists of univariate fuzzy transforms per feature. For each input dimension ii and rule jj, KK fuzzy basis functions {μijk(xi)}k=1K\{\mu_{ijk}(x_i)\}_{k=1}^K are learned. The soft-antecedent is computed as

eij(xi)=k=1KDijk(xi),e_{ij}(x_i) = \sum_{k=1}^K D_{ijk}(x_i),

where Dijk(xi)D_{ijk}(x_i) is a Type-1 or Interval Type-2 (IT2) membership value. The total additive rule activation is

hj(x)=i=1Deij(xi).h_j(\mathbf{x}) = \sum_{i=1}^D e_{ij}(x_i).

A direct consequence is that both rule count and parameter complexity now scale linearly with DD instead of exponentially, conditioned on the number of rules RMDR \ll M^D (Yong et al., 3 Feb 2026).

2. Architecture and Structural Components

2.1 KANFIS Layer Structure

A KANFIS layer receives input x=(x1,,xD)\mathbf{x}=(x_1,\dots,x_D). For each edge between input xix_i and rule jj, KK fuzzy basis functions are learned:

Dijk(xi)={μijk(xi;θ)(Type-1) 12(μijk(xi)+μijk(xi))(IT2)D_{ijk}(x_i) = \begin{cases} \mu_{ijk}(x_i;\theta) & \text{(Type-1)} \ \tfrac{1}{2}\bigl(\overline{\mu}_{ijk}(x_i) + \underline{\mu}_{ijk}(x_i)\bigr) & \text{(IT2)} \end{cases}

Aggregation occurs across KK bases and DD features: eij(xi)=k=1KDijk(xi),hj(x)=i=1Deij(xi).e_{ij}(x_i) = \sum_{k=1}^K D_{ijk}(x_i), \quad h_j(\mathbf{x}) = \sum_{i=1}^D e_{ij}(x_i).

Multiple such layers can be stacked, with each output vector renormalized: h()Norm(h())\mathbf{h}^{(\ell)} \rightarrow \mathrm{Norm}(\mathbf{h}^{(\ell)}).

The final output is produced according to a Takagi–Sugeno linear consequent:

y^=wh(L)+b.\widehat{y} = \mathbf{w}^\top \mathbf{h}^{(L)} + b.

2.2 Sparse Masking Mechanism

To enhance interpretability by restricting each rule to a limited subset of features, KANFIS applies a soft mask M[0,1]D×RM \in [0,1]^{D \times R}, yielding

eij(xi)=Mijk=1KDijk(xi).e_{ij}(x_i) = M_{ij} \sum_{k=1}^K D_{ijk}(x_i).

An entropy regularization

Rsparse=1DRi=1Dj=1R[MijlogMij(1Mij)log(1Mij)]R_{\rm sparse} = \frac{1}{D R} \sum_{i=1}^D \sum_{j=1}^R \left[-M_{ij}\log M_{ij} - (1-M_{ij})\log(1-M_{ij})\right]

pushes MM toward binarization. Distinctiveness among rules is encouraged by penalizing high pairwise cosine similarity of rule activations:

Rdistinct=jhjhhj  h.R_{\rm distinct} = \sum_{j\neq \ell} \frac{\mathbf{h}_j\cdot \mathbf{h}_\ell}{\|\mathbf{h}_j\|\;\|\mathbf{h}_\ell\|}.

3. Fuzzy Logic and Uncertainty Representation

KANFIS supports both Type-1 and IT2 fuzzy logic.

3.1 Type-1 Fuzzy Sets

Type-1 membership functions can use Gaussian, Generalized Bell, or Sigmoid forms. The Gaussian type is defined as:

μijk(xi)=exp((xiμijk)22σijk2)\mu_{ijk}(x_i) = \exp\left(-\frac{(x_i-\mu_{ijk})^2}{2\sigma_{ijk}^2}\right)

where μijk\mu_{ijk} and σijk\sigma_{ijk} are learnable parameters.

3.2 Interval Type-2 Fuzzy Sets

Interval Type-2 (IT2) fuzzy sets model additional uncertainty. For each basis, two widths σijk(l)σijk(u)\sigma^{(l)}_{ijk} \leq \sigma^{(u)}_{ijk} define upper and lower membership functions:

μijk(xi)=exp((xiμijk)22[σijk(l)]2),μijk(xi)=exp((xiμijk)22[σijk(u)]2)\overline{\mu}_{ijk}(x_i) = \exp\left(-\frac{(x_i-\mu_{ijk})^2}{2[\sigma^{(l)}_{ijk}]^2}\right), \quad \underline{\mu}_{ijk}(x_i) = \exp\left(-\frac{(x_i-\mu_{ijk})^2}{2[\sigma^{(u)}_{ijk}]^2}\right)

The crisp activation is their average. The region between these curves defines the Footprint of Uncertainty (FOU), providing explicit quantification of ambiguity in the fuzzy representation (Yong et al., 3 Feb 2026).

4. Learning, Optimization, and Regularization

The KANFIS training objective combines standard regression or classification loss with regularizers for sparsity and distinctiveness:

Ltotal=1Nn=1N(y^(n)y(n))2+λsRsparse+λdRdistinctL_{\rm total} = \frac{1}{N}\sum_{n=1}^N (\widehat{y}^{(n)} - y^{(n)})^2 + \lambda_s R_{\rm sparse} + \lambda_d R_{\rm distinct}

All parameters are optimized by backpropagation:

  • Membership centers μijk\mu_{ijk} and widths σijk\sigma_{ijk} are updated via chain-rule derivatives.
  • The soft mask MijM_{ij} receives a combined update from the task loss and the entropy regularizer.
  • Takagi–Sugeno consequent weights wjw_j and bias bb use standard linear updates.

This joint optimization enforces structural properties—sparsity (for feature selection per rule) and rule distinctiveness—alongside convergence on the predictive task.

5. Model Complexity, Scalability, and Interpretability

KANFIS fundamentally alters the curse of dimensionality characteristic of traditional neuro-fuzzy inference. In a conventional ANFIS system with MM fuzzy sets per feature, the required number of rules is MDM^D, resulting in O(MD×D)\mathcal{O}(M^D \times D) parameters.

KANFIS instead requires only RR rules, each with D×KD \times K fuzzy bases, producing O(RDK)\mathcal{O}(R\,D\,K) parameters and rule complexity that scales linearly in DD, with RMDR \ll M^D.

Rule semantics are enhanced by mask-enforced sparsity: at convergence, each hidden unit jj corresponds to a rule of the form, “IF xix_i is in fuzzy set kk for those ii with Mij1M_{ij} \approx 1, THEN output contribution is wjhj(x)w_j h_j(\mathbf{x}).” Thus, rules are concise and human-interpretable, and rule sets are compact and easily examined by domain experts.

6. Empirical Evaluation

Empirical results on five benchmark datasets indicate that both Type-1 and IT2 variants of KANFIS match or outperform baseline multilayer perceptron (MLP), ANFIS, and Kolmogorov–Arnold Network (KAN) models in regression and classification tasks. On the Combined Cycle Power Plant (CCPP) regression dataset:

  • MLP: RMSE = 4.1883
  • T1-ANFIS: RMSE = 3.9980
  • T1-KANFIS: RMSE = 3.9542
  • IT2-KANFIS: RMSE = 4.1240

For classification datasets including Breast Cancer, Spambase, and Medical Health Records, KANFIS achieves accuracy and F1 scores in the range $0.93–0.99$, generally outperforming both ANFIS and deep neural baselines while retaining a small, interpretable set of fuzzy rules (Yong et al., 3 Feb 2026).

Model Rule/Param Scaling Explicit Uncertainty Interpretable Rules Empirical RMSE (CCPP)
ANFIS Exponential (MDM^D) No No 3.9980
KANFIS (T1) Linear (RDKRDK) No Yes 3.9542
KANFIS (IT2) Linear (RDKRDK) Yes Yes 4.1240
MLP No No 4.1883

The data suggest that KANFIS architecture offers both scalability and interpretability, as well as accurate and uncertainty-aware predictions, by leveraging additive fuzzy rule superposition and explicit rule sparsity controls (Yong et al., 3 Feb 2026).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Kolmogorov-Arnold Neuro-Fuzzy Inference System (KANFIS).