Kolmogorov-Arnold Neuro-Fuzzy Inference

Updated 10 February 2026

KANFIS is a neuro-symbolic system that uses additive fuzzy rule superposition to overcome exponential rule complexity.
It leverages the Kolmogorov–Arnold representation to achieve linear parameter scaling and explicitly model uncertainty.
Empirical results demonstrate that KANFIS outperforms traditional ANFIS and neural baselines while offering interpretable and sparse rule sets.

The Kolmogorov-Arnold Neuro-Fuzzy Inference System (KANFIS) is a neuro-symbolic framework designed to address the challenge of exponential rule complexity in neuro-fuzzy inference by leveraging the Kolmogorov–Arnold additive representation. By unifying interpretable fuzzy reasoning with the additive decomposition of multivariate functions, KANFIS achieves both linear parameter scaling and explicit uncertainty modeling, while maintaining semantically transparent rule sets and competitive empirical performance relative to established neuro-fuzzy and neural baselines (Yong et al., 3 Feb 2026).

1. Mathematical Foundations

KANFIS builds on the classical Kolmogorov–Arnold superposition theorem, which states that any continuous multivariate function $f\colon [0,1]^D \rightarrow \mathbb{R}$ can be represented as

$f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$

with each $\psi_{qp}$ and $\phi_q$ a univariate continuous function. This decomposition motivates an alternative to the product-based rule firing employed in conventional Adaptive Neuro-Fuzzy Inference Systems (ANFIS).

In traditional ANFIS, Takagi–Sugeno–Kang (TSK) fuzzy system rules use the firing strength

$w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$

where $\mu_{ij}$ is the membership function for the $i$ -th feature and $j$ -th rule. This rule formulation requires $M^D$ rules for $M$ fuzzy sets per input and $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 0 input dimensions, rapidly leading to intractable model sizes in high dimensions.

KANFIS replaces the product-based aggregation with an additive superposition. For $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 1 rules, each rule consists of univariate fuzzy transforms per feature. For each input dimension $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 2 and rule $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 3, $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 4 fuzzy basis functions $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 5 are learned. The soft-antecedent is computed as

$f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 6

where $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 7 is a Type-1 or Interval Type-2 (IT2) membership value. The total additive rule activation is

$f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 8

A direct consequence is that both rule count and parameter complexity now scale linearly with $f(\mathbf{x}) = \sum_{q=0}^{2D}\; \phi_q\left(\sum_{p=1}^D \psi_{qp}(x_p)\right)$ 9 instead of exponentially, conditioned on the number of rules $\psi_{qp}$ 0 (Yong et al., 3 Feb 2026).

2. Architecture and Structural Components

2.1 KANFIS Layer Structure

A KANFIS layer receives input $\psi_{qp}$ 1. For each edge between input $\psi_{qp}$ 2 and rule $\psi_{qp}$ 3, $\psi_{qp}$ 4 fuzzy basis functions are learned:

$\psi_{qp}$ 5

Aggregation occurs across $\psi_{qp}$ 6 bases and $\psi_{qp}$ 7 features: $\psi_{qp}$ 8

Multiple such layers can be stacked, with each output vector renormalized: $\psi_{qp}$ 9.

The final output is produced according to a Takagi–Sugeno linear consequent:

$\phi_q$ 0

2.2 Sparse Masking Mechanism

To enhance interpretability by restricting each rule to a limited subset of features, KANFIS applies a soft mask $\phi_q$ 1, yielding

$\phi_q$ 2

An entropy regularization

$\phi_q$ 3

pushes $\phi_q$ 4 toward binarization. Distinctiveness among rules is encouraged by penalizing high pairwise cosine similarity of rule activations:

$\phi_q$ 5

3. Fuzzy Logic and Uncertainty Representation

KANFIS supports both Type-1 and IT2 fuzzy logic.

3.1 Type-1 Fuzzy Sets

Type-1 membership functions can use Gaussian, Generalized Bell, or Sigmoid forms. The Gaussian type is defined as:

$\phi_q$ 6

where $\phi_q$ 7 and $\phi_q$ 8 are learnable parameters.

3.2 Interval Type-2 Fuzzy Sets

Interval Type-2 (IT2) fuzzy sets model additional uncertainty. For each basis, two widths $\phi_q$ 9 define upper and lower membership functions:

$w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 0

The crisp activation is their average. The region between these curves defines the Footprint of Uncertainty (FOU), providing explicit quantification of ambiguity in the fuzzy representation (Yong et al., 3 Feb 2026).

4. Learning, Optimization, and Regularization

The KANFIS training objective combines standard regression or classification loss with regularizers for sparsity and distinctiveness:

$w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 1

All parameters are optimized by backpropagation:

Membership centers $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 2 and widths $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 3 are updated via chain-rule derivatives.
The soft mask $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 4 receives a combined update from the task loss and the entropy regularizer.
Takagi–Sugeno consequent weights $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 5 and bias $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 6 use standard linear updates.

This joint optimization enforces structural properties—sparsity (for feature selection per rule) and rule distinctiveness—alongside convergence on the predictive task.

5. Model Complexity, Scalability, and Interpretability

KANFIS fundamentally alters the curse of dimensionality characteristic of traditional neuro-fuzzy inference. In a conventional ANFIS system with $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 7 fuzzy sets per feature, the required number of rules is $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 8, resulting in $w_j(\mathbf{x}) = \prod_{i=1}^D \mu_{ij}(x_i),$ 9 parameters.

KANFIS instead requires only $\mu_{ij}$ 0 rules, each with $\mu_{ij}$ 1 fuzzy bases, producing $\mu_{ij}$ 2 parameters and rule complexity that scales linearly in $\mu_{ij}$ 3, with $\mu_{ij}$ 4.

Rule semantics are enhanced by mask-enforced sparsity: at convergence, each hidden unit $\mu_{ij}$ 5 corresponds to a rule of the form, “IF $\mu_{ij}$ 6 is in fuzzy set $\mu_{ij}$ 7 for those $\mu_{ij}$ 8 with $\mu_{ij}$ 9, THEN output contribution is $i$ 0.” Thus, rules are concise and human-interpretable, and rule sets are compact and easily examined by domain experts.

6. Empirical Evaluation

Empirical results on five benchmark datasets indicate that both Type-1 and IT2 variants of KANFIS match or outperform baseline multilayer perceptron (MLP), ANFIS, and Kolmogorov–Arnold Network (KAN) models in regression and classification tasks. On the Combined Cycle Power Plant (CCPP) regression dataset:

MLP: RMSE = 4.1883
T1-ANFIS: RMSE = 3.9980
T1-KANFIS: RMSE = 3.9542
IT2-KANFIS: RMSE = 4.1240

For classification datasets including Breast Cancer, Spambase, and Medical Health Records, KANFIS achieves accuracy and F1 scores in the range $i$ 1, generally outperforming both ANFIS and deep neural baselines while retaining a small, interpretable set of fuzzy rules (Yong et al., 3 Feb 2026).

Model	Rule/Param Scaling	Explicit Uncertainty	Interpretable Rules	Empirical RMSE (CCPP)
ANFIS	Exponential ( $i$ 2)	No	No	3.9980
KANFIS (T1)	Linear ( $i$ 3)	No	Yes	3.9542
KANFIS (IT2)	Linear ( $i$ 4)	Yes	Yes	4.1240
MLP	—	No	No	4.1883

The data suggest that KANFIS architecture offers both scalability and interpretability, as well as accurate and uncertainty-aware predictions, by leveraging additive fuzzy rule superposition and explicit rule sparsity controls (Yong et al., 3 Feb 2026).

Markdown Report Issue Upgrade to Chat

References (1)

KANFIS A Neuro-Symbolic Framework for Interpretable and Uncertainty-Aware Learning (2026)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Kolmogorov-Arnold Neuro-Fuzzy Inference System (KANFIS).

Kolmogorov-Arnold Neuro-Fuzzy Inference

1. Mathematical Foundations

2. Architecture and Structural Components

2.1 KANFIS Layer Structure

2.2 Sparse Masking Mechanism

3. Fuzzy Logic and Uncertainty Representation

3.1 Type-1 Fuzzy Sets

3.2 Interval Type-2 Fuzzy Sets

4. Learning, Optimization, and Regularization

5. Model Complexity, Scalability, and Interpretability

6. Empirical Evaluation

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

Kolmogorov-Arnold Neuro-Fuzzy Inference

1. Mathematical Foundations

2. Architecture and Structural Components

2.1 KANFIS Layer Structure

2.2 Sparse Masking Mechanism

3. Fuzzy Logic and Uncertainty Representation

3.1 Type-1 Fuzzy Sets

3.2 Interval Type-2 Fuzzy Sets

4. Learning, Optimization, and Regularization

5. Model Complexity, Scalability, and Interpretability

6. Empirical Evaluation

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research