Papers
Topics
Authors
Recent
Search
2000 character limit reached

X-ANFIS: Explainability in Neuro-Fuzzy Systems

Updated 14 April 2026
  • X-ANFIS is a neuro-fuzzy framework that integrates explicit explainability constraints to yield linguistically interpretable rules alongside high predictive performance.
  • It employs a differentiable objective based on membership function distinguishability, ensuring well-separated, non-overlapping fuzzy partitions.
  • An alternating bi-objective gradient optimization recovers non-convex trade-offs between accuracy and interpretability across diverse datasets.

Explainability constraints in adaptive neuro-fuzzy inference systems (X-ANFIS) refer to the explicit incorporation of interpretability as an integral objective in the end-to-end learning process, aligning system parameters to facilitate semantically meaningful, human-comprehensible fuzzy rules without sacrificing predictive performance. The X-ANFIS methodology defines a concrete, differentiable explainability objective—distinguishability of membership functions—and utilizes alternating bi-objective gradient optimization. This framework enables the recovery of accurate and linguistically interpretable rule bases in otherwise non-convex segments of the accuracy–explainability Pareto front (Khaled et al., 22 Feb 2026).

1. Accuracy–Explainability Trade-Off in Neuro-Fuzzy Systems

The foundational challenge in neuro-fuzzy models arises from the tension between transparency (linguistic clarity of rules) and statistical accuracy. Fuzzy systems achieve intuitive explainability via rule-based architectures and linguistic variables, imparting their outputs with human-understandable semantics. However, enforcing interpretability through strong constraints (few rules, well-separated membership functions) often comes at a direct cost to accuracy, as models may lose expressiveness or coverage. Conversely, unconstrained, high-capacity ANFIS models optimized solely for loss can fine-tune thousands of parameters—yielding highly overlapping, semantically ambiguous fuzzy sets where linguistic rule interpretation is compromised.

This dilemma, frequently described as the "accuracy–explainability trade-off," motivates the development of explicit explainability constraints as part of the training process, rather than relying on post-hoc pruning or computationally intensive evolutionary multi-objective optimization (Khaled et al., 22 Feb 2026).

2. X-ANFIS Model Structure and Semantic Initialization

X-ANFIS operates on a standard zero-order Takagi–Sugeno ANFIS framework with RR fuzzy rules and mm-dimensional real-valued input vectors x=(x1,,xm)x=(x_1,\dots,x_m). The architecture comprises five layers:

  1. Input layer: Passes xix_i directly.
  2. Membership layer: Evaluates fuzzy membership degrees using Cauchy membership functions,

μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}

where cijc_{ij} and γij\gamma_{ij} are the center and scale for the jjth fuzzy set on the iith input.

  1. Rule layer: Computes rule firing strengths,

wj(x;θ)=i=1mμij(xi;cij,γij)w_j(x; \theta) = \prod_{i=1}^m \mu_{ij}(x_i; c_{ij}, \gamma_{ij})

  1. Normalization layer: Normalizes firing strengths,

mm0

  1. Output layer: Weighted sum to obtain the final prediction,

mm1

where mm2 are zero-order consequent parameters.

Semantic initialization utilizes fuzzy c-means (FCM) clustering independently on each feature to obtain mm3 cluster centroids and spreads, yielding antecedent parameters mm4 that uniformly partition the input space in mm5. This enhances rule interpretability and provides a stable starting point for gradient-based learning (Khaled et al., 22 Feb 2026).

3. Differentiable Explainability Objective: Distinguishability

A core innovation of X-ANFIS is the operationalization of distinguishability as a differentiable constraint during training. For each feature mm6 and each adjacent pair of fuzzy sets mm7 (those with consecutive ordered FCM centers), define the pairwise distance:

mm8

A target distinguishability mm9 (typically x=(x1,,xm)x=(x_1,\dots,x_m)0) is specified to maximize semantic separation without loss of coverage.

The explainability loss is a quadratic penalty, summed over all features and their adjacent fuzzy sets:

x=(x1,,xm)x=(x_1,\dots,x_m)1

where x=(x1,,xm)x=(x_1,\dots,x_m)2 denotes adjacent index pairs for feature x=(x1,,xm)x=(x_1,\dots,x_m)3. Gradients of x=(x1,,xm)x=(x_1,\dots,x_m)4 with respect to centers x=(x1,,xm)x=(x_1,\dots,x_m)5 are used to adjust membership function placement, holding x=(x1,,xm)x=(x_1,\dots,x_m)6 fixed in the X-pass to prevent degenerate solutions (e.g., reducing widths to artificially inflate x=(x1,,xm)x=(x_1,\dots,x_m)7) (Khaled et al., 22 Feb 2026).

4. Alternating Bi-Objective Gradient Optimization

To address the inability of scalarized objectives (e.g., x=(x1,,xm)x=(x_1,\dots,x_m)8) to recover non-convex Pareto-optimal trade-offs, X-ANFIS employs an alternating bi-objective gradient scheme:

  • Performance Pass (Backward Pass):
    • Minimize mean squared error (MSE) of outputs.
    • Consequent parameters x=(x1,,xm)x=(x_1,\dots,x_m)9 are solved in closed form (regularized least squares).
    • Membership parameters xix_i0 are updated via gradients. Cauchy function derivatives are

    xix_i1

  • Explainability Pass (X-pass):

    • Temporarily hold xix_i2 and consequents fixed.
    • Update centers xix_i3 by gradient descent on xix_i4.

Pseudocode per epoch:

  1. Forward pass to compute all outputs.
  2. Backward (performance) pass: update xix_i5 and solve xix_i6.
  3. X-pass: update only xix_i7 minimizing xix_i8.
  4. Early stopping based on validation MSE.

This alternation enables X-ANFIS to converge to solutions with both competitive accuracy and interpretable, well-separated fuzzy sets, including points outside the convex hull attainable by scalarization or traditional MOO (Khaled et al., 22 Feb 2026).

5. Empirical Validation and Comparative Analysis

Extensive experimentation was performed on nine UCI regression datasets in approximately 5,000 configurations. Key outcomes:

  • Initialization stability: Cauchy ANFIS maintains both accuracy (xix_i9) and distinguishability (μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}0) across a wide range of μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}1, while Gaussian ANFIS collapses for small initial spreads.
  • Pareto front recovery: Weighted-scalarization MO-ANFIS yields solutions on the convex hull of the accuracy–explainability graph; X-ANFIS recovers non-convex trade-off points with μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}2 and μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}3 within μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}4–μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}5 of pure ANFIS in 8 out of 9 datasets using μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}6 rules.
  • Partition visualization: X-ANFIS achieves uniform, evenly-spaced partitions, in contrast to MO-ANFIS, which clusters partitions around dense areas and leaves gaps (violating coverage).
  • Semantic plane analysis: Only X-ANFIS consistently yields membership function plots that satisfy consistency, completeness, and desired distinguishability.

A summary table of design components is provided below.

Component Method/Constraint Role in Explainability
Membership Function Cauchy; semantically spread via FCM Stability, well-partitioned, interpretable
Explainability Loss μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}7 on μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}8 separation Enforces distinguishable, non-overlapping sets
Optimization Scheme Alternating performance and X-passes Recovers non-convex trade-offs

6. Computational Considerations, Limitations, and Extensions

Compared to evolutionary MOO approaches, X-ANFIS is μij(xi;cij,γij)=11+(xicijγij)2\mu_{ij}(x_i; c_{ij}, \gamma_{ij}) = \frac{1}{1 + \left(\frac{x_i - c_{ij}}{\gamma_{ij}}\right)^2}9–cijc_{ij}0 faster as it avoids population-based search, relying on efficient gradient updates. Scalarization methods lack coverage over non-convex regions, while the alternating approach improves search diversity. As cijc_{ij}1 increases (cijc_{ij}2), the fixed universe cijc_{ij}3 imposes a tension: to maintain cijc_{ij}4, coverage begins to conflict with distinguishability. This suggests that joint optimization over rule count (cijc_{ij}5) and distinguishability (cijc_{ij}6) would enhance model efficacy. Future extensions include adapting to Mamdani-type inference, incorporating structure learning for adaptive rule cardinality, and formal convergence analysis of the alternating scheme (Khaled et al., 22 Feb 2026).

7. Relationship to Neural Explainability-Constraint Paradigms

X-ANFIS is part of a broader class of explainability-constraint frameworks, including xNN—explainable neural networks—that enforce interpretability through explicit architecture constraints such as sparsity, orthogonality, and smoothness (Yang et al., 2019). While xNN achieves interpretability via additive sparse subnetworks, projection pursuit with orthogonality, and roughness penalization, X-ANFIS instantiates explainability within fuzzy rule induction and partition integrity. Both approaches emphasize constraints imposed during training—rather than post-hoc analysis or black-box explanation—to produce inherently transparent, trustworthy systems.

In summary, explainability constraints in X-ANFIS operationalize semantic interpretability as a mathematically well-posed training objective. Alternating between accuracy and distinguishability optimizations, combined with stable Cauchy membership functions and semantic initialization, yields adaptive neuro-fuzzy systems that navigate the accuracy–explainability trade-off and populate otherwise inaccessible regions of the Pareto frontier.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Explainability Constraints (X-ANFIS).