Abductive explanations of classifiers under constraints: Complexity and properties (2409.12154v1)

Published 18 Sep 2024 in cs.AI

Abstract: Abductive explanations (AXp's) are widely used for understanding decisions of classifiers. Existing definitions are suitable when features are independent. However, we show that ignoring constraints when they exist between features may lead to an explosion in the number of redundant or superfluous AXp's. We propose three new types of explanations that take into account constraints and that can be generated from the whole feature space or from a sample (such as a dataset). They are based on a key notion of coverage of an explanation, the set of instances it explains. We show that coverage is powerful enough to discard redundant and superfluous AXp's. For each type, we analyse the complexity of finding an explanation and investigate its formal properties. The final result is a catalogue of different forms of AXp's with different complexities and different formal guarantees.

Authors (2)
  1. Martin Cooper (3 papers)
  2. Leila Amgoud (7 papers)
Citations (5)

Summary

Abductive Explanations of Classifiers under Constraints: Complexity and Properties

The paper "Abductive Explanations of Classifiers under Constraints: Complexity and Properties" addresses a critical aspect of machine learning explainability, particularly the limitations of current abductive explanations (AXp's) when feature constraints are present. Existing AXp frameworks implicitly assume feature independence, which can lead to redundant or superfluous explanations when constraints (both integrity constraints and dependency constraints) exist.

Overview and Contributions

1. Novel Abductive Explanation Types:

The paper introduces three new types of explanations—coverage-based PI-explanation (CPI-Xp), minimal CPI-explanation (mCPI-Xp), and preferred CPI-explanation (pCPI-Xp)—to address the limitations of existing AXp frameworks in constrained feature spaces. These new types leverage the notion of coverage, which is the set of instances a particular explanation can account for.

  • CPI-Xp: Coverage-based explanations rank candidate explanations by their coverage, favouring those that account for more admissible instances and thereby handling dependencies among features (a concrete coverage computation is sketched right after this list).
  • mCPI-Xp: This type adds subset-minimality, discarding any feature whose value is irrelevant to the explanation.
  • pCPI-Xp: Preferred explanations further refine the minimal ones by retaining only non-equivalent explanations, ensuring uniqueness with respect to coverage.
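
To make the coverage notion concrete, here is a minimal Python sketch (not from the paper) using a hypothetical toy classifier and a single integrity constraint over three binary features. It enumerates the admissible (constraint-satisfying) part of the feature space and computes the coverage of a candidate explanation: the admissible instances that agree with the explanation's fixed feature values and receive the target class.

```python
from itertools import product

# Hypothetical toy setting: three binary features x0, x1, x2.
DOMAIN, N_FEATURES = (0, 1), 3

def classifier(x):
    """Toy classifier (illustration only): positive iff x0 = 1 and x1 = 1."""
    return int(x[0] == 1 and x[1] == 1)

def constraint(x):
    """Example integrity constraint: x2 may be 1 only when x1 is 1."""
    return not (x[2] == 1 and x[1] == 0)

def admissible_instances():
    """All instances of the feature space that satisfy the constraint."""
    return [x for x in product(DOMAIN, repeat=N_FEATURES) if constraint(x)]

def coverage(explanation, target_class):
    """Coverage of an explanation: the admissible instances that agree with its
    fixed feature values and are classified as target_class.
    `explanation` maps feature indices to the values they are fixed to."""
    return [x for x in admissible_instances()
            if all(x[i] == v for i, v in explanation.items())
            and classifier(x) == target_class]

if __name__ == "__main__":
    # Explaining the positive classification of (1, 1, 0) by fixing x0 = 1, x1 = 1.
    print(coverage({0: 1, 1: 1}, target_class=1))  # [(1, 1, 0), (1, 1, 1)]
```

Restricting attention to admissible instances is what distinguishes the constrained definitions: completions that violate the constraints are simply never counted towards an explanation's coverage.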

2. Complexity Analysis:

The paper explores the computational complexity associated with generating different types of explanations. It finds that while incorporating constraints leads to better explanations, it tends to increase computational complexity significantly. For instance:

  • Testing whether a set of features is a weak AXp$^c$ is co-NP-complete, whereas testing whether it is a CPI-Xp is $\Pi_2^{\rm P}$-complete (a naive version of the underlying check is sketched after this list).
  • Finding one CPI-Xp may require multiple calls to a $\Sigma_2^{\rm P}$ oracle. However, approximating over a dataset yields tractable solutions, reducing the complexity significantly.
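
As a rough illustration of where this cost comes from, the sketch below (same hypothetical toy classifier and constraint as above; not the paper's algorithm) implements the naive universal check behind the weak-AXp test: every admissible completion of the explanation's fixed features must receive the same class. The enumeration is exponential in the number of free features, which is the practical face of the co-NP lower bound; the coverage-based tests add a further layer of quantification, pushing them higher in the polynomial hierarchy.

```python
from itertools import product

# Same hypothetical toy setup as in the coverage sketch above.
DOMAIN, N_FEATURES = (0, 1), 3

def classifier(x):
    return int(x[0] == 1 and x[1] == 1)   # toy classifier, illustration only

def constraint(x):
    return not (x[2] == 1 and x[1] == 0)  # integrity constraint: x2 = 1 requires x1 = 1

def is_weak_axp(explanation, target_class):
    """Universal check: every constraint-satisfying instance that agrees with
    the fixed features of `explanation` must be classified as target_class.
    The enumeration is exponential in the number of free features, mirroring
    the co-NP-completeness of this test over the full feature space."""
    for x in product(DOMAIN, repeat=N_FEATURES):
        if not constraint(x):
            continue  # instance excluded by the integrity constraint
        if all(x[i] == v for i, v in explanation.items()) and classifier(x) != target_class:
            return False  # counterexample completion found
    return True

if __name__ == "__main__":
    print(is_weak_axp({0: 1, 1: 1}, target_class=1))  # True
    print(is_weak_axp({0: 1}, target_class=1))        # False: (1, 0, 0) gets class 0
```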

3. Sample-based Explanations:

The authors propose generating explanations based on a sample dataset, circumventing exhaustive search over the entire feature space. Sample-based explanations (denoted by a "d-" prefix such as d-CPI-Xp) have drastically reduced complexities as they avoid the comprehensive checking required by full feature-space analyses.
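
The sketch below (hypothetical dataset and helper names, not the paper's exact procedure) illustrates the sample-based idea with a greedy pass: starting from the full instance, each feature is freed in turn and left free only if no dataset instance with a different class then matches the remaining fixed features. Every check runs over the finite dataset, so the computation is polynomial in the dataset size rather than exponential in the number of features.

```python
def d_minimal_explanation(instance, label, dataset):
    """Greedy subset-minimal dataset-based explanation (illustrative sketch).
    dataset: list of (x, y) pairs; instance: tuple of feature values.
    Returns a dict {feature index: value} such that every dataset instance
    agreeing on these features has class `label`, and no fixed feature can be
    dropped without breaking that property."""
    fixed = dict(enumerate(instance))          # start by fixing all features
    for i in list(fixed):
        candidate = {j: v for j, v in fixed.items() if j != i}
        consistent = all(
            y == label
            for x, y in dataset
            if all(x[j] == v for j, v in candidate.items())
        )
        if consistent:
            fixed = candidate                  # feature i is redundant; free it
    return fixed

if __name__ == "__main__":
    # Tiny hypothetical dataset over three binary features.
    data = [((1, 1, 0), 1), ((1, 1, 1), 1), ((1, 0, 0), 0), ((0, 1, 1), 0)]
    print(d_minimal_explanation((1, 1, 0), 1, data))  # {1: 1, 2: 0} for this elimination order
```

Different elimination orders can produce different subset-minimal explanations; choosing among them, for instance by comparing their coverage on the dataset, is roughly where the preferred, coverage-aware variants come in.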

4. Formal Comparison and Properties:

The paper assesses the explanation types against several properties: Success, Non-Triviality, Irreducibility, Coherence, Consistency, Independence, and Non-Equivalence. Preferred coverage-based explanations (pCPI-Xp) satisfy the most properties, indicating their robustness in producing unique, independent, and consistent explanations.

Implications and Future Directions

The implications of this research are twofold. From a practical standpoint, the ability to provide more precise and less redundant explanations in constrained feature spaces enhances model transparency and user trust. On the theoretical side, the work clarifies the role constraints play in deriving meaningful explanations and charts the computational cost of producing them.

Future research could extend to characterizing the family of explainers satisfying an optimized subset of the desirable properties and adapting these definitions to on-the-fly explainability in real-time AI systems. Additionally, learning the constraints from data or optimizing the derivation of global explanations could prove beneficial.

In summary, the paper advances the field of explainability by proposing new ways to generate meaningful explanations when features are interdependent or constrained, together with a precise analysis of the computational complexity of producing them.
