Papers
Topics
Authors
Recent
Search
2000 character limit reached

EXP: Explainability Score for AI Transparency

Updated 30 June 2026
  • EXP is a metric that quantifies AI explainability by assessing how transparent and interpretable a model's decision-making process is.
  • It integrates machine-centric methods (e.g., impact scores and coverage metrics) with questionnaire and language-model-based techniques to ensure robust evaluation.
  • EXP is used for benchmarking XAI methods, calibrating explainability standards, and correlating with user effectiveness across diverse AI domains.

Explainability Score (EXP) is a quantitative or structured qualitative metric designed to assess the degree to which AI or ML systems render their internal decision-making transparent and understandable to various stakeholders. EXP scores are motivated by the need for rigorous, repeatable evaluation of explainability across diverse AI models, domains, and explanation modalities.

1. Formal Definitions and Variants of EXP

Multiple formulations of EXP have been proposed, each grounded in distinct theoretical and methodological traditions:

A. Machine-Centric Impact Score and Impact Coverage

Lin et al. (Lin et al., 2019) introduce a composite machine-centric EXP metric for image classification systems, constructed from two fundamental measures:

  • Impact Score (II): For input xx and classifier NN, an explainability method MM returns a binary mask cxc \subseteq x identifying “critical factors.” The ablated input x=xcx' = x - c is classified as (y,z)=N(x)(y', z') = N(x'). With threshold τ(0,1)\tau \in (0, 1), II is defined as:

I=1ni=1n[(yiyi)(ziτzi)]I = \frac{1}{n} \sum_{i=1}^n \left[ (y'_i \neq y_i) \vee (z'_i \leq \tau z_i) \right]

where xx0 is the predicted label and xx1 the confidence. xx2 counts only label flips.

  • Impact Coverage (xx3): For adversarial perturbations xx4, measures IoU overlap between xx5 and mask xx6:

xx7

  • Composite EXP: A weighted sum (after normalization),

xx8

with xx9 balancing general and adversarial scenarios.

B. Questionnaire and Factor-Analysis-Based EXP

Chen & Eickhoff (Chen et al., 2023) operationalize EXP for information retrieval via a weighted sum over latent explainability factors derived from a 19-item user questionnaire:

NN0

where NN1 is the set of factors, NN2 and NN3 the respective factor and item loadings, NN4 the user’s response, and values normalized to NN5.

C. Degree of Explainability (DoX/EXP) via Information Pertinence

Sovrano & Vitali (Sovrano et al., 2021) define EXP as the average coverage of archetypal question-aspect pairs by pertinent details extracted from explanation texts, estimated via LLM embeddings and cosine similarity. For details NN6, aspects NN7, question templates NN8, and content NN9:

MM0

where MM1 is the pertinence of detail MM2 to question MM3.

2. Factor Structure and Measurement Dimensions

The explainability captured by EXP is inherently multidimensional. Chen & Eickhoff (Chen et al., 2023) identify six core factors through exploratory factor analysis:

Factor Group Description
Global Interpretability Roadblocks Understanding overall system logic
Local Interpretability Roadblocks Grasping why specific results were returned
Transparency Utility Visibility of internal decision signals
Justification Utility Perceived soundness of provided reasons
Granularity Utility Appropriateness of explanation detaillevel
Sufficiency Utility Adequacy of information to enable actionable insight

Each factor aggregates specific survey items, with empirically determined item and factor weights.

3. Algorithmic Implementations

A. Machine-Centric Procedures (Vision)

  • Identify salient region MM4 per explanation method MM5 for each test input MM6.
  • Ablate MM7 to produce MM8.
  • Record change in predicted label (MM9) and/or drop in confidence by cxc \subseteq x0 (cxc \subseteq x1).
  • For adversarial inputs, compute IoU of cxc \subseteq x2 with patch cxc \subseteq x3 (cxc \subseteq x4).
  • Normalize and aggregate as cxc \subseteq x5.

B. Language-Model-Based EXP (DoXpy Pipeline)

  • Extract subject–predicate–object “details” cxc \subseteq x6 using dependency parsing from explanation text cxc \subseteq x7.
  • For each aspect cxc \subseteq x8 and question archetype cxc \subseteq x9, form x=xcx' = x - c0, embed with a sentence encoder.
  • Compute pertinence x=xcx' = x - c1; filter details by threshold x=xcx' = x - c2 and redundancy x=xcx' = x - c3.
  • Sum pertinence scores to yield per-question, per-aspect, and final EXP as described above.

C. Questionnaire Scoring Pipeline

  • Administer structured questionnaire to users post-system interaction.
  • Calculate weighted sum of responses per factor; aggregate and normalize for final x=xcx' = x - c4.

4. Empirical Validation and Interpretation

A. Model Impact Analysis (Vision)

On ResNet-50/ImageNet, Impact Score x=xcx' = x - c5 quantifies the causal salience of regions picked out by XAI methods:

  • GSInquire: x=xcx' = x - c6 (confidence/label strongly affected by ablating mask)
  • Expected Gradients: x=xcx' = x - c7
  • SHAP: x=xcx' = x - c8
  • LIME: x=xcx' = x - c9

GSInquire identified adversarial patches with (y,z)=N(x)(y', z') = N(x')0 up to (y,z)=N(x)(y', z') = N(x')1–(y,z)=N(x)(y', z') = N(x')2, while LIME was rarely aligned ((y,z)=N(x)(y', z') = N(x')3–(y,z)=N(x)(y', z') = N(x')4).

B. Human Alignment Studies

In retrieval (Chen et al., 2023) and domain tasks (Sovrano et al., 2021), EXP scores calculated post-explanation correlate with human utility and effectiveness:

  • In IR, EXP distinguished systems with and without transparent signal visualizations ((y,z)=N(x)(y', z') = N(x')5 vs (y,z)=N(x)(y', z') = N(x')6).
  • In both finance and healthcare, increase in EXP for more comprehensive XAI corresponded to higher user task effectiveness, with statistical significance (y,z)=N(x)(y', z') = N(x')7.

C. Qualitative Checklist Approaches

Winikoff et al. (Winikoff et al., 14 Feb 2025) provide a structured scoresheet covering source code availability, explanation veracity, global/local explanation features, concepts, and automation, functioning as a multidimensional rubric rather than a scalar score.

5. Scope, Generalization, and Limitations

Strengths

  • Machine-centric EXP: Does not require human annotations or visual inspection; measures model response to perturbation directly (Lin et al., 2019).
  • Factor-based and DoX EXP: Captures breadth and relevance of explanations in terms of archetypal queries and user-elicited dimensions (Chen et al., 2023, Sovrano et al., 2021).
  • Empirical alignment: Correlates with user task success and subjective perception of understanding.

Limitations

  • Dependence on perturbation operator: Machine-centric (y,z)=N(x)(y', z') = N(x')8 is sensitive to how regions are ablated; other masking types may alter results (Lin et al., 2019).
  • Thresholds and normalization: Choices such as (y,z)=N(x)(y', z') = N(x')9 or item/factor weighting affect sensitivity and comparability.
  • Domain specificity: Vision-centric metrics may not transfer directly to NLP or recommendation without redefining "deletion" or "mask."
  • Coverage vs. faithfulness: EXP quantifies the presence and informativeness of explanations but does not guarantee their correctness or faithfulness to model internals (Sovrano et al., 2021).
  • Scoresheet limitations: The qualitative approach (Winikoff et al., 14 Feb 2025) ensures broad coverage but lacks aggregation or comparability unless a custom scoring rubric is imposed.

6. Practical Applications and Extensions

  • Benchmarking XAI algorithms: Machine-centric and DoX-based EXP can be used to empirically compare fidelity and informativeness across explanation algorithms (e.g., LIME, SHAP, GSInquire, TreeSHAP) (Lin et al., 2019, Sovrano et al., 2021).
  • Calibration of explainability requirements: Scoresheets serve to map stakeholder needs onto system features and support traceable, standardized evaluation (Winikoff et al., 14 Feb 2025).
  • Extension to new domains: EXP can be adapted to recommender systems, QA, and others by recalibrating questions, aspects, and item/factor structure (Chen et al., 2023).
  • Integration with fidelity metrics: Combining user-centric and machine-centric scores may yield composite measures of both explainability and faithfulness (Chen et al., 2023).

7. Future Directions and Open Issues

Potential advancements include:

  • Dynamic and human-calibrated weighting: Learning weighting parameters (e.g., τ(0,1)\tau \in (0, 1)0 in machine-centric EXP or factor weights in SSE) from empirical user or task data (Lin et al., 2019, Chen et al., 2023).
  • Perturbation robustness: Employing smoother or domain-appropriate perturbations to better assess actual criticality of identified features.
  • Multi-modal, multi-level explainability: Extending algorithms and rubrics to support sequence models, structured data, and mixed modalities.
  • Faithfulness–explainability tradeoffs: Investigating relationships between scores measuring coverage and informativeness (EXP) and those quantifying faithfulness to underlying decision logic.

Explainability Score (EXP), therefore, represents a family of metrics and frameworks—quantitative, factor-based, or rubric-driven—that enable standardized, reproducible, and context-sensitive evaluation of AI system explainability, each with distinct methodological foundations and domains of applicability (Lin et al., 2019, Chen et al., 2023, Sovrano et al., 2021, Winikoff et al., 14 Feb 2025).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Explainability Score (EXP).