Human-Centered Explainable AI
- Human-Centered Explainable AI is a multidisciplinary paradigm that blends algorithmic transparency with human-centered design to align explanations with user needs and mental models.
- It employs diverse interpretability methods—from white-box models to post-hoc feature-attribution and counterfactual techniques—to balance fidelity, clarity, and usability.
- The framework emphasizes iterative user research, ethical standards, and adaptive interfaces to foster calibrated trust, effective decision-making, and compliance across domains.
Human-centered Explainable AI (XAI) is a multidisciplinary paradigm that integrates algorithmic transparency with rigorous human-computer interaction (HCI) and user experience (UX) design to align machine-generated explanations with the needs, mental models, and workflows of diverse stakeholders. Unlike system-centric notions of XAI—centered primarily on algorithmic faithfulness or model introspection—human-centered XAI (HC-XAI) studies how explanations are perceived, understood, and acted upon by real users, with a focus on fostering accurate understanding, calibrated trust, appropriate reliance, and ethical adoption across domains as varied as healthcare, finance, software engineering, and consumer applications (Liao et al., 2021, Ma, 28 Oct 2024, Silva et al., 14 Apr 2025).
1. Foundations: Definitions, Motivation, and Stakeholder Needs
HC-XAI reframes transparency as a property fundamentally relative to the human explainee. The objective is not to maximize technical interpretability in the abstract, but to answer: “What explanation does a given person or stakeholder need to form an accurate mental model, calibrate trust, and make good decisions?” (Ma, 28 Oct 2024). This motivation rests on three pillars:
- Mental Models: Users construct internal theories about system operation; misalignment with actual model behavior can lead to inappropriate trust or misuse.
- Trust Calibration: Explanations should enable users to distinguish when to trust, distrust, override, or supplement AI outputs, rather than simply increasing general trust.
- Socio-Legal & Ethical Imperatives: Laws (e.g., GDPR) and normative principles of fairness, accountability, and safety demand that systems provide “meaningful information about the logic involved,” tailored to the impacted parties.
Stakeholders in HC-XAI are heterogeneous:
- AI experts and developers: model debugging, inspection, optimization.
- Domain experts: actionable, example-based narratives, feature-level diagnostics.
- Non-expert end-users: high-level rationales, simple "why" and "how" answers, presentation that mitigates bias.
- Regulators, business users, impacted individuals: evidence for compliance, fairness audits, recourse procedures, avenues for appeal (Ma, 28 Oct 2024).
2. Technical Landscape: Algorithmic Methods and Trade-offs
XAI algorithms are classified along axes of form, scope, and fidelity–interpretability trade-off (Liao et al., 2021).
Directly interpretable (white-box) models
- Linear/logistic regression, shallow decision trees, rule sets, generalized additive models (GA²Ms): inherently transparent, but often less accurate on complex tasks.
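To make the white-box case concrete, the sketch below (a minimal illustration assuming scikit-learn; the dataset and depth limit are arbitrary choices, not from the cited work) fits a shallow decision tree and prints its complete rule set, so the entire decision logic is auditable at the cost of model capacity.

```python
# Minimal sketch of a directly interpretable (white-box) model: a depth-limited
# decision tree whose complete decision logic can be printed and audited.
# Dataset and hyperparameters are illustrative only.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Shallow tree: limited capacity, but every prediction is traceable to a rule path.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

print(f"held-out accuracy: {tree.score(X_test, y_test):.3f}")
print(export_text(tree, feature_names=list(X.columns)))  # the full, auditable rule set
```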
Post-hoc methods for black-box models
- Feature-attribution (e.g., SHAP, LIME, Integrated Gradients): assign quantitative importance scores to input features for a specific prediction:
- SHAP: the Shapley value of feature $i$ for a model $f$ over feature set $F$ is $\phi_i = \sum_{S \subseteq F \setminus \{i\}} \frac{|S|!\,(|F|-|S|-1)!}{|F|!}\,\bigl[f_x(S \cup \{i\}) - f_x(S)\bigr]$, where $f_x(S)$ denotes the model's expected output when only the features in $S$ are known (a minimal computation sketch follows this list).
- Integrated Gradients: $\mathrm{IG}_i(x) = (x_i - x'_i)\int_0^1 \frac{\partial f\bigl(x' + \alpha\,(x - x')\bigr)}{\partial x_i}\, d\alpha$, where $x'$ is a baseline input.
- Example-based (prototypes, criticisms, influence functions): retrieve representative or contrasting instances, quantify their effect on model outputs.
- Rule-extraction (surrogate models, compact tree induction): globally approximate a black-box model with simpler, auditable logic.
- Counterfactual and contrastive methods: enumerate minimal input perturbations that flip outputs; answer “why not y'?” and “how to become y'?” (Liao et al., 2021, Ma, 28 Oct 2024).
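To ground the Shapley formula above, the following minimal sketch enumerates all feature coalitions to compute exact Shapley values for a model with only a few features, using mean imputation from background data as a simple value function; production libraries such as SHAP rely on model-specific or sampling-based approximations instead.

```python
# Minimal sketch: exact Shapley values by enumerating all feature coalitions.
# Practical only for a handful of features; it exists here to make the formula
# concrete, not to replace optimized libraries.
from itertools import combinations
from math import factorial
import numpy as np

def shapley_values(predict, x, background):
    """Exact Shapley values for one instance `x`.

    `predict`   : callable mapping an (n, d) array to (n,) outputs
    `x`         : (d,) instance to explain
    `background`: (m, d) reference data; features outside a coalition S are
                  replaced by their background mean (one simple value function).
    """
    d = x.shape[0]
    baseline = background.mean(axis=0)

    def value(subset):
        z = baseline.copy()
        idx = list(subset)
        z[idx] = x[idx]
        return float(predict(z[None, :])[0])

    phi = np.zeros(d)
    for i in range(d):
        others = [j for j in range(d) if j != i]
        for k in range(len(others) + 1):
            for S in combinations(others, k):
                weight = factorial(len(S)) * factorial(d - len(S) - 1) / factorial(d)
                phi[i] += weight * (value(S + (i,)) - value(S))
    return phi  # phi.sum() ≈ predict(x) - predict(baseline), the efficiency property
```

With this mean-imputation value function, a linear model $f(x) = w^\top x + b$ yields the familiar attributions $\phi_i = w_i\,(x_i - \bar{x}_i)$, which is a useful sanity check.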
Key trade-offs
- Fidelity vs. interpretability: e.g., a sparse, simple surrogate may not faithfully capture the underlying model's logic (see the surrogate sketch after this list).
- Local vs. global: instance-based explanations have high local fidelity but limited coverage; global surrogates are broadly informative yet can obscure local quirks.
- Computation vs. clarity: exact Shapley value computation is intractable for models with many features; approximation methods risk instability (Liao et al., 2021).
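The fidelity/interpretability tension can be quantified directly. The sketch below (assuming scikit-learn; dataset and hyperparameters are illustrative) trains a black-box gradient-boosting model, distills it into a shallow decision-tree surrogate, and reports fidelity as the surrogate's agreement with the black box on held-out data.

```python
# Minimal sketch: global surrogate extraction and its fidelity.
# Fidelity = how often the simple surrogate reproduces the black box's
# predictions; interpretability comes at the cost of imperfect agreement.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=4000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

black_box = GradientBoostingClassifier(random_state=0).fit(X_train, y_train)

# Train the surrogate on the black box's labels, not on the ground truth.
surrogate = DecisionTreeClassifier(max_depth=4, random_state=0)
surrogate.fit(X_train, black_box.predict(X_train))

fidelity = accuracy_score(black_box.predict(X_test), surrogate.predict(X_test))
accuracy = accuracy_score(y_test, surrogate.predict(X_test))
print(f"surrogate fidelity to black box: {fidelity:.3f}")
print(f"surrogate accuracy on ground truth: {accuracy:.3f}")
```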
3. Human-Centered Design: From User Research to Explanation Interface
User needs and question-driven design
- Coarse mappings from stakeholder roles to explanation needs are insufficient; formative user research (contextual interviews, participatory design, task analysis) elicits the full spectrum of "why", "why not", "what-if", and "how to" questions that explanations must satisfy (Ma, 28 Oct 2024). The XAI Question Bank (∼50 prototypical user questions) guides the selection and mapping of explanation types to user tasks (Liao et al., 2021); a sketch of such a question-to-method mapping follows.
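As one deliberately simplified, hypothetical way to operationalize question-driven design, the sketch below routes the main question types to candidate explanation methods; the concrete mapping should be refined through the user research described above rather than taken as given.

```python
# Hypothetical sketch: routing user question types to candidate XAI methods,
# in the spirit of question-driven design. The mapping is illustrative only.
QUESTION_TO_METHODS = {
    "why":          ["feature attribution (SHAP/LIME)", "example-based (prototypes)"],
    "why not":      ["contrastive explanation", "counterfactual examples"],
    "what if":      ["interactive what-if probing", "partial dependence"],
    "how to":       ["counterfactual recourse", "rule-based summaries"],
    "how (global)": ["global surrogate rules", "directly interpretable model"],
    "performance":  ["uncertainty estimates", "evaluation reports"],
}

def suggest_methods(question_type: str) -> list[str]:
    """Return candidate explanation methods for a user's question type."""
    return QUESTION_TO_METHODS.get(question_type.lower(), ["clarify the user need first"])

print(suggest_methods("why not"))
```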
Explanation interface design dimensions (Ma, 28 Oct 2024):
- Presence and type: with/without, feature attribution vs. counterfactual vs. example-based.
- Interactivity: static vs. on-demand "what-if" manipulations (see the what-if sketch after this list).
- Complexity/granularity and progressive disclosure: number of features, layering from global summary to local detail.
- Modality: textual, visual, hybrid.
- Adaptability: context- and expertise-tailored explanations.
- Bias mitigation/workflow integration: e.g., forced pre-prediction (eliciting the user's own judgment before revealing the AI's output), anchoring-aware presentation.
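As a sketch of the interactivity dimension, the function below (model-agnostic; names and the probability-valued `predict_proba` callable are assumptions) supports on-demand what-if probing by sweeping one feature of a given instance across a range of values and reporting how the prediction changes, e.g., behind a slider widget.

```python
# Minimal sketch of on-demand "what-if" interactivity: vary one feature of a
# single instance and observe how the model's predicted probability responds.
import numpy as np

def what_if(predict_proba, instance, feature_index, values):
    """Return (value, probability) pairs for a one-feature sweep.

    `predict_proba` : callable mapping an (n, d) array to (n,) probabilities
    `instance`      : (d,) array, the case the user is inspecting
    `feature_index` : which feature the user manipulates (e.g., via a slider)
    `values`        : sequence of candidate values for that feature
    """
    probes = np.tile(instance, (len(values), 1))
    probes[:, feature_index] = values
    return list(zip(values, predict_proba(probes)))

# Hypothetical usage (model and feature names are illustrative):
# for income, p in what_if(model_proba, applicant, feature_index=2,
#                          values=np.linspace(20_000, 120_000, 6)):
#     print(f"income={income:,.0f} -> approval probability {p:.2f}")
```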
Visual and interaction design principles
- Present explanations in familiar units and everyday language; minimize cognitive load through clear visual and textual encodings (Shajalal et al., 23 Apr 2024). A plain-language translation sketch follows this list.
- Support hierarchical, exploratory navigation, contrastive and counterfactual queries. Rich interfaces (filtering, sorting, sliders) enable users to interactively probe model logic (Nguyen et al., 21 Mar 2024).
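One way to present attributions in familiar units and everyday language, per the principles above, is to render them as short domain-phrased sentences. The sketch below is a hypothetical template-based translator for a smart-home energy scenario; the feature names, phrasings, and units are assumptions, not taken from the cited studies.

```python
# Hypothetical sketch: turning feature attributions into plain-language,
# domain-phrased sentences instead of raw bar charts or saliency maps.
FEATURE_PHRASES = {  # assumed feature names and everyday phrasings
    "evening_usage_kwh": "your electricity use between 6pm and 10pm",
    "heating_setpoint_c": "your heating set-point",
    "occupancy_hours": "the number of hours someone was home",
}

def explain_in_plain_language(attributions: dict[str, float],
                              unit: str = "euros on the predicted monthly bill",
                              top_k: int = 2) -> list[str]:
    """Render the top-k attributions as everyday sentences with familiar units."""
    ranked = sorted(attributions.items(), key=lambda kv: abs(kv[1]), reverse=True)
    sentences = []
    for name, value in ranked[:top_k]:
        phrase = FEATURE_PHRASES.get(name, name.replace("_", " "))
        direction = "added about" if value >= 0 else "saved about"
        sentences.append(f"{phrase.capitalize()} {direction} {abs(value):.0f} {unit}.")
    return sentences

print(explain_in_plain_language(
    {"evening_usage_kwh": 12.4, "heating_setpoint_c": -4.1, "occupancy_hours": 1.3}))
```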
Prototyping and iterative evaluation
- Early and frequent prototyping (e.g., wizard-of-Oz, mockups), technology probes in context, and heuristic evaluation across HCI, domain, and UX experts are recommended (Shajalal et al., 23 Apr 2024, Ma, 28 Oct 2024).
4. Evaluation Methodology: Metrics, Experimental Design, and Human Studies
Empirical evaluation in HC-XAI targets multiple outcome dimensions (Ma, 28 Oct 2024, Rong et al., 2022, Mangold et al., 14 Oct 2025):
- Understanding: subjective (self-reported transparency, Likert-scale ratings), objective (forward simulation: can the user predict the model's output on new inputs?), counterfactual simulation.
- Trust and reliance: agreement rate, trust calibration via over-/under-/appropriate-reliance, behavioral proxies (switch rates).
- Cognitive load: NASA-TLX, time-to-decision.
- Satisfaction and usability: SUS, explanation satisfaction scales.
- Human–AI collaboration: team accuracy, task success, efficiency (accuracy/time).
- Fairness and bias: statistical parity, group-wise metrics.
- Appropriate reliance: relative AI reliance (RAIR) and relative self-reliance (RSR), capturing whether users adopt correct AI recommendations and retain their own correct judgments when the AI errs (Rogha, 2023, Morrison et al., 2023); a computation sketch follows this list.
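The sketch below illustrates one common operationalization of these reliance metrics, computed from logged initial decisions, AI advice, final decisions, and ground truth; the exact definitions of RAIR and RSR vary somewhat across studies, so treat this as an assumed, representative formulation.

```python
# Minimal sketch of appropriate-reliance metrics from a human-AI study log.
# One common operationalization; definitions may differ across papers.
import numpy as np

def reliance_metrics(initial, advice, final, truth):
    """All inputs are equal-length label arrays (one entry per decision)."""
    initial, advice, final, truth = map(np.asarray, (initial, advice, final, truth))

    agreement = np.mean(final == advice)                  # raw agreement with the AI
    switched = (initial != advice) & (final == advice)    # user adopted the AI's advice

    # RAIR: of the cases where the AI was right and the user started out wrong,
    # how often did the user switch to the (correct) AI recommendation?
    ai_right_user_wrong = (advice == truth) & (initial != truth)
    rair = switched[ai_right_user_wrong].mean() if ai_right_user_wrong.any() else np.nan

    # RSR: of the cases where the AI was wrong and the user started out right,
    # how often did the user keep their own (correct) initial decision?
    ai_wrong_user_right = (advice != truth) & (initial == truth)
    kept_own = final == initial
    rsr = kept_own[ai_wrong_user_right].mean() if ai_wrong_user_right.any() else np.nan

    return {"agreement": agreement, "switch_rate": switched.mean(),
            "RAIR": rair, "RSR": rsr}
```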
Experimental design patterns
- Between- or within-subjects designs; control conditions with no explanation and with placebo/random explanations (Rong et al., 2022).
- Testbeds spanning multiple data modalities and model classes (tabular, text, vision, sequential, reinforcement learning) support comprehensive evaluation.
- Metrics must be validated, reliably scored, and triangulated via behavioral data, self-report, and proxy tasks.
Pitfalls and human factors
- High information density/excessive transparency can degrade performance and trust; progressive disclosure is superior (Ma, 28 Oct 2024).
- Cognitive biases—anchoring, automation bias, illusory understanding, reliance on placebic explanations—require interface safeguards (Liao et al., 2021).
- Lay users often misinterpret technical visualizations (saliency, SHAP), highlighting the need for domain-adapted formats (Shajalal et al., 23 Apr 2024).
5. Theories, Frameworks, and Emerging Architectures in HC-XAI
Cognitive and social foundations
- Miller's four properties of human explanation (contrastiveness, selectivity, social transfer, and a preference for causal over purely statistical justification) motivate counterfactual, selective, and dialogic explanations (Liao et al., 2021).
- Malle's dual-process account distinguishes information processing (what content an explanation conveys) from impression management (how it is presented), guiding interface phrasing and modality choice.
Theory-driven frameworks
- Wang et al.: map explanation goals, reasoning processes (inductive, counterfactual), and causal types to interface elements, identifying method–need gaps (Liao et al., 2021, Ma, 28 Oct 2024).
- Social transparency frameworks: expose "4W" of user–AI interactions (What, Who, Why, When) for collective sense-making (Liao et al., 2021).
Multi-layered and holistic system architectures
- Three-layered model: 1) an XAI foundation (directly interpretable or post-hoc explainable models), 2) a human-centered explanation layer (cognitive-load adaptation, expertise-aware complexity tuning), and 3) dynamic feedback (real-time user input, continuous parameter and model refinement). In field deployments, this architecture quantitatively improved decision accuracy, trust, interpretability, and regulatory compliance (Silva et al., 14 Apr 2025); a structural sketch follows this list.
- Holistic XAI frameworks (HXAI) embed explainability across data, analysis setup, learning, model output, quality, and communication channel, orchestrated by LLM-powered agents that produce stakeholder-specific narratives (Paterakis et al., 15 Aug 2025).
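The structural sketch below renders the three layers in schematic form; the class and method names are hypothetical illustrations, not APIs from the cited frameworks, and the adaptation rule is intentionally simplistic.

```python
# Hypothetical schematic of a three-layered HC-XAI system; names and logic are
# illustrative only, not taken from the cited frameworks.
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class ExplanationFoundation:                 # Layer 1: interpretable or post-hoc explainer
    explain_fn: Callable[[object], dict]     # instance -> raw attributions / rules

@dataclass
class HumanCenteredLayer:                    # Layer 2: expertise-aware presentation
    detail_by_expertise: dict = field(
        default_factory=lambda: {"novice": 2, "domain_expert": 5, "developer": 20})

    def present(self, raw: dict, expertise: str) -> dict:
        k = self.detail_by_expertise.get(expertise, 3)
        top = sorted(raw.items(), key=lambda kv: abs(kv[1]), reverse=True)[:k]
        return {"level_of_detail": k, "highlights": top}

@dataclass
class FeedbackLoop:                          # Layer 3: continuous refinement from user input
    layer: HumanCenteredLayer

    def record(self, expertise: str, too_complex: bool) -> None:
        k = self.layer.detail_by_expertise.get(expertise, 3)
        self.layer.detail_by_expertise[expertise] = max(1, k - 1) if too_complex else k + 1

def explain_for_user(foundation: ExplanationFoundation,
                     layer: HumanCenteredLayer, instance, expertise: str) -> dict:
    raw = foundation.explain_fn(instance)    # Layer 1: compute the raw explanation
    return layer.present(raw, expertise)     # Layer 2: tailor it to the user
# Layer 3: after each interaction, call FeedbackLoop.record(...) so presentation
# parameters adapt to the user's feedback over time.
```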
Intrinsic interpretability vs. post-hoc methods
- A growing body of work documents systematic disagreement among post-hoc explainers, motivating interpretable-by-design architectures (e.g., modular MLPs, interpretable mixture-of-experts, conditional computation routing) that deliver real-time, consistent, actionable, and faithfully human-understandable outputs (Swamy et al., 2023, Swamy, 28 May 2025); a disagreement-measurement sketch follows.
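Such disagreement can be checked directly in practice. The sketch below (assuming NumPy and SciPy; the attribution vectors would come from two different post-hoc explainers applied to the same prediction) measures rank agreement of feature importances and overlap of the top-k features.

```python
# Minimal sketch: quantify disagreement between two post-hoc explainers on the
# same prediction via rank correlation and top-k feature overlap.
import numpy as np
from scipy.stats import spearmanr

def explainer_agreement(attr_a, attr_b, top_k=5):
    """`attr_a`, `attr_b`: per-feature attribution vectors from two explainers."""
    attr_a, attr_b = np.asarray(attr_a), np.asarray(attr_b)
    rho, _ = spearmanr(np.abs(attr_a), np.abs(attr_b))    # rank agreement of importance
    top_a = set(np.argsort(-np.abs(attr_a))[:top_k])
    top_b = set(np.argsort(-np.abs(attr_b))[:top_k])
    jaccard = len(top_a & top_b) / len(top_a | top_b)     # overlap of top-k features
    return {"spearman_rho": rho, "top_k_jaccard": jaccard}
```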
6. Best Practices, Limitations, and Future Research Directions
Best practices and guidelines
- Always start with explicit user questions, then match XAI method (feature attribution, counterfactual, example, rule) accordingly (Liao et al., 2021, Ma, 28 Oct 2024).
- Offer multi-level, progressive explanations; enable users to move fluidly from global system overviews to fine-grained local justifications, with interactivity and conversational flow (Liao et al., 2021, Nguyen et al., 21 Mar 2024).
- Manage cognitive load via concise, context-aligned, personally relevant explanations, and clear communication of limitations and uncertainties.
- Anchor trust to observed performance and enable iterative feedback; surface unmet explanation needs rapidly with user-centered evaluation cycles.
- Address multi-stakeholder needs; integrate fairness, privacy, and security by design (Ma, 28 Oct 2024, Silva et al., 14 Apr 2025).
Open challenges
- Measuring "actionable understanding" beyond satisfaction or self-report remains an open problem.
- Operationalization of social theory in algorithmic and interface design is underexplored.
- Adaptive, personalized, and longitudinal explanations that evolve with user expertise, context, and goals have yet to be realized at scale (Ma, 28 Oct 2024, Silva et al., 14 Apr 2025).
- Developing open, standardized evaluation frameworks and meta-analytic infrastructure is essential for benchmarking and generalization (Ma et al., 20 Feb 2024).
- Conversational and generative explainers, especially when powered by LLMs, raise challenges of factuality and regulatory compliance (Paraschou et al., 13 Jun 2025, Meske et al., 8 Aug 2025).
7. Conclusion and Outlook
Human-centered XAI transforms explanations from technical artifacts into sociotechnical instruments that enable end-users—across levels of expertise and domains—to make sense of, trust, and appropriately act with AI. Success in HC-XAI combines algorithmic development, empirical HCI evaluation, participatory co-design, and theory-driven frameworks. The research agenda prioritizes adaptive, context-sensitive, and actionable communication that respects the diversity of user goals and safeguards ethical, accountable, and fair AI deployment. Progress in this direction is evidenced by the emergence of adaptive frameworks, stakeholder-aligned workflows, and rigorous evaluation protocols that collectively ensure explanations are not just reflections of model mechanics, but enablers of individual and societal understanding, trust, and agency (Liao et al., 2021, Ma, 28 Oct 2024, Silva et al., 14 Apr 2025, Nguyen et al., 21 Mar 2024).
References
- (Liao et al., 2021) Human-Centered Explainable AI (XAI): From Algorithms to User Experiences
- (Ma, 28 Oct 2024) Towards Human-centered Design of Explainable Artificial Intelligence (XAI): A Survey of Empirical Studies
- (Silva et al., 14 Apr 2025) A Multi-Layered Research Framework for Human-Centered AI: Defining the Path to Explainability and Trust
- (Nguyen et al., 21 Mar 2024) How Human-Centered Explainable AI Interface Are Designed and Evaluated: A Systematic Survey
- (Paterakis et al., 15 Aug 2025) A Comprehensive Perspective on Explainable AI across the Machine Learning Workflow
- (Swamy et al., 2023) The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations
- (Swamy, 28 May 2025) A Human-Centric Approach to Explainable AI for Personalized Education
- (Ma et al., 20 Feb 2024) OpenHEXAI: An Open-Source Framework for Human-Centered Evaluation of Explainable Machine Learning
- (Shajalal et al., 23 Apr 2024) Explaining AI Decisions: Towards Achieving Human-Centered Explainability in Smart Home Environments
- (Rogha, 2023) Explain To Decide: A Human-Centric Review on the Role of Explainable Artificial Intelligence in AI-assisted Decision Making
- (Rong et al., 2022) Towards Human-centered Explainable AI: A Survey of User Studies for Model Explanations
- (Mangold et al., 14 Oct 2025) On the Design and Evaluation of Human-centered Explainable AI Systems: A Systematic Review and Taxonomy