Papers
Topics
Authors
Recent
Search
2000 character limit reached

Agentic Explainability at Scale: Between Corporate Fears and XAI Needs

Published 16 Apr 2026 in cs.HC and cs.AI | (2604.14984v1)

Abstract: As companies enter the race for agentic AI adoption, fears surface around agentic autonomy and its subsequent risks. These fears compound as companies scale their agentic AI adoption with low-code applications, without a comparable scaling in their governance processes and expertise resulting in a phenomenon known as "Agent Sprawl". While shadow AI tools can help with agentic discovery and identification, few observability tools offer insights into the agents' configuration and settings or the decision-making process during agent-to-agent communication and orchestration. This paper explores AI governance professionals' concerns in enterprise settings, while offering design-time and runtime explainability techniques as suggested by AI governance experts for addressing those fears. Finally, we provide a preliminary prototype of an Agentic AI Card that can help companies feel at ease deploying agents at scale.

Authors (2)

Summary

  • The paper demonstrates through surveys and interviews that unchecked agentic AI can trigger agent sprawl and uncontrolled permission inheritances, elevating data leak risks.
  • The paper advocates design-time and runtime explainability solutions—such as Agent Cards, dependency graphs, and operational guardrails—to ensure traceability and mitigate autonomous actions.
  • The research highlights a paradigm shift from interpretability to auditability, emphasizing that robust monitoring and dynamic logging are essential for mitigating cascading failures in multi-agent systems.

Agentic Explainability in Enterprise: Governance Challenges and Scalable XAI Solutions

Introduction

The proliferation of agentic AI within enterprise environments has catalyzed the migration from static AI models to scalable, autonomous multi-agent systems. This transition introduces substantial complexity not only in the technical architecture but also in risk management, accountability, and explainability. The paper "Agentic Explainability at Scale: Between Corporate Fears and XAI Needs" (2604.14984) systematically examines these concerns, elucidates governance professionals’ fears regarding unchecked agentic autonomy, and proposes structured mechanisms for explainability tailored for large-scale enterprise deployments. The study utilizes both qualitative interviews with AI governance practitioners and quantitative analysis from a survey of executives, providing empirical grounding for its recommendations.

Shifting Risks: Agent Sprawl and Autonomy

Enterprise adoption of low-code agentic AI platforms has enabled the rapid deployment of thousands of agents, often by non-technical personnel, without commensurate increases in governance oversight. This phenomenon, termed "Agent Sprawl," creates a digital ecosystem wherein operational boundaries are blurred and unchecked agents propagate duplicative, resource-consuming workflows. The risks associated with agentic autonomy are not only volumetric but also qualitatively distinct; permission inheritance and uncontrolled agent-to-agent communication facilitate data leakage, adversarial influence, and loss of causal traceability. The paper presents strong evidence, supported by survey data, that these risks are not hypothetical but a systemic reality, as enterprises face tangible failures in monitoring and accountability.

Governance Concerns: Cascade Effects and Accountability

Three primary concerns consistently emerged from the empirical research:

  1. Daisy-Chain Reactions: The unpredictability and opacity of agent-to-agent connectivity, wherein a chain of interactions between agents (including third-party entities) can induce uncontrolled consequences. Practitioners fear adversarial persuasion or confidential data leakage arising from deep, unmonitored chains, resulting in amplified downstream harm and loss of causal understanding.
  2. Permission Inheritance Blast Radius: Agents inherit access privileges of their creators and may propagate those privileges to other agents, implicitly bypassing institutional access controls. This exposes enterprises to security breaches, particularly when high-clearance agents grant unauthorized low-clearance agents access to sensitive assets.
  3. Accountability of Autonomous Actions: Autonomous agents can execute actions without direct human validation, including irreversible operations (e.g., database deletions, information exfiltration). The absence of clear ownership and the inability to intervene (“kill switch” deficit) compounds fears over runaway autonomy and lack of liability traceability.

Explainability Requirements for Agentic AI at Scale

The empirical results demonstrate a fundamental shift from interpretability-centric XAI (e.g., model feature attribution) toward auditability and traceability. AI governance professionals demand visibility into agent lineage, connectivity, and permission flows rather than technical transparency at the model level alone. Design-time and runtime explainability techniques are prioritized to address risks before, during, and after agentic actions.

Design-Time Mechanisms

  • Agent Inventory/Registry: Establishing a comprehensive catalog of all agents (ownership, deployment status, active/inactive) is identified as a prerequisite for scalable governance. This enables leadership to strategize around adoption and risk, serving as a foundation for subsequent interventions.
  • Agent Cards: Modeled after "model cards," Agent Cards document dynamic agent characteristics including interacting entities, autonomy level, risk assessments, and evidence of least privilege. These standardized disclosures create a single source of truth to differentiate legitimate agents from shadow AI experiments and to inform both enterprise leadership and technical evaluators.
  • Dependency Graphs: Visual representations of agent interactions and tool dependencies allow practitioners to assess blast radii and proactively manage risk. These graphs provide layered visibility into direct and indirect connectivity, informing response strategies for cascade failures.

Runtime Mechanisms

  • Deep Observability: Decision traces, capturing chain-of-thought and tool invocations, are necessary for uncovering the reasoning underlying agent decisions and debugging complex agent interactions. Observable traces allay fears of black-box behavior in high-stakes environments.
  • Contextual Traceability: Runtime logs must include the agent’s operating context (e.g., user metadata, authentication, permission scope) to ensure least-privilege operations and detect lateral movement or privilege escalation.
  • Operational Monitoring and Guardrails: Real-time monitoring enables practitioners to terminate deviating agents (“kill switch”), limiting harm and reinforcing accountability. Automated safeguards that block policy-violating actions are rated as critical by a vast majority of governance executives.

Prototype and Implementation Implications

The paper showcases a prototype Agent Card for a Security Monitoring Agent, synthesizing design-time and runtime explainability requirements into a standardized documentation artifact. The authors argue for automatic generation and continual updating of Agent Cards via platform integrations and evaluation suites to ensure documentation fidelity and dynamic risk assessment.

Theoretical and Practical Implications

The research advances the formalization of XAI in agentic contexts by explicitly linking explainability to operational and governance requirements at scale. Practically, enterprises must re-engineer governance frameworks to accommodate agent inventories, enforce permission boundaries, and deploy observability infrastructure. Theoretically, the findings reveal that as agentic systems become more autonomous, auditability supersedes interpretability, and organizational trust hinges on systemic transparency and intervention capabilities.

The empirical data showing >80% criticality for automated safeguards and transparent audit trails signifies a paradigm shift in enterprise XAI expectations. The necessity of runtime measures reinforces the need for new technical architectures capable of granular monitoring, contextual logging, and real-time control of multi-agent ecosystems.

Speculation on Future Developments

Anticipated future directions include:

  • Integration of explainability requirements directly into agent platforms and SDKs, enabling continuous automated risk evaluation.
  • Regulatory frameworks mandating auditability artifacts for enterprise agentic deployments.
  • Advancements in dependency graph analytics for dynamic threat detection and blast radius containment.
  • Enhanced operational guardrails (kill switches) and HITL interfaces for scaling human oversight in multi-agent environments.
  • Research into consensus-driven reasoning among agents to mitigate cascading errors and amplify systemic resilience.

Conclusion

The paper provides a rigorous analysis of agentic AI explainability and governance challenges within large-scale enterprise contexts. The findings highlight a migration from interpretability to auditability, demanding tools for agent inventorying, dynamic disclosure, dependency mapping, and runtime observability. Both practitioners and executives prioritize mechanisms ensuring traceability, least-privilege enforcement, and actionable operational controls. The proposed architectural and governance interventions are foundational for safe, trustworthy scaling of agentic AI. These insights offer actionable guidance for researchers and industry stakeholders as agentic ecosystems become increasingly pervasive and autonomous.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.