
MITRE ATLAS Adversarial ML Taxonomy

Updated 23 November 2025
  • MITRE ATLAS Adversarial ML Taxonomy is a structured classification scheme that defines and maps AI adversarial tactics along MLOps phases, enabling systematic threat analysis.
  • It offers actionable mitigation strategies and defense recommendations based on detailed red-teaming case studies and real-world adversarial incidents.
  • The taxonomy supports secure MLOps by linking specific attack techniques to phase-specific controls, guiding both operational resilience and future research.

The MITRE ATLAS (Adversarial Threat Landscape for Artificial-Intelligence Systems) taxonomy provides a comprehensive, continuously updated catalog of adversarial AI threats, with a particular focus on structuring these threats across the machine learning operations (MLOps) lifecycle. By mapping AI-focused attack tactics and techniques to discrete phases of MLOps workflows, the framework enables systematic threat assessment, actionable mitigation, and principled research into open attack surfaces unique to AI and ML infrastructure. This taxonomy integrates real-world incidents and red-teaming exercises, offering both granular attack-path analysis and practical defense recommendations across administrative, developmental, deployment, and operational ML contexts (Patel et al., 30 May 2025).

1. Mapping ATLAS Tactics to the MLOps Lifecycle

The MLOps pipeline is generally segmented into six primary phases: Administrative Setup, Data Collection, Model Development, Approval Workflow (e.g., QA/testing), Model Deployment, and Monitoring. The mapping below highlights five high-level ATLAS tactics, each of which typically manifests during specific MLOps phases.

| ATLAS Tactic (ID) | Associated MLOps Phases |
|---|---|
| Reconnaissance (AML.TA0002) | Admin Setup, Data Collection |
| Resource Development (AML.TA0003) | Admin Setup, Data Collection, Model Development |
| Discovery (AML.TA0008) | Model Development, Approval Workflow |
| Collection (AML.TA0009) | Approval Workflow |
| Impact (AML.TA0011) | Deployment, Monitoring |

This mapping establishes a threat surface continuum throughout the ML system’s lifecycle, supporting both phase-specific threat modeling and end-to-end pipeline defense design.
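For tooling, this mapping is straightforward to encode. Below is a minimal Python sketch; the tactic IDs and phases are taken from the table above, while the dictionary layout and helper function are illustrative rather than an ATLAS artifact:

```python
# Machine-readable form of the tactic-to-phase mapping above.
# The ATLAS IDs come from the table; the structure itself is illustrative.
TACTIC_PHASES = {
    "AML.TA0002": ("Reconnaissance", ["Admin Setup", "Data Collection"]),
    "AML.TA0003": ("Resource Development",
                   ["Admin Setup", "Data Collection", "Model Development"]),
    "AML.TA0008": ("Discovery", ["Model Development", "Approval Workflow"]),
    "AML.TA0009": ("Collection", ["Approval Workflow"]),
    "AML.TA0011": ("Impact", ["Deployment", "Monitoring"]),
}

def tactics_for_phase(phase: str) -> list[str]:
    """Return ATLAS tactic IDs that typically manifest in a given MLOps phase."""
    return [tid for tid, (_, phases) in TACTIC_PHASES.items() if phase in phases]

print(tactics_for_phase("Model Development"))  # ['AML.TA0003', 'AML.TA0008']
```

A table of this form supports phase-specific threat modeling: given the pipeline stage under review, enumerate the tactics that typically apply and pull their associated techniques and mitigations.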

2. ATLAS Tactics, Representative Techniques, and Concrete Examples

ATLAS further decomposes each tactic into techniques/sub-techniques, associated definitions, and emblematic real-world or red-team case studies. Select details appear below.

Reconnaissance (AML.TA0002)

This encompasses initial information gathering actions—both OSINT and active probing—to profile targeted ML assets:

  • Search for Victim’s Public Research (AML.T0000): Crawling journals, preprints, or technical blogs to extract architectural and toolchain intelligence. Example: the 2020 evasion of Cylance malware detectors leveraged publicly disclosed research.
  • Active Scanning (AML.T0006): Port and endpoint scanning to reveal live ML APIs; a minimal probing sketch follows.
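To illustrate what such scanning surfaces, the sketch below probes a host for well-known ML-serving routes. The host and route list are assumptions; /v1/models/{name} (TensorFlow Serving) and /v2/health/ready (Triton Inference Server) are common defaults that defenders can audit for unintended exposure:

```python
# Hypothetical sketch: probe a host for well-known ML-serving routes.
import requests  # third-party; pip install requests

CANDIDATE_ROUTES = [
    "/v1/models",        # TF Serving (a model name is normally required)
    "/v2/health/ready",  # Triton Inference Server readiness check
    "/metrics",          # Prometheus metrics, often left unauthenticated
]

def probe(host: str, timeout: float = 2.0) -> dict[str, int]:
    """Return HTTP status per candidate route; non-404s suggest a live ML API."""
    results = {}
    for route in CANDIDATE_ROUTES:
        try:
            r = requests.get(f"http://{host}{route}", timeout=timeout)
            results[route] = r.status_code
        except requests.RequestException:
            results[route] = -1  # unreachable
    return results

if __name__ == "__main__":
    print(probe("internal-ml-host.example"))  # hypothetical host
```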

Resource Development (AML.TA0003)

In this preparatory phase, adversaries acquire the technical means to enable and scale attacks:

  • Acquire Public ML Artifacts (AML.T0002): Download datasets/models via open cloud buckets or ML marketplaces.
  • Obtain Capabilities (AML.T0016), Develop Capabilities (AML.T0017): Use and creation of adversarial ML frameworks (e.g., CleverHans, ART), including code to operationalize poisoning/evasion attacks.
  • Poison Training Data (AML.T0020), Publish Poisoned Assets (AML.T0019, AML.T0058): Injection of backdoored or mislabeled samples into major public datasets; e.g., 2024 large-scale poisoning of Wikipedia prior to dataset crawls.
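To make the poisoning technique (AML.T0020) concrete, here is a minimal backdoor sketch against an image training set. The trigger pattern, poison rate, and array shapes are illustrative assumptions, not details of the cited incidents:

```python
# Backdoor-poisoning sketch: stamp a small trigger patch onto a fraction of
# training images and flip their labels to an attacker-chosen target class.
import numpy as np

def poison(images: np.ndarray, labels: np.ndarray,
           target_class: int = 0, rate: float = 0.01,
           seed: int = 0) -> tuple[np.ndarray, np.ndarray]:
    """images: (N, H, W) floats in [0, 1]; labels: (N,) ints."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(images), size=int(rate * len(images)), replace=False)
    poisoned_x, poisoned_y = images.copy(), labels.copy()
    poisoned_x[idx, -4:, -4:] = 1.0  # 4x4 white trigger in the corner
    poisoned_y[idx] = target_class   # mislabel toward the attacker's class
    return poisoned_x, poisoned_y

x = np.random.default_rng(1).random((1000, 28, 28))
y = np.random.default_rng(2).integers(0, 10, size=1000)
px, py = poison(x, y)
```

A model trained on such data behaves normally on clean inputs but misclassifies any input bearing the trigger, which is why provenance controls over public datasets matter.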

Discovery (AML.TA0008)

After initial intrusion, adversaries enumerate internal ML resources:

  • Discover ML Artifacts (AML.T0007): Search for model weights and schemas in file shares or container registries (see the sketch after this list).
  • LLM Meta Prompt Extraction (AML.T0056): Recovery of system prompts embedded in commercial services; e.g., 2020 red-team extraction from a facial recognition API.
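A minimal filesystem sweep for serialized model artifacts (AML.T0007) might look like the following; the root path and extension list are illustrative, and defenders can run the same sweep to find unprotected weights:

```python
# Sweep a file tree for common serialized-model extensions.
from pathlib import Path

MODEL_EXTS = {".pt", ".pth", ".onnx", ".pb", ".h5", ".pkl", ".safetensors"}

def find_model_artifacts(root: str) -> list[Path]:
    """Return files under root whose extension suggests a model artifact."""
    return [p for p in Path(root).rglob("*")
            if p.is_file() and p.suffix.lower() in MODEL_EXTS]

for artifact in find_model_artifacts("/mnt/shared"):  # hypothetical share
    print(artifact)
```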

Collection (AML.TA0009)

This covers exfiltration of sensitive ML assets:

  • ML Artifact Collection (AML.T0035): Copying of model binaries and logs, as in the 2020 SpiderSilk compromise of Clearview AI’s private repositories.
  • Harvesting Credentials (AML.T0037): Extraction of model registry tokens and configuration secrets.
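Because harvested registry tokens and configuration secrets (AML.T0037) are a recurring entry point, a pre-commit secret scan is a common countermeasure. The sketch below uses two illustrative patterns; real scanners combine many more signatures with entropy checks:

```python
# Minimal secret-scanning sketch: flag likely tokens/keys in config files
# before they can be harvested. Patterns are illustrative, not exhaustive.
import re
from pathlib import Path

PATTERNS = {
    "aws_access_key": re.compile(r"AKIA[0-9A-Z]{16}"),
    "generic_token": re.compile(
        r"(?i)(api|registry)[_-]?(key|token)\s*[:=]\s*\S{16,}"),
}

def scan_file(path: Path) -> list[tuple[str, int]]:
    """Return (pattern name, line number) pairs for suspected secrets."""
    hits = []
    for lineno, line in enumerate(path.read_text(errors="ignore").splitlines(), 1):
        for name, pat in PATTERNS.items():
            if pat.search(line):
                hits.append((name, lineno))
    return hits

for cfg in Path(".").rglob("*.yaml"):  # illustrative scope: repo configs
    for name, lineno in scan_file(cfg):
        print(f"{cfg}:{lineno}: possible {name}")
```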

Impact (AML.TA0011)

Attacks here seek to degrade ML integrity, availability, or escalate costs:

  • Evade ML Model (AML.T0015): Generation of adversarial examples that solve $x^* = \operatorname{argmax}_{\|\delta\| \leq \epsilon} L(f(x+\delta), y)$ (a standard formulation, not from the primary source); a one-step sketch follows this list.
  • Denial of ML Service (AML.T0029), Cost Harvesting (AML.T0034): Abuse of inference APIs to exhaust resources or inflate operational expenses.
  • Intellectual Property Theft (AML.T0048.004): Backdooring OEM Android models (“DeepPayload”).
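The evasion objective above can be approximated with a single signed-gradient step, the standard FGSM method (chosen here for brevity, not prescribed by ATLAS). In this sketch the model, loss, and eps are illustrative assumptions:

```python
# One-step FGSM approximation of argmax_{||delta||_inf <= eps} L(f(x+delta), y).
import torch
import torch.nn.functional as F

def fgsm(model: torch.nn.Module, x: torch.Tensor, y: torch.Tensor,
         eps: float = 0.03) -> torch.Tensor:
    """Return an adversarial x* within an L-infinity ball of radius eps."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    with torch.no_grad():
        x_adv = x_adv + eps * x_adv.grad.sign()  # ascend the loss surface
        x_adv = x_adv.clamp(0.0, 1.0)            # stay in valid pixel range
    return x_adv.detach()
```

Stronger iterated variants (e.g., PGD) repeat this step with projection back into the eps-ball, which is why "model hardening" mitigations emphasize adversarial training against iterative attacks.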

3. Categorization of Mitigation Strategies

ATLAS provides a taxonomy aligning defense mechanisms with each tactic/technique, featuring both preventive and detective controls:

| Attack Category | Representative Mitigations |
|---|---|
| Reconnaissance | Limit public research release; OSINT portal hardening |
| Resource Development | AI Bill of Materials (AML.M0023); code signing (AML.M0013); artifact provenance verification; network egress filtering |
| Discovery | ML query rate-limiting (AML.M0004); ensemble/query-noise defenses; inference API authentication |
| Collection | Encrypt information at rest (M1041); IAM roles; secret scanning; key rotation |
| Impact | Model hardening (AML.M0003): adversarial training, input restoration; access controls; IPS and telemetry logging |
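The Discovery-row control AML.M0004 can be as simple as a per-client token bucket in front of the inference API, slowing model extraction and probing. A minimal sketch follows; the capacity and refill rate are illustrative assumptions:

```python
# Token-bucket rate limiter for ML inference queries (cf. AML.M0004).
import time

class TokenBucket:
    def __init__(self, capacity: int = 60, refill_per_sec: float = 1.0):
        self.capacity = capacity
        self.refill = refill_per_sec
        self.tokens = float(capacity)
        self.last = time.monotonic()

    def allow(self) -> bool:
        """Consume one token if available, refilling based on elapsed time."""
        now = time.monotonic()
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.refill)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

buckets: dict[str, TokenBucket] = {}

def gate(client_id: str) -> bool:
    """Return True if this client's inference request may proceed."""
    return buckets.setdefault(client_id, TokenBucket()).allow()
```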

Many mitigations are explicit early-stage controls, emphasizing the importance of defensive design from inception.
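As an example of such an early-stage control, artifact provenance verification can be reduced to a digest check before any model is loaded. The manifest path and JSON format below are illustrative assumptions:

```python
# Provenance-verification sketch: compare a downloaded model's SHA-256
# digest against a pinned manifest before loading it.
import hashlib
import json
from pathlib import Path

def sha256(path: Path, chunk: int = 1 << 20) -> str:
    """Stream the file in 1 MiB chunks and return its hex SHA-256 digest."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        while block := f.read(chunk):
            h.update(block)
    return h.hexdigest()

def verify(artifact: Path, manifest: Path) -> bool:
    """Manifest format (assumed): {"model.onnx": "<hex digest>", ...}."""
    expected = json.loads(manifest.read_text())
    return sha256(artifact) == expected.get(artifact.name)

if not verify(Path("model.onnx"), Path("manifest.json")):  # hypothetical files
    raise SystemExit("Model artifact failed provenance check; refusing to load.")
```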

4. Real-World Case Studies and Red-Teaming Insights

Numerous incidents and adversarial simulations illustrate the taxonomy’s practical applicability:

  • 2020 Cylance Evasion: Attackers exploited OSINT from adversarial ML research to defeat malware classifiers.
  • Clearview AI Compromise: Exposed source and credentials via insufficient website hardening.
  • MITRE Red-Teaming (Physical Attacks): Deployment of adversarial physical artifacts (e.g., camera-print stickers) to subvert facial recognition.
  • Large-Scale Wikipedia Poisoning (2024): Timely insertion of poisoned samples before dataset acquisition, illustrating vulnerabilities in open data pipeline stages.
  • ShadowRay (2024): Adversarial use of distributed job APIs for resource exhaustion attacks.

These examples substantiate the phase- and tactic-mapping by linking theoretical attack surfaces to empirical adversarial outcomes.

5. Outstanding Research Gaps and Open Challenges

Critical MLOps security challenges remain, spanning technical, organizational, and legal domains:

  1. AI-Driven Social Engineering: Growth in generative deepfake and phishing attacks; demand for real-time deepfake detectors, robust multi-factor authentication.
  2. Malicious Repositories & LLM Hallucinations: Need for automated detection of poisoned ML packages and hallucinated model/code outputs.
  3. APTs Misusing AI: Lack of legal/ethical frameworks and detection technologies for state-sponsored generative malware.
  4. Benchmarking Defenses: Scarcity of dynamic, representative MLOps red-team/adaptive adversarial testbeds.
  5. Collaborative Red-Teaming: Demand for standardized, industry-wide red-team frameworks targeting MLOps security.
  6. Secure MLOps by Construction: Emphasis on early integration of security via threat modeling, secure CI/CD, continuous patching.
  7. Open vs. Closed Source: Tension between transparency (for audit/fairness) and risk (via public reconnaissance).
  8. Operator Cyber-Hygiene: Cross-disciplinary practitioner training, enforcement of secure coding, incentives for compliance.
  9. Transparency vs. Trust: Mechanisms for non-leaking dissemination of AI governance information.
  10. Copyright, Privacy & Compliance: Implementation of automated data audits, enforceable proprietary-data constraints, and development of post-quantum resilient ML pipelines.

This suggests that MLOps security is inseparable from organizational policy, regulatory compliance, and evolving threat intelligence, requiring both adaptive defenses and principled best practices (Patel et al., 30 May 2025).

6. Significance and Foundation for Secure MLOps

By synthesizing tactics, techniques, real-world cases, and actionable mitigations, the MITRE ATLAS taxonomy establishes a foundational framework for end-to-end secure MLOps design and governance. Early-stage, principled security integration is essential for maintaining trust, protecting intellectual property, and mitigating rapidly evolving ML-specific threats. The taxonomy’s systematic structure enables reproducible red-teaming, facilitates regulatory compliance, and informs future research into the robust and resilient operation of AI-powered systems (Patel et al., 30 May 2025).
