Measuring Attack Success in Agentic AI Coding Editors
Develop an objective and reproducible methodology to measure the success of prompt injection attacks against agentic AI coding editors that autonomously execute terminal commands, explicitly accounting for semantic variations in executed commands and distinguishing malicious command executions from benign setup actions.
Sponsor
References
At last, how to effectively measure the success of these attacks is also an open question since we need to consider the semantic variations in executed commands and distinguish malicious actions from benign ones.
— "Your AI, My Shell": Demystifying Prompt Injection Attacks on Agentic AI Coding Editors
(2509.22040 - Liu et al., 26 Sep 2025) in Section 1 (Introduction)