Improve ASTRA’s computational efficiency
Develop algorithmic and implementation techniques that reduce the memory footprint and runtime of ASTRA, the attention-based prompt injection attack. ASTRA currently requires extracting and backpropagating through large attention matrices, and it is approximately twice as slow as Greedy Coordinate Gradient (GCG) under equal forward-pass budgets.
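To give a rough sense of why materialized attention matrices dominate memory, the following sketch estimates an upper bound on their size for a single sequence. The function name and the model configuration (32 layers, 32 heads, 2048 tokens, 16-bit floats) are hypothetical and chosen only for illustration; which layers and heads ASTRA actually extracts is not stated here, and gradients or other activations kept for backpropagation would add to this figure.

```python
def attention_matrix_bytes(n_layers, n_heads, seq_len, bytes_per_element=2):
    """Bytes needed to hold one seq_len x seq_len attention matrix
    per head per layer, for a single sequence."""
    return n_layers * n_heads * seq_len * seq_len * bytes_per_element


# Hypothetical configuration (an assumption, not taken from the paper):
# 32 layers, 32 heads, 2048-token sequence, 2 bytes per element.
total = attention_matrix_bytes(n_layers=32, n_heads=32, seq_len=2048)
print(f"{total / 2**30:.1f} GiB")  # prints "8.0 GiB"
```

Because the cost grows quadratically with sequence length, halving the sequence length or restricting extraction to a subset of layers and heads would shrink this estimate substantially under the same assumptions.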
References
We leave the question of efficiency of our attacks to future work.
— May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks
(2507.07417 - Pandya et al., 10 Jul 2025) in Section 7.3, Discussion: Limitations of ASTRA