Stealth benefit of image semantics when text alone suffices
Ascertain whether incorporating image semantics within the Cross-modal Adversarial Multimodal Obfuscation (CAMO) prompts provides additional stealth benefits in scenarios where textual cues alone are sufficient to execute the attack.
Sponsor
References
In cases where textual cues alone suffice, the added value of image semantics in enhancing stealth is unclear.
— Cross-Modal Obfuscation for Jailbreak Attacks on Large Vision-Language Models
(2506.16760 - Jiang et al., 20 Jun 2025) in Section: Limitation and Future Work