Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models (2505.15406v1)

Published 21 May 2025 in cs.SD, cs.AI, and eess.AS

Abstract: The rise of Large Audio LLMs (LAMs) brings both potential and risks, as their audio outputs may contain harmful or unethical content. However, current research lacks a systematic, quantitative evaluation of LAM safety especially against jailbreak attacks, which are challenging due to the temporal and semantic nature of speech. To bridge this gap, we introduce AJailBench, the first benchmark specifically designed to evaluate jailbreak vulnerabilities in LAMs. We begin by constructing AJailBench-Base, a dataset of 1,495 adversarial audio prompts spanning 10 policy-violating categories, converted from textual jailbreak attacks using realistic text to speech synthesis. Using this dataset, we evaluate several state-of-the-art LAMs and reveal that none exhibit consistent robustness across attacks. To further strengthen jailbreak testing and simulate more realistic attack conditions, we propose a method to generate dynamic adversarial variants. Our Audio Perturbation Toolkit (APT) applies targeted distortions across time, frequency, and amplitude domains. To preserve the original jailbreak intent, we enforce a semantic consistency constraint and employ Bayesian optimization to efficiently search for perturbations that are both subtle and highly effective. This results in AJailBench-APT, an extended dataset of optimized adversarial audio samples. Our findings demonstrate that even small, semantically preserved perturbations can significantly reduce the safety performance of leading LAMs, underscoring the need for more robust and semantically aware defense mechanisms.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Zirui Song (21 papers)
  2. Qian Jiang (12 papers)
  3. Mingxuan Cui (4 papers)
  4. Mingzhe Li (85 papers)
  5. Lang Gao (14 papers)
  6. Zeyu Zhang (143 papers)
  7. Zixiang Xu (45 papers)
  8. Yanbo Wang (54 papers)
  9. Chenxi Wang (66 papers)
  10. Guangxian Ouyang (3 papers)
  11. Zhenhao Chen (12 papers)
  12. Xiuying Chen (80 papers)

Summary

We haven't generated a summary for this paper yet.