
Audio Adversarial Examples: Attacks Using Vocal Masks (2102.02417v2)

Published 4 Feb 2021 in cs.SD, cs.AI, and eess.AS

Abstract: We construct audio adversarial examples on automatic Speech-To-Text systems. Given any audio waveform, we produce another by overlaying an audio vocal mask generated from the original audio. We apply our audio adversarial attack to five SOTA STT systems: DeepSpeech, Julius, Kaldi, wav2letter@anywhere and CMUSphinx. In addition, we engaged human annotators to transcribe the adversarial audio. Our experiments show that these adversarial examples fool State-Of-The-Art Speech-To-Text systems, yet humans are able to consistently pick out the speech. The feasibility of this attack introduces a new domain to study machine and human perception of speech.
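
The overlay step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the vocal mask is generated or how it is mixed with the original, so everything here is an assumption: the names `overlay_mask` and `placeholder_mask`, the `mask_gain` parameter, and the use of a time-reversed copy as a stand-in mask are all hypothetical and are not the authors' method. The sketch only shows sample-wise additive mixing of a waveform with a mask derived from it.

```python
import numpy as np


def overlay_mask(waveform, mask, mask_gain=0.5):
    """Mix a mask into a waveform by sample-wise addition.

    Assumption: additive mixing with a scalar gain. The paper's actual
    mixing rule is not given in the abstract.
    """
    waveform = np.asarray(waveform, dtype=np.float64)
    mask = np.asarray(mask, dtype=np.float64)
    if waveform.shape != mask.shape:
        raise ValueError("waveform and mask must have the same shape")
    mixed = waveform + mask_gain * mask
    # Rescale only if the mix would clip outside [-1, 1].
    peak = np.max(np.abs(mixed))
    if peak > 1.0:
        mixed = mixed / peak
    return mixed


def placeholder_mask(waveform):
    """Hypothetical stand-in mask: a time-reversed copy of the input.

    This is NOT the vocal mask from the paper; it only demonstrates a
    mask that is derived from the original audio.
    """
    return np.asarray(waveform, dtype=np.float64)[::-1].copy()


# One second of a 220 Hz tone at 16 kHz stands in for real speech.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
original = 0.5 * np.sin(2 * np.pi * 220.0 * t)

adversarial = overlay_mask(original, placeholder_mask(original), mask_gain=0.5)
```

In a real experiment the placeholder would be replaced by a mask built from the speech signal itself, and the mixed audio would then be passed to each STT system and to human annotators for transcription.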

Authors (6)
  1. Kai Yuan Tay (1 paper)
  2. Lynnette Ng (3 papers)
  3. Wei Han Chua (1 paper)
  4. Lucerne Loke (1 paper)
  5. Danqi Ye (1 paper)
  6. Melissa Chua (1 paper)
