Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Adversarial Attacks and Defenses for Speech Recognition Systems (2103.17122v1)

Published 31 Mar 2021 in eess.AS, cs.CR, and cs.SD

Abstract: The ubiquitous presence of machine learning systems in our lives necessitates research into their vulnerabilities and appropriate countermeasures. In particular, we investigate the effectiveness of adversarial attacks and defenses against automatic speech recognition (ASR) systems. We select two ASR models - a thoroughly studied DeepSpeech model and a more recent Espresso framework Transformer encoder-decoder model. We investigate two threat models: a denial-of-service scenario where fast gradient-sign method (FGSM) or weak projected gradient descent (PGD) attacks are used to degrade the model's word error rate (WER); and a targeted scenario where a more potent imperceptible attack forces the system to recognize a specific phrase. We find that the attack transferability across the investigated ASR systems is limited. To defend the model, we use two preprocessing defenses: randomized smoothing and WaveGAN-based vocoder, and find that they significantly improve the model's adversarial robustness. We show that a WaveGAN vocoder can be a useful countermeasure to adversarial attacks on ASR systems - even when it is jointly attacked with the ASR, the target phrases' word error rate is high.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Sonal Joshi (7 papers)
  2. Yiwen Shao (14 papers)
  3. Jan Trmal (11 papers)
  4. Najim Dehak (71 papers)
  5. Sanjeev Khudanpur (74 papers)
  6. Jesus Villalba (47 papers)
  7. Piotr Żelasko (36 papers)
Citations (27)

Summary

We haven't generated a summary for this paper yet.