Adversarial Attacks on Automatic Speech Recognition Systems
The paper "Did you hear that? Adversarial Examples Against Automatic Speech Recognition" presents an important advancement in the paper of adversarial attacks within the field of machine learning, particularly focusing on the domain of automatic speech recognition (ASR). The research delineates a novel approach to generating adversarial examples that target neural-network-based speech recognition models, achieving a significant 87% success rate in targeted attacks. This attack method is notable for its gradient-free approach, circumventing the typical barriers posed by non-differentiable components in ASR models.
Methodology and Key Findings
The methodology is rooted in a genetic algorithm, a gradient-free optimization technique. This choice is essential because traditional methods such as FGSM and Carlini & Wagner rely on gradient computations, which are infeasible given the non-differentiable preprocessing stages of ASR models, particularly the extraction of Mel Frequency Cepstral Coefficients (MFCCs). The proposed method operates under a black-box threat model, functioning without explicit knowledge of the victim model's architecture or parameters.
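To make the black-box optimization concrete, the sketch below outlines a genetic attack of this general shape in Python. The `model_predict` scoring interface, population size, mutation rate, and noise scale are illustrative assumptions rather than the paper's exact configuration.

```python
import numpy as np

def genetic_attack(audio, target_label, model_predict,
                   pop_size=50, n_generations=500,
                   mutation_prob=0.005, noise_std=40.0, elite_frac=0.2):
    """Gradient-free targeted attack sketch: evolve small perturbations of the
    clip until the black-box model assigns the target label.
    `model_predict(candidate)` is assumed to return a vector of class scores."""
    n_elite = max(1, int(elite_frac * pop_size))
    # Start the population as noisy copies of the original clip.
    population = [audio + np.random.randn(*audio.shape) * noise_std
                  for _ in range(pop_size)]
    for _ in range(n_generations):
        # Fitness of each candidate = model score for the attacker's target label.
        scores = np.array([model_predict(c)[target_label] for c in population])
        order = np.argsort(scores)[::-1]
        best = population[order[0]]
        if np.argmax(model_predict(best)) == target_label:
            return best                                   # attack succeeded
        # Keep the elite, then refill the population via crossover and mutation.
        elite = [population[i] for i in order[:n_elite]]
        probs = scores / scores.sum() if scores.sum() > 0 else None
        children = []
        while len(children) < pop_size - n_elite:
            i, j = np.random.choice(pop_size, size=2, p=probs)
            mask = np.random.rand(*audio.shape) < 0.5     # uniform crossover
            child = np.where(mask, population[i], population[j])
            mutate = np.random.rand(*audio.shape) < mutation_prob
            child = child + mutate * np.random.randn(*audio.shape) * noise_std
            children.append(child)
        population = elite + children
    return best                                           # best effort if budget exhausted
```

The design choice in this sketch is that fitness is simply the model's score for the target class, so no gradient information is ever needed; elitism preserves the strongest candidates while crossover and sparse mutation keep the perturbation small.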
Key results from this research demonstrate that the adversarial noise need only alter the least significant bits of a subset of samples in the audio file, making the modifications imperceptible to human listeners 89% of the time. This subtle noise is nonetheless sufficient to redirect the ASR model's prediction to the attacker's target label.
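As a rough illustration of how such low-amplitude perturbations can be confined to the least significant bits of 16-bit audio samples, the following sketch rewrites only the low-order bits of a random subset of samples; the bit depth and the fraction of samples touched are arbitrary choices for the example, not values reported in the paper.

```python
import numpy as np

def add_lsb_noise(audio_int16, n_bits=8, frac_samples=0.5, rng=None):
    """Illustrative perturbation: randomly rewrite only the lowest `n_bits`
    of a fraction of 16-bit samples, leaving the high-order bits (and hence
    the audible waveform) essentially unchanged."""
    rng = np.random.default_rng() if rng is None else rng
    audio = audio_int16.copy()
    mask = rng.random(audio.shape) < frac_samples          # which samples to touch
    low = rng.integers(0, 2 ** n_bits, size=audio.shape, dtype=np.int16)
    cleared = audio & ~np.int16(2 ** n_bits - 1)           # zero out the low bits
    audio = np.where(mask, cleared | low, audio)
    return audio.astype(np.int16)
```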
Evaluation and Analysis
The experimental evaluation utilized a speech commands dataset and a convolutional neural network model for keyword recognition, which achieved an overall classification accuracy of 90% on non-adversarial inputs. The attacks were evaluated across 500 randomly selected audio clips, with adversarial counterparts generated so as to be misclassified as alternate target labels. This evaluation underscores the vulnerability of ASR systems to adversarial interference.
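A minimal evaluation loop in this spirit is sketched below: each clip is paired with a randomly drawn target label different from its true label, attacked, and counted as a success only if the model outputs exactly that target. The `attack` and `model_predict` interfaces are assumed to match the sketches above and are not the paper's actual code.

```python
import numpy as np

def targeted_success_rate(clips, labels, model_predict, attack,
                          n_labels=12, rng=None):
    """Evaluation sketch: fraction of clips for which the attack drives the
    model to a randomly chosen target label different from the true label."""
    rng = np.random.default_rng() if rng is None else rng
    successes = 0
    for clip, true_label in zip(clips, labels):
        target = rng.choice([l for l in range(n_labels) if l != true_label])
        adversarial = attack(clip, target, model_predict)
        if np.argmax(model_predict(adversarial)) == target:
            successes += 1
    return successes / len(clips)
```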
Human perceptual studies affirm that the generated adversarial examples generally remain inconspicuous to human listeners, with 89% of listeners identifying the audio clip as its original label even when the model was fooled into an incorrect classification.
Implications and Future Directions
From a practical standpoint, these findings highlight significant security implications for ASR systems used in consumer electronics, underscoring the importance of designing robust defenses against adversarial attacks. The research also provides a foundation for exploring more formidable attack vectors, such as those occurring in a physically instantiated environment where attacks are delivered via speakers and captured through microphones.
Theoretically, this research invites further exploration into the non-differentiable components typical of ASR systems and the potential for more powerful attacks in white-box settings using MFCC inversion or other techniques.
This paper paves the way for future research focused on:
- Evaluating similar attacks on more sophisticated ASR models that transcribe full sentences rather than isolated keywords.
- Investigating robust defense mechanisms and resilience strategies for protecting ASR systems against adversarial noise.
- Examining over-the-air attack scenarios that more closely mimic realistic voice-command interactions.
Overall, this paper contributes to the broader understanding of the vulnerabilities inherent in state-of-the-art machine learning models, underscoring the urgent need for advancing both offensive and defensive strategies within this domain.