Localized Uncertainty Attacks (2106.09222v1)
Abstract: The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention to adversarial examples, resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic and stochastic classifiers. Under this threat model, we create adversarial examples by perturbing only regions in the inputs where a classifier is uncertain. To find such regions, we utilize the predictive uncertainty of the classifier when it is stochastic, or we learn a surrogate model to amortize the uncertainty when it is deterministic. Unlike $\ell_p$-ball or functional attacks, which perturb inputs indiscriminately, our targeted changes can be less perceptible. Under our threat model, these attacks still produce strong adversarial examples while retaining a greater degree of similarity with the inputs.
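
As a rough illustration of the stochastic case only, the sketch below estimates a per-pixel uncertainty map with Monte Carlo dropout (using the variance of input gradients across stochastic forward passes as a proxy for where the classifier is uncertain), keeps the most uncertain pixels as a mask, and applies a single FGSM-style step inside that mask. The helper names, the uncertainty proxy, and the single-step attack are illustrative assumptions, not the paper's actual method; the deterministic-classifier case with a learned surrogate is omitted.

```python
import torch
import torch.nn.functional as F


def predictive_uncertainty(model, x, n_samples=20):
    """Per-pixel uncertainty proxy: variance of input gradients across
    stochastic (MC dropout) forward passes. Hypothetical helper, not the
    paper's estimator."""
    model.train()  # keep dropout active so each forward pass is stochastic
    grads = []
    for _ in range(n_samples):
        x_s = x.clone().requires_grad_(True)
        probs = F.softmax(model(x_s), dim=1)
        # gradient of the top-class probability w.r.t. the input gives a
        # per-pixel sensitivity map for this stochastic sample
        probs.max(dim=1).values.sum().backward()
        grads.append(x_s.grad.detach())
    g = torch.stack(grads)           # (n_samples, B, C, H, W)
    return g.var(dim=0).mean(dim=1)  # per-pixel variance, averaged over channels


def localized_attack(model, x, y, eps=0.03, top_frac=0.1, n_samples=20):
    """Perturb only the most uncertain pixels with one FGSM-style step.
    A minimal sketch of the localization idea, not the paper's optimization."""
    unc = predictive_uncertainty(model, x, n_samples)           # (B, H, W)
    k = max(1, int(top_frac * unc[0].numel()))
    thresh = unc.flatten(1).topk(k, dim=1).values[:, -1]        # k-th largest per image
    mask = (unc >= thresh.view(-1, 1, 1)).float().unsqueeze(1)  # (B, 1, H, W)

    model.eval()
    x_adv = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    # apply the sign-gradient step only inside the uncertain region
    return (x + eps * mask * x_adv.grad.sign()).clamp(0, 1).detach()
```

Restricting the perturbation to the masked region is what keeps the adversarial example close to the input; the fraction of pixels perturbed (`top_frac` above) trades off attack strength against perceptibility.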
- Ousmane Amadou Dia (2 papers)
- Theofanis Karaletsos (28 papers)
- Caner Hazirbas (19 papers)
- Cristian Canton Ferrer (32 papers)
- Ilknur Kaynar Kabul (3 papers)
- Erik Meijer (10 papers)