Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Attacking Adversarial Attacks as A Defense (2106.04938v1)

Published 9 Jun 2021 in cs.LG and cs.CR

Abstract: It is well known that adversarial attacks can fool deep neural networks with imperceptible perturbations. Although adversarial training significantly improves model robustness, failure cases of defense still broadly exist. In this work, we find that the adversarial attacks can also be vulnerable to small perturbations. Namely, on adversarially-trained models, perturbing adversarial examples with a small random noise may invalidate their misled predictions. After carefully examining state-of-the-art attacks of various kinds, we find that all these attacks have this deficiency to different extents. Enlightened by this finding, we propose to counter attacks by crafting more effective defensive perturbations. Our defensive perturbations leverage the advantage that adversarial training endows the ground-truth class with smaller local Lipschitzness. By simultaneously attacking all the classes, the misled predictions with larger Lipschitzness can be flipped into correct ones. We verify our defensive perturbation with both empirical experiments and theoretical analyses on a linear model. On CIFAR10, it boosts the state-of-the-art model from 66.16% to 72.66% against the four attacks of AutoAttack, including 71.76% to 83.30% against the Square attack. On ImageNet, the top-1 robust accuracy of FastAT is improved from 33.18% to 38.54% under the 100-step PGD attack.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Boxi Wu (36 papers)
  2. Heng Pan (14 papers)
  3. Li Shen (363 papers)
  4. Jindong Gu (101 papers)
  5. Shuai Zhao (116 papers)
  6. Zhifeng Li (74 papers)
  7. Deng Cai (181 papers)
  8. Xiaofei He (70 papers)
  9. Wei Liu (1135 papers)
Citations (28)

Summary

We haven't generated a summary for this paper yet.