Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without an Accuracy Tradeoff (2011.09527v1)

Published 18 Nov 2020 in cs.CR and cs.LG

Abstract: Data poisoning and backdoor attacks manipulate victim models by maliciously modifying training data. In light of this growing threat, a recent survey of industry professionals revealed heightened fear in the private sector regarding data poisoning. Many previous defenses against poisoning either fail in the face of increasingly strong attacks, or they significantly degrade performance. However, we find that strong data augmentations, such as mixup and CutMix, can significantly diminish the threat of poisoning and backdoor attacks without trading off performance. We further verify the effectiveness of this simple defense against adaptive poisoning methods, and we compare to baselines including the popular differentially private SGD (DP-SGD) defense. In the context of backdoors, CutMix greatly mitigates the attack while simultaneously increasing validation accuracy by 9%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Eitan Borgnia (9 papers)
  2. Valeriia Cherepanova (16 papers)
  3. Liam Fowl (25 papers)
  4. Amin Ghiasi (11 papers)
  5. Jonas Geiping (73 papers)
  6. Micah Goldblum (96 papers)
  7. Tom Goldstein (226 papers)
  8. Arjun Gupta (24 papers)
Citations (122)

Summary

We haven't generated a summary for this paper yet.