Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Benford's law: what does it say on adversarial images? (2102.04615v2)

Published 9 Feb 2021 in cs.CV, cs.AI, and cs.LG

Abstract: Convolutional neural networks (CNNs) are fragile to small perturbations in the input images. These networks are thus prone to malicious attacks that perturb the inputs to force a misclassification. Such slightly manipulated images aimed at deceiving the classifier are known as adversarial images. In this work, we investigate statistical differences between natural images and adversarial ones. More precisely, we show that employing a proper image transformation and for a class of adversarial attacks, the distribution of the leading digit of the pixels in adversarial images deviates from Benford's law. The stronger the attack, the more distant the resulting distribution is from Benford's law. Our analysis provides a detailed investigation of this new approach that can serve as a basis for alternative adversarial example detection methods that do not need to modify the original CNN classifier neither work on the raw high-dimensional pixels as features to defend against attacks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. João G. Zago (1 paper)
  2. Fabio L. Baldissera (1 paper)
  3. Eric A. Antonelo (4 papers)
  4. Rodrigo T. Saad (1 paper)
Citations (2)

Summary

We haven't generated a summary for this paper yet.