Exploring Physical Adversarial Patch Attacks on Object Detection
The paper "On Physical Adversarial Patches for Object Detection" presents a novel methodology for conducting adversarial patch attacks on object detection systems, specifically targeting the YOLOv3 detector. Unlike prior adversarial attacks requiring the perturbation to directly overlap the object of interest, this paper introduces a method whereby a patch, strategically placed anywhere within the image frame, effectively suppresses the detection of all objects present. This approach significantly expands the scope of physical adversarial attacks on machine learning systems, circumventing the need to alter the objects themselves within a scene.
Methodological Approach
The authors design the adversarial patches using projected gradient descent (PGD) combined with expectation over transformation (EOT). The patch is optimized against an objective function tailored to the object detector itself, which distinguishes the approach from the DPatch methodology; the authors argue that DPatch suffers from inherent limitations, particularly when the patch must be realized physically.
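For concreteness, the sketch below shows what such a PGD-with-EOT patch optimization loop can look like. It is a minimal illustration rather than the authors' code: DummyDetector stands in for YOLOv3, its scalar output stands in for the detection loss over ground-truth boxes, and EOT is approximated by random scaling and placement of the patch at each step.

```python
# Minimal sketch of the patch optimization: projected gradient *ascent* on a
# universal patch, with EOT approximated by random scaling and placement.
# DummyDetector and its scalar output are placeholders, not YOLOv3.
import torch
import torch.nn.functional as F

class DummyDetector(torch.nn.Module):
    """Stand-in for a YOLOv3-style detector; returns a differentiable scalar."""
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(3, 16, 3, stride=2, padding=1)

    def forward(self, images):
        # In the real attack this would be the detector's loss over the
        # ground-truth boxes of the batch.
        return self.conv(images).mean()

def apply_patch(images, patch):
    """Paste the patch at a random scale and position (a crude EOT)."""
    _, _, h, w = images.shape
    size = int(patch.shape[-1] * torch.empty(1).uniform_(0.8, 1.2).item())
    scaled = F.interpolate(patch.unsqueeze(0), size=(size, size),
                           mode="bilinear", align_corners=False)[0]
    y = torch.randint(0, h - size + 1, (1,)).item()
    x = torch.randint(0, w - size + 1, (1,)).item()
    patched = images.clone()
    patched[:, :, y:y + size, x:x + size] = scaled
    return patched

def pgd_patch_attack(model, loader, patch_size=100, steps=500, alpha=0.01):
    """Untargeted PGD: increase the detection loss w.r.t. the patch pixels."""
    patch = torch.rand(3, patch_size, patch_size, requires_grad=True)
    for _, images in zip(range(steps), loader):
        loss = model(apply_patch(images, patch))
        loss.backward()
        with torch.no_grad():
            patch += alpha * patch.grad.sign()  # ascent step on the loss
            patch.clamp_(0.0, 1.0)              # project onto valid pixel range
            patch.grad.zero_()
    return patch.detach()

if __name__ == "__main__":
    # Toy "data loader": random images standing in for COCO frames.
    loader = (torch.rand(2, 3, 416, 416) for _ in range(50))
    adv_patch = pgd_patch_attack(DummyDetector(), loader, steps=50)
    print(adv_patch.shape)  # torch.Size([3, 100, 100])
```

In an actual attack, the placeholder model and scalar would be replaced by YOLOv3 and its training loss, and the transformation set would also cover rotation and lighting variation.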
A central component of the paper is the demonstration of the attack on YOLOv3, evaluated both in a controlled dataset setting (COCO) and in real time using a physical setup with webcam feeds. The researchers use untargeted PGD: rather than guiding the perturbation toward a predetermined target label, the optimization simply drives up the detector's loss so that detections are suppressed broadly.
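To give a flavor of the real-time physical evaluation, the sketch below shows a simple webcam loop that draws whatever boxes a detector reports for each frame. load_detector and detect are hypothetical placeholders for YOLOv3 loading and inference; only the OpenCV capture and drawing calls are concrete.

```python
# Minimal real-time check loop for holding a printed patch in front of a webcam.
# `load_detector` and `detect` are hypothetical placeholders for YOLOv3
# loading and inference; the OpenCV capture and drawing calls are real.
import cv2

def load_detector():
    # Placeholder: load YOLOv3 weights here with your framework of choice.
    return None

def detect(model, frame):
    # Placeholder: run the detector and return a list of
    # (class_name, confidence, x1, y1, x2, y2) tuples.
    return []

def main():
    model = load_detector()
    cap = cv2.VideoCapture(0)  # default webcam
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        for name, conf, x1, y1, x2, y2 in detect(model, frame):
            cv2.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0), 2)
            cv2.putText(frame, f"{name} {conf:.2f}", (x1, y1 - 5),
                        cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 1)
        cv2.imshow("detections", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):  # quit on 'q'
            break
    cap.release()
    cv2.destroyAllWindows()

if __name__ == "__main__":
    main()
```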
Experimental Results
The empirical evaluation, reported primarily through mAP (mean Average Precision), showed a substantial effect. Against YOLOv3, the adversarial patch reduced mAP from 55.4% to single-digit values across various confidence thresholds. These results contrast sharply with DPatch, which only achieved marginally reduced mAP values; at a confidence threshold of 0.001, for example, DPatch brought mAP down only to 39.6%, versus 13.8% for the proposed method.
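As a side note on reproducing such threshold-dependent mAP numbers, the sketch below filters detections by confidence before running the standard COCO evaluator. It is a generic pycocotools evaluation under assumed file paths and an assumed COCO-format results file, not necessarily the exact protocol used in the paper.

```python
# Sketch of mAP evaluation at a fixed confidence threshold using pycocotools.
# The annotation/result file paths are hypothetical; the results file is
# assumed to hold detections in the standard COCO "results" format.
import json
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

def map_at_threshold(ann_file, result_file, conf_thresh):
    coco_gt = COCO(ann_file)
    with open(result_file) as f:
        dets = json.load(f)
    # Keep only detections at or above the confidence threshold.
    kept = [d for d in dets if d["score"] >= conf_thresh]
    coco_dt = coco_gt.loadRes(kept)
    ev = COCOeval(coco_gt, coco_dt, iouType="bbox")
    ev.evaluate()
    ev.accumulate()
    ev.summarize()      # prints AP/AR; stats[0] is mAP@[.5:.95]
    return ev.stats[0]

if __name__ == "__main__":
    # Hypothetical paths: COCO val annotations and detections on patched images.
    print(map_at_threshold("instances_val2017.json",
                           "patched_detections.json", conf_thresh=0.001))
```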
The authors emphasize that the attack remains effective across transformations, accounting for rotations, scale variations, and translations, as well as under varied image quality and lighting conditions. Its efficacy in the physical domain is demonstrated with printed patches under real-world conditions, where the patch stays effective across multiple placements and distances from the camera.
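A fuller transformation sampler than the crude scale-and-paste step in the earlier sketch might look like the following: it composes a random rotation, scale, and translation into a single affine warp of the patch (and its mask) and jitters brightness to imitate lighting changes. The parameter ranges are illustrative assumptions, not values from the paper.

```python
# Sketch of an EOT-style transformation sampler: random rotation, scale, and
# translation applied as one affine warp, plus brightness jitter to mimic
# lighting changes. Parameter ranges are illustrative, not the authors' values.
import math
import torch
import torch.nn.functional as F

def sample_transform(patch, canvas_size=416):
    """Warp a (3, H, W) patch onto a (3, S, S) canvas; return canvas and mask."""
    c, h, w = patch.shape
    s = canvas_size
    # Pad the patch (and an all-ones mask) into the center of the canvas.
    canvas = torch.zeros(1, c, s, s)
    mask = torch.zeros(1, 1, s, s)
    top, left = (s - h) // 2, (s - w) // 2
    canvas[0, :, top:top + h, left:left + w] = patch
    mask[0, :, top:top + h, left:left + w] = 1.0
    # Random rotation, scale, and translation (in normalized coordinates).
    angle = math.radians(torch.empty(1).uniform_(-20, 20).item())
    scale = torch.empty(1).uniform_(0.7, 1.3).item()
    tx, ty = torch.empty(2).uniform_(-0.6, 0.6).tolist()
    cos_a, sin_a = math.cos(angle) / scale, math.sin(angle) / scale
    theta = torch.tensor([[[cos_a, -sin_a, tx],
                           [sin_a,  cos_a, ty]]], dtype=torch.float32)
    grid = F.affine_grid(theta, canvas.shape, align_corners=False)
    warped = F.grid_sample(canvas, grid, align_corners=False)
    warped_mask = F.grid_sample(mask, grid, align_corners=False)
    # Brightness jitter to approximate lighting variation.
    warped = (warped * torch.empty(1).uniform_(0.7, 1.3)).clamp(0, 1)
    return warped[0], warped_mask[0]

def composite(image, patch):
    """Overlay the transformed patch onto a (3, S, S) image."""
    warped, m = sample_transform(patch, canvas_size=image.shape[-1])
    return image * (1 - m) + warped * m

if __name__ == "__main__":
    img = torch.rand(3, 416, 416)
    print(composite(img, torch.rand(3, 100, 100)).shape)  # torch.Size([3, 416, 416])
```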
Implications and Future Directions
The paper has significant implications for the security of object detection systems, highlighting a potential vulnerability in autonomous systems such as those used in vehicular technologies, where object detection plays a critical safety role. The attack's ability to negate detections of objects such as pedestrians or traffic signals with a patch that never overlaps them is an urgent security concern that must be addressed.
Theoretically, the paper deepens the understanding of vulnerabilities in object detection networks and extends the adversarial example literature into physical settings that do not require overlap with the targeted objects. Future research could explore mitigation strategies and more robust defenses within detection frameworks to counter this vulnerability; refining these adversarial patches could also yield further insight into the adversarial robustness of deep learning models.
In conclusion, by demonstrating a viable and effective physical adversarial attack on object detection systems, the authors expose a new vector for potential malicious activity and urge the security community to investigate stronger safeguards and detection strategies. The paper contributes to both the practical and theoretical discussion of adversarial attacks in machine learning.