Analyzing the Optical Adversarial Attack Framework
The paper "Optical Adversarial Attack" by Abhiram Gnanasambandam, Alex M. Sherman, and Stanley H. Chan introduces a novel adversarial attack method that operates in the physical space. The proposed technique, OPtical ADversarial Attack (OPAD), innovatively perturbs the visual input to artificial intelligence systems using structured illumination, thereby obviating the need to physically alter the objects. This approach leverages a low-cost combination of a projector, camera, and computational model to deceive image classifiers.
Fundamental Advances and Challenges Addressed
OPAD is a non-invasive adversarial method that stands out for explicitly incorporating the optics of the devices in the pipeline, a gap left open by most digital and physical attack formulations. The perturbation is delivered as projected light, so it passes through the projector's non-linear radiometric response and is modulated by the spatially varying spectral characteristics of the scene before it ever reaches the camera.
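The paper models this optical path explicitly; as a rough illustration of the idea, the sketch below implements a simplified forward model with a gamma-type projector non-linearity, a 3x3 spectral mixing matrix, per-pixel scene reflectance, ambient light, and sensor clipping. The names and numerical values here (optical_forward, gamma, mix, ambient) are illustrative assumptions, not the paper's calibrated quantities or notation.

```python
import numpy as np

def optical_forward(pattern, reflectance, gamma=2.2, mix=None, ambient=0.05):
    """Map a projector input pattern (H, W, 3) in [0, 1] to the image the
    camera would capture under a simplified projector-camera model:
      1. the projector applies a non-linear radiometric (gamma) response,
      2. a 3x3 spectral mixing matrix couples the color channels,
      3. the scene scales the light by its per-pixel reflectance,
      4. ambient light adds an offset and the sensor clips at saturation.
    """
    if mix is None:
        # mild crosstalk between projector primaries and camera color filters
        mix = np.array([[0.90, 0.05, 0.05],
                        [0.05, 0.90, 0.05],
                        [0.05, 0.05, 0.90]])
    radiance = np.power(np.clip(pattern, 0.0, 1.0), gamma)  # projector non-linearity
    mixed = radiance @ mix.T                                 # spectral crosstalk
    captured = reflectance * mixed + ambient                 # scene reflectance + ambient
    return np.clip(captured, 0.0, 1.0)                       # sensor saturation

# Example: a uniform mid-gray projection onto a scene with varying reflectance.
rng = np.random.default_rng(0)
reflectance = rng.uniform(0.2, 0.9, size=(64, 64, 3))
captured = optical_forward(np.full((64, 64, 3), 0.5), reflectance)
```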
The core challenge OPAD addresses is that this non-linear projector-camera transformation distorts a digitally crafted adversarial pattern once it is projected into the physical scene. Conventional digital attacks, projected as-is, therefore fail; OPAD calibrates the optical path and compensates for these distortions before projection.
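OPAD's actual calibration estimates per-pixel radiometric and spectral parameters; the minimal sketch below only conveys the idea under an assumed scalar response: project a ramp of flat calibration levels, fit the device non-linearity by least squares, and then drive the projector through the inverse of the fitted response so the camera records the intended value. The function names and constants are hypothetical.

```python
import numpy as np

def simulated_capture(level, gain=0.7, gamma=2.2, ambient=0.05):
    """Stand-in for photographing a flat calibration pattern projected at a
    given drive level in [0, 1]; the true response is unknown to the fit."""
    return gain * level ** gamma + ambient

# Step 1: project a ramp of flat gray levels and record the camera response.
levels = np.linspace(0.05, 1.0, 12)
observed = simulated_capture(levels)

# Step 2: fit the radiometric response.  After subtracting a crude ambient
# estimate, log(observed) is roughly linear in log(level), so ordinary least
# squares recovers the exponent (gamma) and the gain.
ambient_est = observed.min() - 1e-3
A = np.stack([np.log(levels), np.ones_like(levels)], axis=1)
slope, intercept = np.linalg.lstsq(A, np.log(observed - ambient_est), rcond=None)[0]
gamma_est, gain_est = slope, np.exp(intercept)

# Step 3: pre-compensate.  To make the camera record a desired intensity,
# drive the projector through the inverse of the fitted response.
def precompensate(desired):
    return np.clip((desired - ambient_est) / gain_est, 0.0, 1.0) ** (1.0 / gamma_est)

drive = precompensate(0.4)
print("drive level:", drive, "captured:", simulated_capture(drive))
```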
Methodological Framework
The method folds the projector-camera model directly into the adversarial attack formulation. Rather than optimizing a perturbation on the digital image, OPAD optimizes the pattern sent to the projector: the attack objective is evaluated on the image the camera captures after the pattern has passed through the calibrated optical model, subject to constraints that keep the illumination change small enough to remain inconspicuous. In practice, the pipeline first characterizes the optical path through calibration and then solves this constrained optimization, which keeps the perturbation effective once it is physically projected onto the object while still steering the classifier toward the intended output.
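A minimal PGD-style sketch of this idea follows: the projector pattern is optimized by backpropagating the classifier's loss through a differentiable stand-in for the calibrated projector-camera model. The classifier here is an untrained toy network and the optical parameters are placeholders, not the paper's calibrated model or pretrained targets.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Toy stand-in classifier; the paper attacks real pretrained classifiers.
classifier = nn.Sequential(
    nn.Conv2d(3, 8, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(8, 16, 3, stride=2, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 10),
).eval()
for p in classifier.parameters():
    p.requires_grad_(False)  # only the projector pattern is optimized

def optical_model(pattern, reflectance, gamma=2.2, ambient=0.05):
    """Differentiable stand-in for the calibrated projector-camera mapping:
    projector non-linearity, per-pixel reflectance, ambient light, clipping."""
    radiance = pattern.clamp(0, 1).pow(gamma)
    return (reflectance * radiance + ambient).clamp(0, 1)

# Placeholder scene: a reflectance map and a plain (unperturbed) illumination.
reflectance = torch.rand(1, 3, 64, 64) * 0.7 + 0.2
base_pattern = torch.full((1, 3, 64, 64), 0.5)
target_class = torch.tensor([3])          # desired (wrong) label for a targeted attack
eps, step, n_iter = 0.08, 0.01, 100       # budget on the pattern change

delta = torch.zeros_like(base_pattern, requires_grad=True)
for _ in range(n_iter):
    captured = optical_model(base_pattern + delta, reflectance)
    loss = F.cross_entropy(classifier(captured), target_class)
    loss.backward()
    with torch.no_grad():
        delta -= step * delta.grad.sign()  # targeted PGD step through the optics
        delta.clamp_(-eps, eps)            # keep the illumination change small
    delta.grad.zero_()

captured = optical_model(base_pattern + delta, reflectance)
print("predicted class:", classifier(captured).argmax(dim=1).item())
```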
Experimental Validation and Results
The experimental validation is thorough, demonstrating the attack's versatility across multiple real-world setups. The paper reports successful adversarial attacks against a suite of image classifiers in both white-box and black-box configurations, and for both targeted and untargeted objectives. The attack manipulates classifier decisions while remaining robust to changes in environmental lighting, showing consistent behavior in real scenes.
Quantitatively, OPAD achieved a 48% success rate across a diverse set of trials, clearly outperforming uncalibrated optical perturbations. The authors also provide a theoretical analysis of when an optical attack is physically feasible, showing that low object reflectivity and strongly saturated surface colors limit the perturbations the projector can realize.
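The toy calculation below only illustrates the intuition behind those limits under the same simplified model assumed in the earlier sketches, not the paper's analysis: a channel with near-zero reflectance (a strongly saturated surface color) leaves almost no room for the projector to change what the camera records.

```python
import numpy as np

def achievable_range(reflectance, drive=0.5, gamma=2.2, ambient=0.05):
    """Per-channel range of captured intensities reachable by sweeping the
    projector drive over [0, 1], under the simplified model
        captured = clip(reflectance * drive**gamma + ambient, 0, 1).
    Channels with near-zero reflectance leave almost no room to perturb."""
    reflectance = np.asarray(reflectance, dtype=float)
    lo = np.clip(ambient + 0.0 * reflectance, 0.0, 1.0)            # projector off
    hi = np.clip(reflectance + ambient, 0.0, 1.0)                  # projector at full power
    now = np.clip(reflectance * drive ** gamma + ambient, 0.0, 1.0)
    return now - lo, hi - now   # maximum decrease / increase per channel

# A neutral gray patch vs. a strongly saturated red patch (illustrative values).
for name, refl in [("gray", [0.6, 0.6, 0.6]), ("saturated red", [0.8, 0.05, 0.05])]:
    dec, inc = achievable_range(refl)
    print(f"{name}: max decrease {dec.round(2)}, max increase {inc.round(2)}")
```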
Implications and Future Prospects
The practical implications of OPAD are substantial, ranging from exposing a realistic threat model to informing defenses for image recognition systems. A detailed understanding of optical adversarial behavior also opens avenues for research into machine learning models that are robust to real-world variability, and it suggests adversarial training schemes designed to proactively counter illumination-based attacks.
Looking forward, the work invites exploration of further optical adversarial strategies and of extensions to sensing modalities beyond vision. A clearer understanding of the interplay between physical optics and machine perception enriches the current adversarial landscape and lays the groundwork for future work on securing AI systems.