Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving (2403.17301v2)
Abstract: Deep learning-based monocular depth estimation (MDE), extensively applied in autonomous driving, is known to be vulnerable to adversarial attacks. Previous physical attacks against MDE models rely on 2D adversarial patches, so they only affect a small, localized region in the MDE map but fail under various viewpoints. To address these limitations, we propose 3D Depth Fool (3D$2$Fool), the first 3D texture-based adversarial attack against MDE models. 3D$2$Fool is specifically optimized to generate 3D adversarial textures agnostic to model types of vehicles and to have improved robustness in bad weather conditions, such as rain and fog. Experimental results validate the superior performance of our 3D$2$Fool across various scenarios, including vehicles, MDE models, weather conditions, and viewpoints. Real-world experiments with printed 3D textures on physical vehicle models further demonstrate that our 3D$2$Fool can cause an MDE error of over 10 meters.
- Tesla ai day. https://youtu.be/j0z4FweCy4M?t=5295, a.
- Tesla use per-pixel depth estimation with self-supervised learning. https://youtu.be/hx7BXih7zx8?t=1334, b.
- Automold. https://github.com/UjjwalSaxena/Automold--Road-Augmentation-Library, 2022-12-20.
- Synthesizing robust adversarial examples. In Proceedings of the 35th International Conference on Machine Learning, (ICML), pages 284–293, 2018.
- Adversarial patch. arXiv preprint arXiv:1712.09665, 2017.
- Towards evaluating the robustness of neural networks. In 2017 IEEE Symposium on Security and Privacy (SP), pages 39–57, Los Alamitos, CA, USA, 2017. IEEE Computer Society.
- Adversarial attacks on monocular pose estimation. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 12500–12505, 2022.
- Physical attack on monocular depth estimation with optimal adversarial patches. In Computer Vision – ECCV 2022, pages 514–532, Cham, 2022. Springer Nature Switzerland.
- Gesture recognition using a depth camera for human robot collaboration on assembly line. Procedia Manufacturing, 3:518–525, 2015. 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences, AHFE 2015.
- Vision for mobile robot navigation: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2):237–267, 2002.
- CARLA: An open urban driving simulator. In Proceedings of the 1st Annual Conference on Robot Learning, pages 1–16, 2017.
- Depth map prediction from a single image using a multi-scale deep network. In Neural Information Processing Systems, 2014.
- Robust physical-world attacks on deep learning visual classification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1625–1634, 2018.
- Digging into self-supervised monocular depth prediction. 2019.
- Explaining and harnessing adversarial examples. CoRR, abs/1412.6572, 2014.
- Aparate: Adaptive adversarial patch for cnn-based monocular depth estimation for autonomous navigation, 2023a.
- Saam: Stealthy adversarial attack on monoculor depth estimation. ArXiv, abs/2308.03108, 2023b.
- Neural 3d mesh renderer. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
- Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015.
- Deeper depth prediction with fully convolutional residual networks. In 3D Vision (3DV), 2016 Fourth International Conference on, pages 239–248. IEEE, 2016.
- Learning depth from single monocular images using deep convolutional neural fields. IEEE T. Pattern Analysis and Machine Intelligence, 2015.
- Deep photo style transfer. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
- Slowtrack: Increasing the latency of camera-based perception in autonomous driving using adversarial examples. arXiv preprint arXiv:2312.09520, 2023a.
- Wip: Towards the practicality of the adversarial attack on object tracking in autonomous driving. In ISOC Symposium on Vehicle Security and Privacy (VehicleSec), 2023b.
- Trafficpredict: Trajectory prediction for heterogeneous traffic-agents. Proceedings of the AAAI Conference on Artificial Intelligence, 33:6120–6127, 2019.
- Understanding deep image representations by inverting them. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5188–5196, Los Alamitos, CA, USA, 2015. IEEE Computer Society.
- CoMoGAN: continuous model-guided image-to-image translation. In CVPR, 2021.
- Monocular depth estimation using neural regression forest. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5506–5514, 2016.
- Self-supervised Monocular Depth Estimation: Let’s Talk About The Weather. In The International Conference on Computer Vision (ICCV), 2023.
- Accessorize to a crime: Real and stealthy attacks on state-of-the-art face recognition. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, page 1528–1540, New York, NY, USA, 2016. Association for Computing Machinery.
- Real-time stereo reconstruction in robotically assisted minimally invasive surgery. In Medical Image Computing and Computer-Assisted Intervention – MICCAI 2010, pages 275–282, Berlin, Heidelberg, 2010. Springer Berlin Heidelberg.
- Dta: Physical camouflage attacks using differentiable transformation network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 15305–15314, 2022.
- Active: Towards highly transferable 3d physical camouflage for universal and robust vehicle evasion. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 4305–4314, 2023.
- Fooling automated surveillance cameras: Adversarial patches to attack person detection. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 49–55, Los Alamitos, CA, USA, 2019. IEEE Computer Society.
- Rain rendering for evaluating and improving robustness to bad weather. International Journal of Computer Vision, 2020.
- Fca: Learning a 3d full-coverage vehicle camouflage for multi-view physical adversarial attack. In Proceedings of the AAAI conference on artificial intelligence, pages 2414–2422, 2022.
- Dual attention suppression attack: Generate adversarial camouflage in physical world. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 8561–8570, Los Alamitos, CA, USA, 2021. IEEE Computer Society.
- Self-supervised monocular depth hints. In The International Conference on Computer Vision (ICCV), 2019.
- The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth. In Computer Vision and Pattern Recognition (CVPR), 2021.
- Targeted adversarial perturbations for monocular depth prediction. In Advances in neural information processing systems, 2020.
- Physical adversarial attack on vehicle detector in the carla simulator. ArXiv, abs/2007.16118, 2020.
- Adversarial patch attacks on monocular depth estimation networks. IEEE Access, 8:179094–179104, 2020.
- Quantization aware attack: Enhancing transferable adversarial attacks by model quantization. IEEE Transactions on Information Forensics and Security, 19:3265–3278, 2024.
- Camou: Learning physical vehicle camouflages to adversarially attack detectors in the wild. In International Conference on Learning Representations, 2018.