CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks (2302.02213v3)
Abstract: While neural networks allow highly accurate predictions in many tasks, their lack of robustness towards even slight input perturbations often hampers their deployment. Adversarial attacks such as the seminal projected gradient descent (PGD) offer an effective means to evaluate a model's robustness and dedicated solutions have been proposed for attacks on semantic segmentation or optical flow estimation. While they attempt to increase the attack's efficiency, a further objective is to balance its effect, so that it acts on the entire image domain instead of isolated point-wise predictions. This often comes at the cost of optimization stability and thus efficiency. Here, we propose CosPGD, an attack that encourages more balanced errors over the entire image domain while increasing the attack's overall efficiency. To this end, CosPGD leverages a simple alignment score computed from any pixel-wise prediction and its target to scale the loss in a smooth and fully differentiable way. It leads to efficient evaluations of a model's robustness for semantic segmentation as well as regression models (such as optical flow, disparity estimation, or image restoration), and it allows it to outperform the previous SotA attack on semantic segmentation. We provide code for the CosPGD algorithm and example usage at https://github.com/shashankskagnihotri/cospgd.
- A high-quality denoising dataset for smartphone cameras. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1692–1700, 2018. doi: 10.1109/CVPR.2018.00182.
- Square attack: a query-efficient black-box adversarial attack via random search. 2020.
- On the robustness of semantic segmentation models to adversarial attacks, 2017. URL https://arxiv.org/abs/1711.09856.
- Adversarial patch, 2017. URL https://arxiv.org/abs/1712.09665.
- Analysis of explainers of black box deep neural networks for computer vision: A survey, 2019. URL https://arxiv.org/abs/1911.12116.
- A naturalistic open source movie for optical flow evaluation. In A. Fitzgibbon et al. (Eds.), editor, European Conf. on Computer Vision (ECCV), Part IV, LNCS 7577, pages 611–625. Springer-Verlag, October 2012.
- Towards evaluating the robustness of neural networks. In 2017 ieee symposium on security and privacy (sp), pages 39–57. IEEE, 2017.
- Rethinking atrous convolution for semantic image segmentation, 2017.
- Simple baselines for image restoration, 2022.
- The cityscapes dataset for semantic urban scene understanding, 2016.
- Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In ICML, 2020.
- Mind the box: l1subscript𝑙1l_{1}italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-apgd for sparse adversarial attacks on image classifiers. In ICML, 2021.
- Robustbench: a standardized adversarial robustness benchmark. CoRR, abs/2010.09670, 2020. URL https://arxiv.org/abs/2010.09670.
- Robustbench: a standardized adversarial robustness benchmark. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021. URL https://openreview.net/forum?id=SSKZPJCt7B.
- Flownet: Learning optical flow with convolutional networks. In IEEE International Conference on Computer Vision (ICCV), 2015. URL http://lmb.informatik.uni-freiburg.de/Publications/2015/DFIB15.
- The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results. http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html.
- Flownet: Learning optical flow with convolutional networks, 2015. URL https://arxiv.org/abs/1504.06852.
- Generating targeted adversarial attacks and assessing their effectiveness in fooling deep neural networks. In 2022 IEEE International Conference on Signal Processing and Communications (SPCOM), pages 1–5, 2022. doi: 10.1109/SPCOM55316.2022.9840784.
- Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness, 2018. URL https://arxiv.org/abs/1811.12231.
- Shortcut learning in deep neural networks. Nature Machine Intelligence, 2(11):665–673, nov 2020. doi: 10.1038/s42256-020-00257-z. URL https://doi.org/10.1038%2Fs42256-020-00257-z.
- Explaining and harnessing adversarial examples, 2014. URL https://arxiv.org/abs/1412.6572.
- Segpgd: An effective and efficient adversarial attack for evaluating and boosting segmentation robustness. 2022a. doi: 10.48550/ARXIV.2207.12391. URL https://arxiv.org/abs/2207.12391.
- Segpgd: An effective and efficient adversarial attack for evaluating and boosting segmentation robustness. page 5, 2022b. doi: 10.48550/ARXIV.2207.12391. URL https://arxiv.org/abs/2207.12391.
- Semantic contours from inverse detectors. In International Conference on Computer Vision (ICCV), 2011.
- Hypercolumns for object segmentation and fine-grained localization. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 447–456, 2015. doi: 10.1109/CVPR.2015.7298642.
- Deep residual learning for image recognition, 2015. URL https://arxiv.org/abs/1512.03385.
- Benchmarking neural network robustness to common corruptions and perturbations, 2019. URL https://arxiv.org/abs/1903.12261.
- Natural adversarial examples, 2019. URL https://arxiv.org/abs/1907.07174.
- Flownet 2.0: Evolution of optical flow estimation with deep networks, 2016. URL https://arxiv.org/abs/1612.01925.
- Black-box adversarial attacks with limited queries and information. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, July 2018. URL https://arxiv.org/abs/1804.08598.
- Adversarial example generation with syntactically controlled paraphrase networks. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1875–1885, New Orleans, Louisiana, June 2018. Association for Computational Linguistics. doi: 10.18653/v1/N18-1170. URL https://aclanthology.org/N18-1170.
- Testing robustness against unforeseen adversaries, 2019. URL https://arxiv.org/abs/1908.08016.
- Adversarial attacks for image segmentation on multiple lightweight models. IEEE Access, 8:31359–31370, 2020. doi: 10.1109/ACCESS.2020.2973069.
- Imagenet classification with deep convolutional neural networks. In F. Pereira, C.J. Burges, L. Bottou, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 25. Curran Associates, Inc., 2012. URL https://proceedings.neurips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf.
- Adversarial examples in the physical world, 2016. URL https://arxiv.org/abs/1607.02533.
- Adversarial machine learning at scale, 2017.
- Revisiting stereo depth estimation from a sequence-to-sequence perspective with transformers. 2020. doi: 10.48550/ARXIV.2011.02910. URL https://arxiv.org/abs/2011.02910.
- A convnet for the 2020s, 2022. URL https://arxiv.org/abs/2201.03545.
- Towards deep learning models resistant to adversarial attacks, 2017. URL https://arxiv.org/abs/1706.06083.
- A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016. URL http://lmb.informatik.uni-freiburg.de/Publications/2016/MIFDB16. arXiv:1512.02134.
- Object scene flow for autonomous vehicles. In Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
- Deepfool: a simple and accurate method to fool deep neural networks, 2015. URL https://arxiv.org/abs/1511.04599.
- Textattack: A framework for adversarial attacks, data augmentation, and adversarial training in nlp, 2020. URL https://arxiv.org/abs/2005.05909.
- Deep multi-scale convolutional neural network for dynamic scene deblurring. In CVPR, July 2017.
- Semantically equivalent adversarial rules for debugging NLP models. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 856–865, Melbourne, Australia, July 2018. Association for Computational Linguistics. doi: 10.18653/v1/P18-1079. URL https://aclanthology.org/P18-1079.
- U-net: Convolutional networks for biomedical image segmentation, 2015. URL https://arxiv.org/abs/1505.04597.
- Decoupling direction and norm for efficient gradient-based l2 adversarial attacks and defenses, 2019.
- Proximal splitting adversarial attacks for semantic segmentation, 2023.
- A perturbation-constrained adversarial attack for evaluating the robustness of optical flow, 2022. URL https://arxiv.org/abs/2203.13214.
- Towards understanding adversarial robustness of optical flow networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8916–8924, 2022.
- Local aggressive adversarial attacks on 3d point cloud, 2021. URL https://arxiv.org/abs/2105.09090.
- Raft: Recurrent all-pairs field transforms for optical flow, 2020. URL https://arxiv.org/abs/2003.12039.
- Berkay Mayalı username: mberkay0. mberkay0/pretrained-backbones-unet. https://github.com/mberkay0/pretrained-backbones-unet, 2023.
- Multiclass asma vs targeted pgd attack in image segmentation, 2022. URL https://arxiv.org/abs/2208.01844.
- Better diffusion models further improve adversarial training, 2023.
- Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4):600–612, 2004. doi: 10.1109/TIP.2003.819861.
- Targeted adversarial perturbations for monocular depth prediction. In Advances in neural information processing systems, 2020a.
- Fast is better than free: Revisiting adversarial training, 2020b. URL https://arxiv.org/abs/2001.03994.
- Lessons and insights from creating a synthetic optical flow benchmark. In A. Fusiello et al. (Eds.), editor, ECCV Workshop on Unsolved Problems in Optical Flow and Stereo Estimation, Part II, LNCS 7584, pages 168–177. Springer-Verlag, October 2012.
- Aggregated residual transformations for deep neural networks, 2016. URL https://arxiv.org/abs/1611.05431.
- Restormer: Efficient transformer for high-resolution image restoration. In CVPR, 2022.
- 3d adversarial attacks beyond point cloud, 2021. URL https://arxiv.org/abs/2104.12146.
- Hengshuang Zhao. semseg. https://github.com/hszhao/semseg, 2019.
- Pyramid scene parsing network. In CVPR, 2017.