Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective (2404.19287v3)

Published 30 Apr 2024 in cs.CV

Abstract: Pretrained vision-language models (VLMs) like CLIP exhibit exceptional generalization across diverse downstream tasks. While recent studies reveal their vulnerability to adversarial attacks, research to date has primarily focused on enhancing the robustness of image encoders against image-based attacks, with defenses against text-based and multimodal attacks remaining largely unexplored. To this end, this work presents the first comprehensive study on improving the adversarial robustness of VLMs against attacks targeting image, text, and multimodal inputs. This is achieved by proposing multimodal contrastive adversarial training (MMCoA). Such an approach strengthens the robustness of both image and text encoders by aligning the clean text embeddings with adversarial image embeddings, and adversarial text embeddings with clean image embeddings. The robustness of the proposed MMCoA is examined against existing defense methods over image, text, and multimodal attacks on the CLIP model. Extensive experiments on 15 datasets across two tasks reveal the characteristics of different adversarial defense methods under distinct distribution shifts and dataset complexities across the three attack types. This paves the way for a unified framework of adversarial robustness against different modality attacks, opening up new possibilities for securing VLMs against multimodal attacks. The code is available at https://github.com/ElleZWQ/MMCoA.git.
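To make the training objective concrete, below is a minimal, hypothetical PyTorch sketch of the cross-modal alignment described in the abstract. It is not the authors' released implementation (see the linked repository for that); the function names, the symmetric InfoNCE form of the contrastive loss, and the assumption that adversarial images and texts are generated upstream (e.g., by PGD on pixels and word substitution on tokens) are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(img_emb, txt_emb, temperature=0.07):
    # Symmetric InfoNCE over a batch of paired image/text embeddings.
    img_emb = F.normalize(img_emb, dim=-1)
    txt_emb = F.normalize(txt_emb, dim=-1)
    logits = img_emb @ txt_emb.t() / temperature              # (B, B) cosine similarities
    targets = torch.arange(img_emb.size(0), device=img_emb.device)
    return 0.5 * (F.cross_entropy(logits, targets)            # image -> text direction
                  + F.cross_entropy(logits.t(), targets))     # text -> image direction

def mmcoa_style_loss(image_encoder, text_encoder,
                     images, adv_images, texts, adv_texts):
    # Hypothetical sketch of the MMCoA idea: pull adversarial image embeddings
    # toward their clean text embeddings, and adversarial text embeddings toward
    # their clean image embeddings, so both encoders are trained to be robust.
    img_clean = image_encoder(images)        # clean image embeddings
    img_adv   = image_encoder(adv_images)    # embeddings of attacked images
    txt_clean = text_encoder(texts)          # clean text embeddings
    txt_adv   = text_encoder(adv_texts)      # embeddings of attacked texts

    loss_image_branch = contrastive_loss(img_adv, txt_clean)  # robustness of image encoder
    loss_text_branch  = contrastive_loss(img_clean, txt_adv)  # robustness of text encoder
    return loss_image_branch + loss_text_branch
```

In use, `image_encoder` and `text_encoder` would be the CLIP encoders being fine-tuned, and `adv_images` / `adv_texts` would be regenerated from the current model at each step, as is standard in adversarial training; the exact loss weighting and attack settings follow the paper and are not reproduced here.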

Authors (5)
  1. Wanqi Zhou
  2. Shuanghao Bai
  3. Qibin Zhao
  4. Badong Chen
  5. Danilo P. Mandic