Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization (2402.11940v4)

Published 19 Feb 2024 in cs.CV, cs.CR, and cs.LG

Abstract: Recent advances in deep learning research have shown remarkable achievements across many tasks in computer vision (CV) and NLP. At the intersection of CV and NLP is the problem of image captioning, where the related models' robustness against adversarial attacks has not been well studied. This paper presents a novel adversarial attack strategy, AICAttack (Attention-based Image Captioning Attack), designed to attack image captioning models through subtle perturbations on images. Operating within a black-box attack scenario, our algorithm requires no access to the target model's architecture, parameters, or gradient information. We introduce an attention-based candidate selection mechanism that identifies the optimal pixels to attack, followed by a customised differential evolution method to optimise the perturbations of pixels' RGB values. We demonstrate AICAttack's effectiveness through extensive experiments on benchmark datasets against multiple victim models. The experimental results demonstrate that our method outperforms current leading-edge techniques by achieving consistently higher attack success rates.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jiyao Li (5 papers)
  2. Mingze Ni (8 papers)
  3. Yifei Dong (20 papers)
  4. Tianqing Zhu (85 papers)
  5. Wei Liu (1135 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.