Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Raising the Bar of AI-generated Image Detection with CLIP (2312.00195v2)

Published 30 Nov 2023 in cs.CV

Abstract: The aim of this work is to explore the potential of pre-trained vision-LLMs (VLMs) for universal detection of AI-generated images. We develop a lightweight detection strategy based on CLIP features and study its performance in a wide variety of challenging scenarios. We find that, contrary to previous beliefs, it is neither necessary nor convenient to use a large domain-specific dataset for training. On the contrary, by using only a handful of example images from a single generative model, a CLIP-based detector exhibits surprising generalization ability and high robustness across different architectures, including recent commercial tools such as Dalle-3, Midjourney v5, and Firefly. We match the state-of-the-art (SoTA) on in-distribution data and significantly improve upon it in terms of generalization to out-of-distribution data (+6% AUC) and robustness to impaired/laundered data (+13%). Our project is available at https://grip-unina.github.io/ClipBased-SyntheticImageDetection/

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Davide Cozzolino (36 papers)
  2. Giovanni Poggi (29 papers)
  3. Riccardo Corvi (6 papers)
  4. Matthias Nießner (177 papers)
  5. Luisa Verdoliva (51 papers)
Citations (42)
Github Logo Streamline Icon: https://streamlinehq.com