Evaluating ChatGPT for DeepFake Detection in Media Forensics
The paper, "Can ChatGPT Detect DeepFakes? A Study of Using Multimodal LLMs for Media Forensics," explores the potential of employing multimodal LLMs for the detection of DeepFakes, a prevalent concern in the digital era. This work critically examines whether the capabilities of LLMs, specifically those that can process both textual and visual inputs, could be harnessed for media forensics without relying on complex programming.
Methodology and Experimental Design
The researchers focus on ChatGPT, specifically its iterations that can handle multimodal inputs such as images and text. The study centers on identifying AI-generated face images, which are among the earliest and most notorious forms of DeepFakes, produced by models such as GANs and diffusion models. The authors systematically design experiments around the GPT-4V model, assessing its ability to determine whether a given image is AI-generated under various types of prompts.
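To make the setup concrete, the following is a minimal sketch of how such a query could be issued with the OpenAI Python client. The model name, helper name, and prompt wording here are illustrative assumptions, not the paper's exact configuration.

```python
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask_is_ai_generated(image_path: str, prompt: str, model: str = "gpt-4o") -> str:
    """Send one face image plus a text prompt; return the model's free-text reply.
    (Hypothetical helper; the paper's exact querying setup is not published.)"""
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode("utf-8")
    response = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content
```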
A significant aspect of the paper is its exploration of different prompts, ranging from simple binary queries to more context-rich ones. The paper finds that simpler prompts often lead to higher rejection rates from the LLM, so effective DeepFake detection requires more deliberate prompt engineering. This suggests that the inherent capabilities of LLMs, though not originally tailored for media forensics, can still be put to use through carefully designed text prompts.
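The contrast between the two prompt styles might look roughly like the following. The wording is hypothetical and not quoted from the paper; it only illustrates the simple-versus-context-rich distinction the authors study.

```python
# Simple binary query -- in the paper's findings, this style is refused more often:
simple_prompt = "Is this image AI-generated? Answer yes or no."

# Context-rich query -- frames the forensic task and asks for observable evidence:
rich_prompt = (
    "You are assisting a media-forensics analysis. Examine this face image for "
    "semantic inconsistencies typical of synthesized faces, such as asymmetric "
    "pupils, irregular teeth, distorted ears or accessories, and unnatural hair "
    "or background blending. Based only on what you observe, state whether the "
    "image is more likely real or AI-generated, and explain why."
)
```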
Key Findings and Performance Metrics
The paper reports an AUC of approximately 75% when using multimodal LLMs to detect AI-generated face images. This score indicates real, though limited, discriminative ability compared with dedicated detection methods. While the model performs reasonably well at flagging AI-generated images, its accuracy drops significantly on real images: authentic photographs lack the explicit semantic inconsistencies the model relies on to justify a verdict.
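One plausible way to turn the model's free-text replies into an AUC figure is sketched below; the paper does not publish this exact pipeline, so the crude keyword scoring rule and helper names are assumptions.

```python
from sklearn.metrics import roc_auc_score

def response_to_score(reply: str) -> float | None:
    """Map a free-text reply to a score; None marks a rejection/non-answer.
    A deliberately crude keyword heuristic for illustration only."""
    text = reply.lower()
    if "yes" in text or "ai-generated" in text:
        return 1.0   # model says the image is synthetic
    if "real" in text or text.startswith("no"):
        return 0.0   # model says the image is authentic
    return None      # refusal or unusable answer

def evaluate(labels: list[int], replies: list[str]) -> float:
    """AUC over answered queries; labels use 1 = AI-generated, 0 = real."""
    scored = [(y, response_to_score(r)) for y, r in zip(labels, replies)]
    kept = [(y, s) for y, s in scored if s is not None]  # drop rejections
    if not kept:
        raise ValueError("all replies were rejections")
    ys, ss = zip(*kept)
    return roc_auc_score(ys, ss)
```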
The rejection rate of the LLM's responses plays a critical role in its practical utility. The paper emphasizes that effective prompt engineering can notably reduce rejections and improve the accuracy of the model's predictions. Through iterative querying and context-providing prompts, the LLM can be made to discriminate between authentic and AI-generated content more reliably.
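An iterative querying loop, in the spirit of what the paper describes, could escalate from a simple prompt to a richer one whenever the model refuses. This reuses the hypothetical helpers sketched above and is an illustration, not the paper's procedure.

```python
def query_with_retries(image_path: str, prompts: list[str]) -> tuple[str | None, int]:
    """Try prompts from simplest to richest; return the first usable reply
    and the number of rejections encountered along the way."""
    rejections = 0
    for prompt in prompts:
        reply = ask_is_ai_generated(image_path, prompt)
        if response_to_score(reply) is not None:
            return reply, rejections
        rejections += 1
    return None, rejections  # every prompt was refused

reply, num_rejections = query_with_retries("face.jpg", [simple_prompt, rich_prompt])
```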
Implications and Speculative Future Directions
This research demonstrates the prospect of integrating LLMs into media forensics, offering a user-friendly, intuitive interface that requires no deep programming expertise. The paper underscores that while LLMs can be part of the media forensics toolkit, they currently lag behind purpose-built detectors in accuracy. This shortfall stems partly from LLMs' reliance on semantic reasoning rather than the signal-level artifacts that current DeepFake detection tools target directly.
The paper speculates on the future development of LLMs in this domain, suggesting enhancements such as improved prompting strategies, incorporation of feedback mechanisms, and hybrid integration with established signal-processing techniques. These developments would potentially increase the LLMs’ utility in comprehensive media forensic tasks, including video analysis and the identification of text-image mis-contextualization.
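As a thought experiment, such a hybrid could take the form of a simple late-fusion rule that combines the LLM's semantic score with a conventional signal-level detector's probability, falling back to the detector alone when the LLM refuses. This is a hypothetical illustration of the direction the paper suggests, not a method it proposes.

```python
def fused_score(llm_score: float | None, signal_score: float, w: float = 0.5) -> float:
    """Weighted fusion of a semantic (LLM) score and a signal-level detector
    score, both in [0, 1]. Hypothetical rule: the paper names the direction,
    not this formula."""
    if llm_score is None:          # LLM rejected the query
        return signal_score        # rely on the signal-level detector alone
    return w * llm_score + (1 - w) * signal_score
```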
Conclusion
In conclusion, the paper provides a thorough examination of ChatGPT's ability to detect DeepFakes, contributing valuable insights to media forensics. While its current performance does not match specialized detection systems, the promise of LLMs lies in their accessibility and their clear room to improve. The paper encourages further work on multimodal LLMs: refining their precision and integrating them with existing forensic methodologies for more robust media analysis.