Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs (2406.08772v2)

Published 13 Jun 2024 in cs.CV and cs.CL

Abstract: Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist. The lack of a benchmark for mixed-source misinformation has hindered progress in this field. To address this, we introduce MMFakeBench, the first comprehensive benchmark for mixed-source MMD. MMFakeBench includes 3 critical sources: textual veracity distortion, visual veracity distortion, and cross-modal consistency distortion, along with 12 sub-categories of misinformation forgery types. We further conduct an extensive evaluation of 6 prevalent detection methods and 15 large vision-LLMs (LVLMs) on MMFakeBench under a zero-shot setting. The results indicate that current methods struggle under this challenging and realistic mixed-source MMD setting. Additionally, we propose an innovative unified framework, which integrates rationales, actions, and tool-use capabilities of LVLM agents, significantly enhancing accuracy and generalization. We believe this study will catalyze future research into more realistic mixed-source multimodal misinformation and provide a fair evaluation of misinformation detection methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Xuannan Liu (17 papers)
  2. Zekun Li (73 papers)
  3. Peipei Li (29 papers)
  4. Shuhan Xia (3 papers)
  5. Xing Cui (13 papers)
  6. Linzhi Huang (7 papers)
  7. Huaibo Huang (58 papers)
  8. Weihong Deng (71 papers)
  9. Zhaofeng He (31 papers)
Citations (7)