Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models (2310.05338v2)
Abstract: Object hallucination poses a significant challenge in vision-language (VL) models, often leading to the generation of nonsensical or unfaithful responses that mention non-existent objects. However, the absence of a general measurement for evaluating object hallucination in VL models has hindered our understanding of this issue and our ability to mitigate it. In this work, we present NOPE (Negative Object Presence Evaluation), a novel benchmark designed to assess object hallucination in VL models through visual question answering (VQA). We propose a cost-effective and scalable approach that uses LLMs to generate 29.5k high-quality synthetic negative-pronoun (NegP) examples for NOPE. We extensively investigate the ability of 10 state-of-the-art VL models to discern the non-existence of objects in visual questions, where the ground-truth answer is a NegP (e.g., "none"). Additionally, we evaluate their standard performance on visual questions from 9 other VQA datasets. Through our experiments, we demonstrate that no VL model is immune to object hallucination, as all models achieve accuracy below 10% on NegP questions. Furthermore, we find that lexically diverse visual questions, question types with large scopes, and scene-relevant objects heighten the risk of object hallucination in VL models.
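The evaluation the abstract describes (counting a model's answer as correct only when it expresses object non-existence via a negative pronoun such as "none") can be sketched as follows. This is a minimal illustration, not the paper's exact protocol: the `NEGP` answer set and the normalization rules are assumptions for demonstration.

```python
# Illustrative sketch of NegP accuracy scoring: a prediction counts as
# correct only if it answers with a negative pronoun (e.g., "none").
# The NEGP set and normalization below are assumptions, not the paper's
# exact matching rules.

NEGP = {"none", "nothing", "nobody", "no one", "nowhere", "neither"}

def normalize(answer: str) -> str:
    """Lowercase and strip surrounding whitespace and trailing punctuation."""
    return answer.lower().strip().rstrip(".!?")

def negp_accuracy(predictions):
    """Fraction of predictions that correctly answer with a negative pronoun."""
    if not predictions:
        return 0.0
    hits = sum(1 for p in predictions if normalize(p) in NEGP)
    return hits / len(predictions)

preds = ["None.", "a red umbrella", "nothing", "two dogs"]
print(negp_accuracy(preds))  # 0.5
```

Under a scorer like this, a model that hallucinates a plausible object ("a red umbrella") for a question about a non-existent one is marked wrong, which is how sub-10% NegP accuracy can coexist with strong performance on standard VQA benchmarks.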