Hallucination Benchmark in Medical Visual Question Answering (2401.05827v2)
Abstract: The recent success of large language and vision models (LLVMs) on visual question answering (VQA), particularly their applications in medicine (Med-VQA), has shown great potential for realizing effective visual assistants for healthcare. However, these models have not been extensively tested for hallucination in clinical settings. Here, we created a hallucination benchmark of medical images paired with question-answer sets and conducted a comprehensive evaluation of state-of-the-art models. The study provides an in-depth analysis of current models' limitations and reveals the effectiveness of various prompting strategies.