Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models (2407.05131v2)

Published 6 Jul 2024 in cs.LG, cs.AI, cs.CL, cs.CV, and cs.CY

Abstract: The recent emergence of Medical Large Vision LLMs (Med-LVLMs) has enhanced medical diagnosis. However, current Med-LVLMs frequently encounter factual issues, often generating responses that do not align with established medical facts. Retrieval-Augmented Generation (RAG), which utilizes external knowledge, can improve the factual accuracy of these models but introduces two major challenges. First, limited retrieved contexts might not cover all necessary information, while excessive retrieval can introduce irrelevant and inaccurate references, interfering with the model's generation. Second, in cases where the model originally responds correctly, applying RAG can lead to an over-reliance on retrieved contexts, resulting in incorrect answers. To address these issues, we propose RULE, which consists of two components. First, we introduce a provably effective strategy for controlling factuality risk through the calibrated selection of the number of retrieved contexts. Second, based on samples where over-reliance on retrieved contexts led to errors, we curate a preference dataset to fine-tune the model, balancing its dependence on inherent knowledge and retrieved contexts for generation. We demonstrate the effectiveness of RULE on medical VQA and report generation tasks across three datasets, achieving an average improvement of 47.4% in factual accuracy. We publicly release our benchmark and code in https://github.com/richard-peng-xia/RULE.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Peng Xia (25 papers)
  2. Kangyu Zhu (5 papers)
  3. Haoran Li (166 papers)
  4. Hongtu Zhu (81 papers)
  5. Yun Li (154 papers)
  6. Gang Li (579 papers)
  7. Linjun Zhang (70 papers)
  8. Huaxiu Yao (103 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.