MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models (2411.11362v1)

Published 18 Nov 2024 in cs.CV and cs.CL

Abstract: There is growing interest in applying AI to radiology report generation, particularly for chest X-rays (CXRs). This paper investigates whether incorporating pixel-level information through segmentation masks can improve fine-grained image interpretation of multimodal LLMs (MLLMs) for radiology report generation. We introduce MAIRA-Seg, a segmentation-aware MLLM framework designed to utilize semantic segmentation masks alongside CXRs for generating radiology reports. We train expert segmentation models to obtain mask pseudolabels for radiology-specific structures in CXRs. Subsequently, building on the architectures of MAIRA, a CXR-specialised model for report generation, we integrate a trainable segmentation tokens extractor that leverages these mask pseudolabels, and employ mask-aware prompting to generate draft radiology reports. Our experiments on the publicly available MIMIC-CXR dataset show that MAIRA-Seg outperforms non-segmentation baselines. We also investigate set-of-marks prompting with MAIRA and find that MAIRA-Seg consistently demonstrates comparable or superior performance. The results confirm that using segmentation masks enhances the nuanced reasoning of MLLMs, potentially contributing to better clinical outcomes.
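
The abstract does not say how the segmentation tokens extractor turns a mask into tokens. As a rough illustration only, the sketch below assumes one simple possibility: averaging the image-encoder patch features that fall inside a binary mask to obtain a single feature vector per structure. The function name `masked_average_pool` and the list-based feature layout are hypothetical and are not taken from the paper.

```python
from typing import List

Vector = List[float]


def masked_average_pool(features: List[List[Vector]], mask: List[List[int]]) -> Vector:
    """Average the feature vectors at grid positions where the mask is non-zero.

    features: H x W grid of D-dimensional patch features (assumed layout).
    mask:     H x W grid of 0/1 values, already resized to the feature grid.
    Returns a D-dimensional vector; all zeros if the mask is empty.
    """
    dim = len(features[0][0])
    total = [0.0] * dim
    count = 0
    for feature_row, mask_row in zip(features, mask):
        for vector, inside in zip(feature_row, mask_row):
            if inside:
                count += 1
                for i, value in enumerate(vector):
                    total[i] += value
    if count == 0:
        # No pixels selected: return the zero vector rather than dividing by zero.
        return total
    return [t / count for t in total]


# Toy 2 x 2 feature grid with 2-dimensional features (illustrative values only).
toy_features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[5.0, 6.0], [7.0, 8.0]],
]
toy_mask = [
    [1, 0],
    [0, 1],
]
pooled = masked_average_pool(toy_features, toy_mask)
```

In a setup like this, one pooled vector per segmented structure could be projected into the LLM embedding space and referenced from the prompt. Whether MAIRA-Seg pools, resamples, or uses a learned extractor of another kind is not stated in the abstract.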

Authors (16)
  1. Harshita Sharma (13 papers)
  2. Valentina Salvatelli (19 papers)
  3. Shaury Srivastav (5 papers)
  4. Kenza Bouzid (9 papers)
  5. Shruthi Bannur (15 papers)
  6. Daniel C. Castro (28 papers)
  7. Maximilian Ilse (11 papers)
  8. Sam Bond-Taylor (10 papers)
  9. Mercy Prasanna Ranjit (1 paper)
  10. Fabian Falck (20 papers)
  11. Fernando Pérez-García (16 papers)
  12. Anton Schwaighofer (13 papers)
  13. Hannah Richardson (5 papers)
  14. Maria Teodora Wetscherek (6 papers)
  15. Stephanie L. Hyland (20 papers)
  16. Javier Alvarez-Valle (19 papers)