Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report Generation (2307.09758v4)
Abstract: Radiologists face high burnout rates, partially due to the increasing volume of Chest X-rays (CXRs) requiring interpretation and reporting. Automated CXR report generation holds promise for reducing this burden and improving patient care. While current models show potential, their diagnostic accuracy is limited. Our proposed CXR report generator integrates elements of the radiologist workflow and introduces a novel reward for reinforcement learning. Our approach leverages longitudinal data from a patient's prior CXR study and effectively handles cases where no prior study exist, thus mirroring the radiologist's workflow. In contrast, existing models typically lack this flexibility, often requiring prior studies for the model to function optimally. Our approach also incorporates all CXRs from a patient's study and distinguishes between report sections through section embeddings. Our reward for reinforcement learning leverages CXR-BERT, which forces our model to learn the clinical semantics of radiology reporting. We conduct experiments on publicly available datasets -- MIMIC-CXR and Open-i IU X-ray -- with metrics shown to more closely correlate with radiologists' assessment of reporting. Results from our study demonstrate that the proposed model generates reports that are more aligned with radiologists' reports than state-of-the-art models, such as those utilising LLMs, reinforcement learning, and multi-task learning. The proposed model improves the diagnostic accuracy of CXR report generation, which could one day reduce radiologists' workload and enhance patient care. Our Hugging Face checkpoint (https://huggingface.co/aehrc/cxrmate) and code (https://github.com/aehrc/cxrmate) are publicly available.
- Understanding and Appreciating Burnout in Radiologists. RadioGraphics 42, E137–E139.
- Making the Most of Text Semantics to Improve Biomedical Vision–Language Processing, in: ECCV, pp. 1–21.
- Cross-modal Memory Networks for Radiology Report Generation, in: IJCNLP, pp. 5904–5914.
- Generating Radiology Reports via Memory-driven Transformer, in: EMNLP, pp. 1439–1449.
- Controllable Chest X-Ray Report Generation from Longitudinal Representations, in: Findings of the Association for Computational Linguistics: EMNLP, Singapore. pp. 4891–4904.
- Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards, in: EMNLP, pp. 4348–4360.
- Preparing a collection of radiology examinations for distribution and retrieval. Journal of the American Medical Informatics Association 23, 304–310.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, in: NAACL, pp. 4171–4186.
- The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19: A Geometric Transformation Perspective. Frontiers in Medicine 8.
- Lateral Chest X-Ray for Physicians. Journal of the Royal Society of Medicine 98, 310–312.
- LoRA: Low-rank adaptation of large language models, in: ICLR.
- CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison, in: AAAI, pp. 590–597.
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports, in: NeurIPS Datasets and Benchmarks Track (Round 1).
- MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs. PhysioNet.
- MIMIC-III, a freely accessible critical care database. Scientific Data 3, 160035.
- Chest radiographs and machine learning – Past, present and future. Journal of Medical Imaging and Radiation Oncology 65, 538–544.
- The chest radiograph. The Ulster Medical Journal 81, 143–148.
- UniXGen: A Unified Vision-Language Model for Multi-View Chest X-ray Generation and Report Generation. ArXiv:2302.12172 [cs, eess].
- LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation, in: ICLR.
- LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day, in: NeurIPS Datasets and Benchmarks Track.
- Automatic evaluation of summaries using N-gram co-occurrence statistics, in: NAACL, pp. 71–78.
- Clinically accurate chest X-ray report generation, in: MLHC, pp. 249–269.
- Decoupled Weight Decay Regularization, in: ICLR.
- Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation, in: NAACL, pp. 5288–5304.
- Med-Flamingo: a Multimodal Medical Few-shot Learner. ArXiv:2307.15189 [cs].
- Uncertainty-aware report generation for chest X-rays by variational topic inference. Medical Image Analysis 82, 102603.
- Improving chest X-ray report generation by leveraging warm starting. Artificial Intelligence in Medicine 144, 102633.
- BLEU: a method for automatic evaluation of machine translation, in: ACL, p. 311.
- Self-Critical Sequence Training for Image Captioning, in: CVPR, pp. 1179–1195.
- Grand Challenges in Radiology. Frontiers in Radiology 1.
- Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT, in: EMNLP, pp. 1500–1519.
- XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models. ArXiv:2306.07971 [cs].
- Llama 2: Open Foundation and Fine-Tuned Chat Models. ArXiv:2307.09288 [cs].
- CIDEr: Consensus-Based Image Description Evaluation, in: CVPR, pp. 4566–4575.
- Neural machine translation with byte-level subwords, in: AAAI, pp. 9154–9160.
- A Learning Algorithm for Continually Running Fully Recurrent Neural Networks. Neural Computation 1, 270–280.
- CvT: Introducing Convolutions to Vision Transformers, in: ICCV, pp. 22–31.
- DeltaNet: Conditional Medical Report Generation for COVID-19 Diagnosis, in: ICCL, pp. 2952–2961.
- Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation, in: EMNLP, pp. 4009–4015.
- MedXChat: Bridging CXR Modalities with a Unified Multimodal Large Model. ArXiv:2312.02233 [cs].
- Evaluating progress in automatic chest X-ray radiology report generation. Patterns , 100802.
- BERTScore: Evaluating Text Generation with BERT, in: ICLR.
- Utilizing Longitudinal Chest X-Rays and Reports to Pre-fill Radiology Reports, in: MICCAI. volume 14224, pp. 189–198.
- Aaron Nicolson (13 papers)
- Jason Dowling (9 papers)
- Bevan Koopman (37 papers)