Multilingual Natural Language Processing Model for Radiology Reports -- The Summary is all you need! (2310.00100v4)
Abstract: The impression section of a radiology report summarizes important radiology findings and plays a critical role in communicating these findings to physicians. However, the preparation of these summaries is time-consuming and error-prone for radiologists. Recently, numerous models for radiology report summarization have been developed. Nevertheless, there is currently no model that can summarize these reports in multiple languages. Such a model could greatly improve future research and the development of Deep Learning models that incorporate data from patients with different ethnic backgrounds. In this study, the generation of radiology impressions in different languages was automated by fine-tuning a model, publicly available, based on a multilingual text-to-text Transformer to summarize findings available in English, Portuguese, and German radiology reports. In a blind test, two board-certified radiologists indicated that for at least 70% of the system-generated summaries, the quality matched or exceeded the corresponding human-written summaries, suggesting substantial clinical reliability. Furthermore, this study showed that the multilingual model outperformed other models that specialized in summarizing radiology reports in only one language, as well as models that were not specifically designed for summarizing radiology reports, such as ChatGPT.
- The radiology report as seen by radiologists and referring clinicians: results of the cover and rover surveys. Radiology, 259(1):184–195, 2011.
- Chestxraybert: A pretrained language model for chest radiology report summarization. IEEE Transactions on Multimedia, 2021.
- Fast abstractive summarization with reinforce-selected sentence rewriting. arXiv preprint arXiv:1805.11080, 2018.
- Preparing a collection of radiology examinations for distribution and retrieval. Journal of the American Medical Informatics Association, 23(2):304–310, 2016.
- Medical image registration. Physics in medicine & biology, 46(3):R1, 2001.
- Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Scientific data, 6(1):317, 2019.
- Toward best practices in radiology reporting. Radiology, 252(3):852–856, 2009.
- Chatgpt in healthcare: A taxonomy and systematic review. medRxiv, pages 2023–03, 2023.
- Fine-tuning bert models for summarizing german radiology findings. In Proceedings of the 4th Clinical Natural Language Processing Workshop, pages 30–40, 2022.
- Chin-Yew Lin. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain, July 2004. Association for Computational Linguistics. URL https://www.aclweb.org/anthology/W04-1013.
- Stepwise extractive summarization and planning with structured transformers. arXiv preprint arXiv:2010.02744, 2020.
- Colin Raffel nconstant google. Experimental t5 pre-trained model checkpoints. URL https://github.com/google-research/text-to-text-transfer-transformer/blob/main/released_checkpoints.md#t511.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 21(1):5485–5551, 2020.
- Richard J Roberts. Pubmed central: The genbank of the published literature, 2001.
- Let’s have a chat! a conversation with chatgpt: Technology, applications, and limitations. arXiv preprint arXiv:2302.13817, 2023.
- Noam Shazeer. Glu variants improve transformer. arXiv preprint arXiv:2002.05202, 2020.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- Pointer networks. Advances in neural information processing systems, 28, 2015.
- Improving radiologic communication in oncology: a single-centre experience with structured reporting for cancer patients. Insights into Imaging, 11:1–11, 2020.
- mt5: A massively multilingual pre-trained text-to-text transformer. arXiv preprint arXiv:2010.11934, 2020.
- Bidirectional long short-term memory networks for relation classification. In Proceedings of the 29th Pacific Asia conference on language, information and computation, pages 73–78, 2015.
- Learning to summarize radiology findings. arXiv preprint arXiv:1809.04698, 2018.
- Mariana Lindo (2 papers)
- Ana Sofia Santos (3 papers)
- André Ferreira (13 papers)
- Jianning Li (31 papers)
- Gijs Luijten (10 papers)
- Gustavo Correia (3 papers)
- Moon Kim (16 papers)
- Benedikt Michael Schaarschmidt (1 paper)
- Cornelius Deuschl (3 papers)
- Johannes Haubold (9 papers)
- Jens Kleesiek (80 papers)
- Jan Egger (94 papers)
- Victor Alves (26 papers)