When Radiology Report Generation Meets Knowledge Graph (2002.08277v1)

Published 19 Feb 2020 in cs.CV, cs.AI, cs.LG, and eess.IV

Abstract: Automatic radiology report generation has been an attracting research problem towards computer-aided diagnosis to alleviate the workload of doctors in recent years. Deep learning techniques for natural image captioning are successfully adapted to generating radiology reports. However, radiology image reporting is different from the natural image captioning task in two aspects: 1) the accuracy of positive disease keyword mentions is critical in radiology image reporting in comparison to the equivalent importance of every single word in a natural image caption; 2) the evaluation of reporting quality should focus more on matching the disease keywords and their associated attributes instead of counting the occurrence of N-gram. Based on these concerns, we propose to utilize a pre-constructed graph embedding module (modeled with a graph convolutional neural network) on multiple disease findings to assist the generation of reports in this work. The incorporation of knowledge graph allows for dedicated feature learning for each disease finding and the relationship modeling between them. In addition, we proposed a new evaluation metric for radiology image reporting with the assistance of the same composed graph. Experimental results demonstrate the superior performance of the methods integrated with the proposed graph embedding module on a publicly accessible dataset (IU-RR) of chest radiographs compared with previous approaches using both the conventional evaluation metrics commonly adopted for image captioning and our proposed ones.

View on arXiv

Authors (6)

Yixiao Zhang (44 papers)
Xiaosong Wang (42 papers)
Ziyue Xu (58 papers)
Qihang Yu (44 papers)
Alan Yuille (294 papers)
Daguang Xu (91 papers)

Citations (251)

View on Semantic Scholar

Summary

Overview of "When Radiology Report Generation Meets Knowledge Graph"

The paper "When Radiology Report Generation Meets Knowledge Graph" addresses the critical task of automating radiology report generation, with a specific focus on leveraging a knowledge graph to improve the accuracy and relevance of generated reports. The authors propose a novel method combining deep learning techniques with graph convolutional networks (GCNs) to enhance the report generation process by embedding rich prior knowledge about radiological findings within a structured graph. This approach offers a refined understanding of interrelated findings, which is vital for accurate clinical reports.

Methodology

The proposed methodology involves integrating a pre-constructed knowledge graph into the existing deep learning framework for radiology report generation. This graph is composed of nodes representing multiple disease findings related to chest radiographs, connected based on clinical relevance and anatomical relationships. The core of the method involves:

Graph Embedding with GCNs: Graph convolutional networks are used for propagating features across the graph, allowing the network to capture the interdependencies between different disease findings.
Node Feature Initialization: Initial features for each node are derived using a spatial attention mechanism applied to the convolutional neural network (CNN) outputs, focusing on extracting feature representations specific to each disease finding.
Multi-label Classification and Report Generation: The process is divided into two stages where the model is initially trained for classification to identify multiple clinical findings, followed by training a two-level LSTM decoder for generating coherent and accurate radiology reports.
New Evaluation Metric: The research introduces the Medical Image Report Quality Index (MIRQI), which evaluates the correctness of disease mentions and their attributes in generated reports, emphasizing clinically significant information over traditional N-gram-based metrics.

Experimental Results

The authors validate their approach using the IU-RR dataset of chest radiographs, demonstrating that their model consistently outperforms prior models on both classification and report generation tasks. Quantitatively, the model achieves a 2% improvement in Area Under Curve (AUC) on average for classification tasks, illustrating better discrimination among disease findings. Additionally, the MIRQI metric shows significant improvements in assessing the clinical correctness of generated reports, with attributable enhancements in accurately identifying both positive and negative disease mentions.

Implications and Future Directions

This research offers promising implications for automated medical documentation systems, particularly in radiology, where workload and demand often surpass available resources. By employing a knowledge graph, the model can potentially generalize better across various radiology domains as it inherently treats the relationships among findings in a structured manner.

The paper opens pathways for future exploration, including scaling the approach to encompass more sophisticated graph structures that integrate additional medical knowledge. Such structures could further enhance the granularity of interrelated findings modeling, improving the clinical relevance of automated reports. Moreover, extending similar graph-based methodologies to other medical imaging fields could offer broader applicability and validation of the proposed techniques.

In conclusion, the integration of a knowledge graph within the radiology report generation process presents an innovative approach to improve the clinical accuracy and relevance of automated reports. The introduction of new evaluation metrics aimed at capturing clinical correctness underscores the practical applicability of these advancements in real-world healthcare systems.

PDF Markdown

Related Papers

Find Related Papers