A New 2.5D Representation for Lymph Node Detection using Random Sets of Deep Convolutional Neural Network Observations (1406.2639v1)

Published 6 Jun 2014 in cs.CV, cs.LG, and cs.NE

Abstract: Automated Lymph Node (LN) detection is an important clinical diagnostic task but very challenging due to the low contrast of surrounding structures in Computed Tomography (CT) and to their varying sizes, poses, shapes and sparsely distributed locations. State-of-the-art studies show the performance range of 52.9% sensitivity at 3.1 false-positives per volume (FP/vol.), or 60.9% at 6.1 FP/vol. for mediastinal LN, by one-shot boosting on 3D HAAR features. In this paper, we first operate a preliminary candidate generation stage, towards 100% sensitivity at the cost of high FP levels (40 per patient), to harvest volumes of interest (VOI). Our 2.5D approach consequently decomposes any 3D VOI by resampling 2D reformatted orthogonal views N times, via scale, random translations, and rotations with respect to the VOI centroid coordinates. These random views are then used to train a deep Convolutional Neural Network (CNN) classifier. In testing, the CNN is employed to assign LN probabilities for all N random views that can be simply averaged (as a set) to compute the final classification probability per VOI. We validate the approach on two datasets: 90 CT volumes with 388 mediastinal LNs and 86 patients with 595 abdominal LNs. We achieve sensitivities of 70%/83% at 3 FP/vol. and 84%/90% at 6 FP/vol. in mediastinum and abdomen respectively, which drastically improves over the previous state-of-the-art work.

Citations (530)

View on Semantic Scholar

Summary

The paper introduces a novel 2.5D CNN approach that decomposes CT volumes into orthogonal views for efficient lymph node detection.
It employs random perturbations in scale, translation, and rotation to generate multiple candidate views, significantly enhancing detection sensitivity.
The method achieved sensitivities of 70–83% at 3 FP/vol and up to 90% at 6 FP/vol, marking a notable improvement over previous techniques.

A New 2.5D Representation for Lymph Node Detection using Random Sets of Deep Convolutional Neural Network Observations

The paper by H. R. Roth et al. presents a novel approach in the domain of automated lymph node (LN) detection using a 2.5D representation. The impetus for this work arises from the challenge of identifying LNs in computed tomography (CT) images, where the task is complicated by LNs' low contrast and variability in size, shape, and distribution.

Methodology

The authors introduce a two-stage framework. Initially, they employ a candidate generation step aiming for near-complete sensitivity in detecting LNs, albeit with a high false-positive (FP) count. To refine this, they propose a 2.5D decompositional strategy. This involves resampling volumes of interest (VOIs) into orthogonal views that are randomly perturbed in terms of scale, translation, and rotation. These views are processed through a deep convolutional neural network (CNN) designed to generate probabilistic classifications of LN presence.

Key to this methodology is the use of 3-channel images analogous to RGB color channels in conventional vision tasks. The axial, coronal, and sagittal views within each VOI are aligned as these channels, facilitating efficient processing within common CNN architectures. This innovation sidesteps the computational burdens typically associated with direct 3D CNNs, while maintaining the dimensional structure necessary for effective LN classification.

Results

The technique was evaluated on two distinct datasets, comprising 90 CT volumes for mediastinal LNs and 86 for abdominal LNs. The results were compelling, with the method yielding sensitivities of 70% and 83% at 3 FP/vol for mediastinal and abdominal LNs, respectively, improving to 84% and 90% at 6 FP/vol. These figures mark a significant advancement over prior benchmarks in the field.

Implications and Future Work

This research offers practical solutions for improving the accuracy and efficiency of LN detection systems, which have vital implications for diagnostic radiology, especially in oncology and inflammation assessments. The adoption of a 2.5D approach not only leverages the strengths of CNNs in image classification but also paves the way for extending similar methodologies to other 3D object recognition tasks within medical imaging.

The paper also underscores the importance of extensive and varied datasets to enhance the generalizability of CNN models. Future investigations might focus on enriching training datasets or optimizing networks to better handle the complexities inherent in medical images. Additionally, the implementation of advanced fusion techniques for CNN predictions could further refine detection performance.

In conclusion, the introduction of a 2.5D representation for LN detection presents a robust pathway for reducing false positives without compromising sensitivity. This paper reflects significant progress in the evolution of computer-aided detection systems, with potential applications extending beyond the immediate scope of LN detection.

PDF Markdown