Segmentation Quality and Volumetric Accuracy in Medical Imaging (2404.17742v2)
Abstract: Current medical image segmentation relies on the region-based (Dice, F1-score) and boundary-based (Hausdorff distance, surface distance) metrics as the de-facto standard. While these metrics are widely used, they lack a unified interpretation, particularly regarding volume agreement. Clinicians often lack clear benchmarks to gauge the "goodness" of segmentation results based on these metrics. Recognizing the clinical relevance of volumetry, we utilize relative volume prediction error (vpe) to directly assess the accuracy of volume predictions derived from segmentation tasks. Our work integrates theoretical analysis and empirical validation across diverse datasets. We delve into the often-ambiguous relationship between segmentation quality (measured by Dice) and volumetric accuracy in clinical practice. Our findings highlight the critical role of incorporating volumetric prediction accuracy into segmentation evaluation. This approach empowers clinicians with a more nuanced understanding of segmentation performance, ultimately improving the interpretation and utility of these metrics in real-world healthcare settings.
- The medical segmentation decathlon. Nature communications, 13(1):4128, 2022.
- Tumor volume: a basic and specific response predictor in radiotherapy. Radiotherapy and oncology, 47(2):167–174, 1998.
- nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2):203–211, 2021.
- Treatment of pancreatic cancer tumors with intensity-modulated radiation therapy (imrt) using the volume at risk approach (vara): employing dose-volume histogram (dvh) and normal tissue complication probability (ntcp) to evaluate small bowel toxicity. Medical Dosimetry, 27(2):121–129, 2002.
- Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool. BMC medical imaging, 15:1–28, 2015.
- Image segmentation evaluation: a survey of methods. Artificial Intelligence Review, 53(8):5637–5674, 2020.
- Wikipedia contributors. Qm-am-gm-hm inequalities — Wikipedia, the free encyclopedia, 2024. URL https://en.wikipedia.org/w/index.php?title=QM-AM-GM-HM_inequalities&oldid=1210323463. [Online; accessed 8-April-2024].
- Dynamic linear transformer for 3d biomedical image segmentation. In International Workshop on Machine Learning in Medical Imaging, pages 171–180. Springer, 2022.