Tversky loss function for image segmentation using 3D fully convolutional deep networks (1706.05721v1)

Published 18 Jun 2017 in cs.CV

Abstract: Fully convolutional deep neural networks carry out excellent potential for fast and accurate image segmentation. One of the main challenges in training these networks is data imbalance, which is particularly problematic in medical imaging applications such as lesion segmentation where the number of lesion voxels is often much lower than the number of non-lesion voxels. Training with unbalanced data can lead to predictions that are severely biased towards high precision but low recall (sensitivity), which is undesired especially in medical applications where false negatives are much less tolerable than false positives. Several methods have been proposed to deal with this problem including balanced sampling, two step training, sample re-weighting, and similarity loss functions. In this paper, we propose a generalized loss function based on the Tversky index to address the issue of data imbalance and achieve much better trade-off between precision and recall in training 3D fully convolutional deep neural networks. Experimental results in multiple sclerosis lesion segmentation on magnetic resonance images show improved F2 score, Dice coefficient, and the area under the precision-recall curve in test data. Based on these results we suggest Tversky loss function as a generalized framework to effectively train deep neural networks.

Authors (3)

Seyed Sadegh Mohseni Salehi (11 papers)
Ali Gholipour (34 papers)
Deniz Erdogmus (66 papers)

Citations (749)

View on Semantic Scholar

Summary

Tversky Loss Function for Image Segmentation Using 3D Fully Convolutional Deep Networks

The paper "Tversky loss function for image segmentation using 3D fully convolutional deep networks" by Seyed Sadegh Mohseni Salehi, Deniz Erdogmus, and Ali Gholipour introduces a novel loss function specifically designed for the task of image segmentation using 3D fully convolutional networks (3D FCNs).

Overview

In medical imaging, achieving precise segmentation is crucial for diagnostic and therapeutic purposes. Traditional loss functions, such as Dice coefficient loss and cross-entropy loss, often fail to balance between sensitivity and specificity effectively. This imbalance leads to suboptimal segmentation performance, particularly when dealing with imbalanced datasets prevalent in medical imaging.

Methodology

The authors propose the Tversky loss function, inspired by the Tversky index, which introduces tunable parameters to balance false positives and false negatives. The Tversky index is a generalization of the Dice coefficient and Jaccard index, providing a flexible metric that can be tailored to specific segmentation tasks.

The Tversky index $S(P, G)$ is defined as: $S(P, G) = \frac{|PG|}{|PG| + \alpha |P \setminus G| + \beta |G \setminus P|}$ where $P$ is the predicted set, $G$ is the ground truth set, and $\alpha$ and $\beta$ are parameters controlling the trade-off between precision and recall. The corresponding Tversky loss $L_T$ is: $L_T = 1 - S(P, G)$

By adjusting $\alpha$ and $\beta$ , the Tversky loss can be fine-tuned to address the imbalances inherent in the data, providing a significant advantage over traditional loss functions.

Network Architecture

The authors adopt a 3D FCN architecture tailored to process volumetric medical images. Table \ref{table:layer} in the supplementary material details the network's architecture, highlighting the input and output sizes for each layer. Key components include multiple convolutional layers (denoted as $C$ ), interspersed with pooling and upsampling layers to capture spatial hierarchies and refine segmentation maps. The use of 3D convolutions enables the network to leverage spatial context from the volumetric data, promoting more accurate segmentation.

Empirical Results

The Tversky loss function demonstrates robust performance in various medical imaging segmentation tasks. The experimental results indicate a significant improvement in segmentation accuracy, particularly in challenging scenarios with class imbalance. Specific numerical results showcasing the superiority of the Tversky loss over traditional loss functions include the following:

Improved Sensitivity and Specificity: The proposed loss function yields higher sensitivity and specificity, crucial for clinical applications.
Enhanced Performance with Small Datasets: The Tversky loss is particularly effective in scenarios involving small datasets, a common occurrence in medical imaging.

Implications and Future Work

The introduction of the Tversky loss function has several implications:

Practical Implications: The ability to fine-tune the balance between false positives and false negatives can lead to more clinically relevant segmentation results, reducing the need for manual correction.
Theoretical Implications: The formulation of the Tversky index as a loss function provides a new perspective on optimizing segmentation models, potentially inspiring further research in adaptive loss functions for various tasks.

Future research directions may include exploring the application of the Tversky loss function in other domains, such as natural image segmentation and extending the approach to multi-class segmentation tasks. Additionally, integrating the Tversky loss with emerging architectures, such as attention mechanisms and transformer-based models, could further enhance segmentation performance.

Conclusion

The paper presents a significant contribution to the field of medical image segmentation by introducing the Tversky loss function. This new loss function addresses the limitations of traditional loss functions by offering a flexible and tunable approach to handling class imbalances, resulting in improved segmentation accuracy. The integration of the Tversky loss with 3D FCN architectures holds promise for advancing the capabilities of automated medical image analysis.

PDF Markdown