Calibrating Segmentation Networks with Margin-based Label Smoothing (2209.09641v2)
Abstract: Despite the undeniable progress in visual recognition tasks fueled by deep neural networks, there exists recent evidence showing that these models are poorly calibrated, resulting in over-confident predictions. The standard practices of minimizing the cross entropy loss during training promote the predicted softmax probabilities to match the one-hot label assignments. Nevertheless, this yields a pre-softmax activation of the correct class that is significantly larger than the remaining activations, which exacerbates the miscalibration problem. Recent observations from the classification literature suggest that loss functions that embed implicit or explicit maximization of the entropy of predictions yield state-of-the-art calibration performances. Despite these findings, the impact of these losses in the relevant task of calibrating medical image segmentation networks remains unexplored. In this work, we provide a unifying constrained-optimization perspective of current state-of-the-art calibration losses. Specifically, these losses could be viewed as approximations of a linear penalty (or a Lagrangian term) imposing equality constraints on logit distances. This points to an important limitation of such underlying equality constraints, whose ensuing gradients constantly push towards a non-informative solution, which might prevent from reaching the best compromise between the discriminative performance and calibration of the model during gradient-based optimization. Following our observations, we propose a simple and flexible generalization based on inequality constraints, which imposes a controllable margin on logit distances. Comprehensive experiments on a variety of public medical image segmentation benchmarks demonstrate that our method sets novel state-of-the-art results on these tasks in terms of network calibration, whereas the discriminative performance is also improved.
- Dataset of breast ultrasound images. Data in brief 28, 104863.
- The medical segmentation decathlon. Nature communications 13, 4128.
- Advancing the cancer genome atlas glioma mri collections with expert segmentation labels and radiomic features. Scientific data 4, 1–13.
- Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the brats challenge. arXiv preprint arXiv:1811.02629 .
- Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved? IEEE TMI 37, 2514–2525.
- Nonlinear Programming. Athena Scientific, Belmont, MA.
- Weight uncertainty in neural network, in: ICML.
- Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 .
- Local temperature scaling for probability calibration, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6889–6899.
- Deep ensembles: A loss landscape perspective. arXiv preprint arXiv:1912.02757 .
- Dropout as a bayesian approximation: Representing model uncertainty in deep learning, in: ICML.
- On calibration of modern neural networks, in: ICML.
- Benchmarking neural network robustness to common corruptions and perturbations, in: International Conference on Learning Representations.
- Probabilistic backpropagation for scalable learning of bayesian neural networks, in: ICML.
- Spatially varying label smoothing: Capturing uncertainty from expert annotations, in: International Conference on Information Processing in Medical Imaging, pp. 677–688.
- A bayesian neural net to segment images with uncertainty estimates and good calibration, in: International Conference on Information Processing in Medical Imaging, pp. 3–15.
- Analyzing the quality and challenges of uncertainty estimations for brain tumor segmentation. Frontiers in neuroscience 14, 282.
- Improving calibration and out-of-distribution detection in deep models for medical image segmentation. IEEE Transactions on Artificial Intelligence .
- Confidence histograms for model reliability analysis and temperature calibration, in: Medical Imaging with Deep Learning.
- Simple and scalable predictive uncertainty estimation using deep ensembles, in: NeurIPS.
- Orthogonal ensemble networks for biomedical image segmentation, in: MICCAI.
- Focal loss for dense object detection, in: CVPR.
- Evaluation of prostate segmentation algorithms for mri: the promise12 challenge. Medical image analysis 18, 359–373.
- The devil is in the margin: Margin-based label smoothing for network calibration, in: CVPR.
- Structured and efficient variational deep learning with matrix gaussian posteriors, in: ICML.
- Does label smoothing mitigate label noise?, in: ICML.
- Abdomenct-1k: Is abdominal organ segmentation a solved problem? IEEE Transactions on Pattern Analysis and Machine Intelligence doi:10.1109/TPAMI.2021.3100536.
- Meta-cal: Well-controlled post-hoc calibration by ranking, in: ICML.
- Isles 2015 - a public evaluation benchmark for ischemic stroke lesion segmentation from multispectral mri. Medical Image Analysis 35, 250–269. URL: https://www.sciencedirect.com/science/article/pii/S1361841516301268, doi:https://doi.org/10.1016/j.media.2016.07.009.
- Confidence calibration and predictive uncertainty estimation for deep medical image segmentation. IEEE transactions on medical imaging 39, 3868–3878.
- Mrbrains challenge: online evaluation framework for brain image segmentation in 3t mri scans. Comput. Intell. Neurosci. 2015, 1.
- Mrbrains challenge: online evaluation framework for brain image segmentation in 3t mri scans. Computational intelligence and neuroscience 2015.
- The multimodal brain tumor image segmentation benchmark (brats). IEEE Transactions on Medical Imaging 34, 1993–2024. doi:10.1109/TMI.2014.2377694.
- Revisiting the calibration of modern neural networks, in: NeurIPS.
- Calibrating deep neural networks using focal loss, in: NeurIPS.
- When does label smoothing help?, in: NeurIPS.
- Obtaining well calibrated probabilities using bayesian binning, in: Twenty-Ninth AAAI Conference on Artificial Intelligence.
- Predicting good probabilities with supervised learning, in: Proceedings of the 22nd international conference on Machine learning, pp. 625–632.
- Attention u-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 .
- Can you trust your model’s uncertainty? evaluating predictive uncertainty under dataset shift, in: NeurIPS.
- Regularizing neural networks by penalizing confident output distributions, in: ICLR.
- Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in large margin classifiers 10, 61–74.
- U-Net: Convolutional Networks for Biomedical Image Segmentation, in: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pp. 234–241.
- Rethinking the inception architecture for computer vision, in: CVPR.
- Post-hoc uncertainty calibration for domain drift scenarios, in: CVPR.
- Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks. Neurocomputing 338, 34–45.
- Hyperparameter ensembles for robustness and uncertainty quantification, in: NeurIPS.
- Disturblabel: Regularizing cnn on the loss layer, in: CVPR.
- Mix-n-match: Ensemble and compositional methods for uncertainty calibration in deep learning, in: ICML.
- Confidence calibration for convolutional neural networks using structured dropout. arXiv preprint arXiv:1906.09551 .
- Unet++: Redesigning skip connections to exploit multiscale features in image segmentation. IEEE Transactions on Medical Imaging 39, 1856–1867. doi:10.1109/TMI.2019.2959609.
- Balamurali Murugesan (23 papers)
- Bingyuan Liu (28 papers)
- Adrian Galdran (36 papers)
- Ismail Ben Ayed (133 papers)
- Jose Dolz (97 papers)