- The paper demonstrates that contrastive training significantly improves out-of-distribution (OOD) detection, especially in the challenging near-OOD regime.
- It employs a two-stage training process using a modified ResNet architecture and introduces the Class-wise Confusion Log Probability (CLP) metric.
- Empirical results show reduced false positive rates and improved AUROC and AUPR scores on datasets such as CIFAR-10 and CIFAR-100.
Analysis of Contrastive Training for Enhanced Out-of-Distribution Detection
This paper addresses out-of-distribution (OOD) detection using contrastive training techniques. The central aim of the research is to improve OOD detection capacity through a contrastive learning paradigm, demonstrating gains across several challenging datasets.
Core Contributions
The paper outlines a methodology that applies contrastive learning, a self-supervised approach, to improve the detection of OOD examples in images. Specifically, the authors investigate class-wise detection performance across datasets such as CIFAR-10 and CIFAR-100, using a contrastive loss to learn representations that separate inlier and outlier classes. The paper reports significant gains in detecting OOD instances, particularly where outlier and inlier classes are highly visually similar, termed the near-OOD regime.
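The summary does not state how the learned representations are converted into an OOD score. One common density-based choice, assumed here purely for illustration, is the minimum Mahalanobis distance from a test feature to class-conditional Gaussians fitted on inlier training features:

```python
import numpy as np

def mahalanobis_ood_score(train_feats, train_labels, x_feat):
    """Illustrative OOD score (an assumption, not confirmed by the summary):
    fit one Gaussian per inlier class with a shared covariance, then score a
    test feature by its minimum Mahalanobis distance to any class mean.
    Higher distance suggests the sample is more likely OOD."""
    classes = np.unique(train_labels)                      # sorted class ids
    means = np.stack([train_feats[train_labels == c].mean(axis=0)
                      for c in classes])
    # Residuals of each training feature from its own class mean
    centered = train_feats - means[np.searchsorted(classes, train_labels)]
    cov = centered.T @ centered / len(train_feats)         # shared covariance
    prec = np.linalg.inv(cov + 1e-6 * np.eye(cov.shape[0]))  # regularized inverse
    diffs = x_feat - means                                 # (n_classes, dim)
    dists = np.einsum('cd,de,ce->c', diffs, prec, diffs)   # per-class distances
    return float(dists.min())
```

A feature near one of the inlier clusters receives a low score, while a feature far from all of them receives a high one.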
Methodology Details
The research employs a ResNet-50 architecture with a width multiplier of 3, increasing model capacity to support contrastive learning. Training proceeds in two stages: a contrastive pretraining phase followed by a fine-tuning phase that combines supervised and contrastive losses. Optimization uses the LARS optimizer and a learning-rate schedule following best practices for large-batch training.
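Contrastive pretraining of this kind is typically driven by an NT-Xent (normalized temperature-scaled cross-entropy) loss over pairs of augmented views. A minimal numpy sketch, assuming SimCLR-style pairing where rows 2i and 2i+1 of the batch are two views of the same image (the paper's exact formulation may differ):

```python
import numpy as np

def nt_xent_loss(z, temperature=0.5):
    """NT-Xent loss over a batch of 2N embeddings; z[2i] and z[2i+1] are
    assumed to be augmented views of the same image. Minimal sketch."""
    z = z / np.linalg.norm(z, axis=1, keepdims=True)   # L2-normalize rows
    sim = z @ z.T / temperature                        # scaled cosine similarities
    np.fill_diagonal(sim, -np.inf)                     # exclude self-pairs
    n = len(z)
    # Index of each row's positive partner: 2i <-> 2i+1
    pos = np.array([i + 1 if i % 2 == 0 else i - 1 for i in range(n)])
    # Numerically stable log-softmax over each row
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - (row_max +
                      np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True)))
    return float(-log_prob[np.arange(n), pos].mean())
```

Lower loss means each embedding is closer to its positive view than to the other samples in the batch.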
The concept of Class-wise Confusion Log Probability (CLP) is introduced to quantify the performance of OOD detectors, providing a novel metric for assessing model confusion between inliers and outliers. An ensemble of ResNet-34 models is trained over an extended dataset comprising multiple image repositories, which aids in computing accurate CLP scores for various dataset pairs.
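From the description above, the CLP for a class pair (c, k) can be sketched as the log of the ensemble-averaged probability that a held-out sample of class c is assigned class k. The paper's exact normalization may differ; this is a straightforward reading of the metric:

```python
import numpy as np

def confusion_log_probability(ensemble_probs, labels, c, k):
    """Sketch of the Class-wise Confusion Log Probability (CLP).
    ensemble_probs: array of shape (n_models, n_samples, n_classes)
    holding softmax outputs from each ensemble member.
    Returns log E[p(k | x)] over held-out samples x of class c."""
    mean_probs = ensemble_probs.mean(axis=0)   # average over ensemble members
    mask = labels == c                         # select samples of class c
    return float(np.log(mean_probs[mask, k].mean()))
```

A less negative CLP indicates the ensemble confuses class c with class k more often, i.e. the pair is "nearer" in the OOD sense.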
Results and Numerical Findings
The paper reports substantial enhancements in performance metrics such as AUROC, AUPR, and detection accuracy. Notably, the empirical findings exhibit a reduction in the False Positive Rate (FPR) at a 95% True Positive Rate (TPR), especially for CIFAR-10 vs. CIFAR-100 and CIFAR-100 vs. CIFAR-10 dataset pairs. The introduction of label smoothing and contrastive training plays a critical role in these improvements, as evidenced by comparative results against baseline models.
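The FPR at 95% TPR and AUROC metrics mentioned here can be computed directly from per-sample OOD scores. A small sketch, treating in-distribution samples as positives and assuming higher scores mean "more in-distribution":

```python
import numpy as np

def fpr_at_tpr(in_scores, out_scores, tpr=0.95):
    """FPR at fixed TPR: find the score threshold that retains `tpr` of
    in-distribution samples, then measure the fraction of OOD samples
    still scoring above it (falsely accepted)."""
    thresh = np.quantile(in_scores, 1.0 - tpr)   # keep top `tpr` of inliers
    return float((out_scores >= thresh).mean())

def auroc(in_scores, out_scores):
    """AUROC via the rank-sum (Mann-Whitney U) identity; assumes no ties."""
    scores = np.concatenate([in_scores, out_scores])
    ranks = scores.argsort().argsort() + 1       # 1-based ranks
    n_in, n_out = len(in_scores), len(out_scores)
    u = ranks[:n_in].sum() - n_in * (n_in + 1) / 2
    return float(u / (n_in * n_out))
```

With perfectly separated score distributions, AUROC is 1.0 and FPR@95TPR is 0.0; the paper's improvements correspond to pushing these metrics toward those limits on near-OOD pairs.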
The results show that for far-OOD data, such as Gaussian noise or semantically unrelated images like Places365, the proposed methodology achieves nearly perfect detection scores, demonstrating the robustness of the contrastive approach across varying OOD conditions.
Implications and Future Directions
The enhanced detection capabilities presented in this paper have significant implications for real-world applications where distinguishing novel examples is crucial, such as autonomous systems and security screening. The promising results suggest that contrastive training can be further explored in domains beyond image classification, potentially extending to text or multimodal OOD detection.
Potential avenues for future work include refining the CLP metric to better accommodate class imbalance or integrating this approach with other self-supervised learning strategies to further bolster the detection of challenging OOD scenarios. Additionally, investigating other architectures or scaling the method to larger datasets may offer insights into the scalability and generalization of the proposed technique.
Overall, this paper offers substantial evidence for the efficacy of contrastive learning in enhancing OOD detection, providing a rigorous foundation for future explorations in the field.