VOS: Learning What You Don't Know by Virtual Outlier Synthesis (2202.01197v4)

Published 2 Feb 2022 in cs.LG and cs.CV

Abstract: Out-of-distribution (OOD) detection has received much attention lately due to its importance in the safe deployment of neural networks. One of the key challenges is that models lack supervision signals from unknown data, and as a result, can produce overconfident predictions on OOD data. Previous approaches rely on real outlier datasets for model regularization, which can be costly and sometimes infeasible to obtain in practice. In this paper, we present VOS, a novel framework for OOD detection by adaptively synthesizing virtual outliers that can meaningfully regularize the model's decision boundary during training. Specifically, VOS samples virtual outliers from the low-likelihood region of the class-conditional distribution estimated in the feature space. Alongside, we introduce a novel unknown-aware training objective, which contrastively shapes the uncertainty space between the ID data and synthesized outlier data. VOS achieves competitive performance on both object detection and image classification models, reducing the FPR95 by up to 9.36% compared to the previous best method on object detectors. Code is available at https://github.com/deeplearning-wisc/vos.

PDF Abstract

Virtual Outlier Synthesis (VOS): Advancements in Out-of-Distribution Detection

The task of out-of-distribution (OOD) detection has emerged as a pivotal concern within the safe deployment of neural networks, particularly when such networks may encounter data points considerably different from their training distribution. This challenge necessitates methodologies that can prevent models from making overconfident predictions on unknown data. Existing approaches often depend on real outlier datasets to aid regularization, a solution that is not always practical due to potential cost and feasibility limitations. In this context, the paper introduces a novel approach named Virtual Outlier Synthesis (VOS), which focuses on synthesizing virtual outliers to effectively regularize the model's decision boundaries.

VOS leverages a unique mechanism of generating these virtual outliers from the low-likelihood regions of the class-conditional distribution within the feature space. This innovation is coupled with a compelling unknown-aware training objective that establishes a contrastive uncertainty landscape between in-distribution (ID) and synthesized outlier data. Empirical results demonstrate that VOS achieves a substantial reduction in FPR95 by up to 9.36% in object detection scenarios when pitted against the best existing methods, all while maintaining performance on in-distribution tasks.

The framework for VOS is built around three core components: virtual outlier synthesis, effective model regularization, and the OOD detection mechanisms employed during inference. For synthesizing outliers, VOS circumvents the complexities of high-dimensional pixel space and focuses on feature space. Here, class-conditional multivariate Gaussian distributions encapsulate the feature representations of object instances. The virtual outliers are sampled based on the probabilistic thresholds from these distributions using lower-dimensional feature representations, thus avoiding the optimization hurdles often encountered with traditional generative models.

Key throughout the training process is the refined regulation of the model, ensuring that synthesized outliers remain on the peripheries of ID data, a task achieved through an uncertainty regularization loss. The proposed regularization loss applies a binary sigmoid layer across energy levels calculated from model outputs, without necessitating tuning multiple hyperparameters, unlike squared hinge loss configurations.

Moreover, VOS provides the flexibility to be utilized across both classification and object detection tasks, indicating its adaptability. Theoretical insights, complemented by exhaustive experimental evaluations, reflect the robustness of VOS in pushing the boundaries of OOD detection capabilities across both benchmark image classification and object detection datasets.

From a practical perspective, VOS offers substantial implications for real-world applications, notably in fields like autonomous driving, where safety and precision in detecting unknown or anomalous objects can be lifesaving. The adaptability in object-level context detection infers enhanced flexibility over traditional image-level tasks, hence fostering safer handling of anomalies.

The trajectory of VOS in future AI research could further explore enhanced synthesis methods that employ dynamic feature spaces or integrate more potent generative models while maintaining efficiency. The modular approach of VOS to various layers within neural architectures could also influence fine-grained interpretability and diagnostic capacities in models tackling diverse data domains.

Overall, the Virtual Outlier Synthesis approach detailed in this paper provides a compelling contribution to the OOD detection sphere, balancing between computational tractability and performance gains, thus encouraging further exploration and adaptation in real-world, safety-critical AI deployments.

PDF Markdown Bookmark Chat (Pro)

Authors (4)

Xuefeng Du (26 papers)
Zhaoning Wang (7 papers)
Mu Cai (21 papers)
Yixuan Li (183 papers)

Citations (260)

View on Semantic Scholar

Related Papers

Find Related Papers

YouTube

Show All Videos