- The paper presents ViM, a method that fuses class-specific logits with projected feature residuals to construct a virtual-logit for improved out-of-distribution detection.
- It leverages a novel residual analysis in feature space, achieving roughly a 4% AUROC improvement over the best existing baseline on ImageNet-scale OOD benchmarks.
- The approach is validated on diverse architectures, including CNNs and vision transformers, and is supported by a new large-scale, annotated OOD benchmark dataset.
An Analytical Overview of "ViM: Out-Of-Distribution with Virtual-logit Matching"
The paper "ViM: Out-Of-Distribution with Virtual-logit Matching" introduces a novel methodology for out-of-distribution (OOD) detection, which forms a critical aspect of deploying machine learning models in real-world settings. Unlike traditional OOD detection approaches that rely on a single input source, ViM innovatively merges the strengths of multiple sources — specifically extracting insights from both class-agnostic features and class-dependent logits. This methodology is characterized by the construction of a "virtual-logit" via residual analysis in feature space, which is designed to be congruent with class-specific logits.
Methodology and Technical Contributions
- Motivational Foundation: Existing OOD scores that rely on features alone or on logits alone are fragile, and each misses failure cases that the other captures; the diversity and high dimensionality of OOD samples suggest that combining both information sources should yield better performance.
- Virtual-logit Matching (ViM): The ViM algorithm produces an OOD score by pairing a virtual logit, derived from feature-space deviations, with the existing class logits. The feature's residual against a principal subspace (its projection onto the subspace's orthogonal complement) is computed, its norm is rescaled to match the magnitude of the real logits, and the result is appended as a virtual OOD-class logit alongside the conventional logits; the softmax probability of this virtual class serves as the OOD score (see the sketch after this list).
- Enhanced Dataset Creation: Noting the shortcomings of existing OOD test sets, the authors introduce OpenImage-O, a new large-scale, human-annotated OOD benchmark dataset paired with ImageNet-1K, raising the evaluation standard for OOD methodologies.
- Quantitative Evaluation: The technique's efficacy is evidenced by substantial AUROC improvements on challenging OOD benchmarks, including a roughly 4% gain over the best existing baseline with the CNN-based BiT-S model. The gains are consistent across diverse and nuanced OOD cases that have traditionally been difficult to separate from in-distribution data.
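To make the construction in the Virtual-logit Matching bullet concrete, here is a minimal NumPy sketch of how a ViM-style score could be computed from penultimate-layer features and the final linear layer's weight and bias. The function name, array conventions, and the `dim=256` principal-subspace size are illustrative assumptions (the paper chooses this dimension based on the feature dimensionality); this is a sketch of the described procedure, not the authors' reference implementation.

```python
import numpy as np

def vim_score(feat, logits, W, b, feat_train, logits_train, dim=256):
    """Sketch of a ViM-style OOD score (higher = more OOD).

    feat, feat_train: (N, d) penultimate-layer features.
    logits, logits_train: (N, C) classifier outputs.
    W: (C, d) final-layer weight, b: (C,) bias.  Names are illustrative.
    """
    # Shift the feature origin so that the logits become a pure linear map of features.
    u = -np.linalg.pinv(W) @ b                     # shape (d,)
    X_train, X = feat_train - u, feat - u

    # Principal subspace from the training-feature covariance; the directions
    # with the smallest eigenvalues span its orthogonal complement.
    cov = X_train.T @ X_train / len(X_train)
    _, eigvecs = np.linalg.eigh(cov)               # eigenvalues in ascending order
    R = eigvecs[:, : X.shape[1] - dim]             # residual-space basis, (d, d - dim)

    # Residual norms: length of the feature component outside the principal subspace.
    res_train = np.linalg.norm(X_train @ R, axis=1)
    res = np.linalg.norm(X @ R, axis=1)

    # Rescale so the virtual logit matches the magnitude of the real logits.
    alpha = logits_train.max(axis=1).sum() / res_train.sum()
    virtual_logit = alpha * res

    # Softmax over [real logits, virtual logit]; the virtual class's
    # probability is the OOD score.
    full = np.concatenate([logits, virtual_logit[:, None]], axis=1)
    full -= full.max(axis=1, keepdims=True)
    probs = np.exp(full) / np.exp(full).sum(axis=1, keepdims=True)
    return probs[:, -1]
```

In practice, `feat_train` and `logits_train` would come from in-distribution training data, and a threshold on the returned score determines the accept/reject decision at test time.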
Comparative Analysis and Evaluation
The research contrasts ViM with contemporary methods on models including both convolutional networks and vision transformers. The experiments confirm that ViM maintains robust performance across architectures, whereas many baselines degrade when the network structure or the OOD dataset changes. Notably, the analysis of how class-specific logits interact with feature-space residuals illustrates why merging the feature and logit spaces counteracts the weaknesses each exhibits when used in isolation; the sketch below shows how such cross-method comparisons are typically scored.
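Comparisons of this kind are usually reported as AUROC and FPR@95%TPR over pooled in-distribution and OOD scores. Below is a minimal evaluation sketch assuming scikit-learn is available; the function name and the score convention (higher means more in-distribution) are illustrative choices rather than details from the paper.

```python
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

def evaluate_ood(id_scores, ood_scores):
    """Score an OOD detector by AUROC and FPR@95%TPR, treating in-distribution
    samples as the positive class; higher score = more in-distribution."""
    labels = np.concatenate([np.ones_like(id_scores), np.zeros_like(ood_scores)])
    scores = np.concatenate([id_scores, ood_scores])
    auroc = roc_auc_score(labels, scores)
    fpr, tpr, _ = roc_curve(labels, scores)
    fpr95 = fpr[np.searchsorted(tpr, 0.95)]        # FPR at the first threshold reaching 95% TPR
    return auroc, fpr95
```

If the score from the earlier ViM sketch is used (where higher means more OOD), negate it before passing it to this function.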
Implications and Future Directions
The paper's contributions suggest a shift in how OOD problems can be tackled, setting a precedent for integrating multiple data representations to enhance detection robustness. The performance implications include improved reliability in high-stakes environments, such as autonomous systems and medical diagnostics, where OOD detection accuracy is critical. Theoretically, ViM's success could spur further exploration into hybrid OOD detection approaches, encouraging the community to explore additional multidimensional fusion techniques.
Concluding Thoughts
The paper presents a significant advancement in the OOD detection landscape, both methodologically and in practical application. By showcasing a technique that effectively integrates diverse information sources, the authors set the stage for new research directions that could redefine outlier analysis and broaden our understanding of distributional safety in machine learning.