DRAEM -- A discriminatively trained reconstruction embedding for surface anomaly detection (2108.07610v2)

Published 17 Aug 2021 in cs.CV

Abstract: Visual surface anomaly detection aims to detect local image regions that significantly deviate from normal appearance. Recent surface anomaly detection methods rely on generative models to accurately reconstruct the normal areas and to fail on anomalies. These methods are trained only on anomaly-free images, and often require hand-crafted post-processing steps to localize the anomalies, which prohibits optimizing the feature extraction for maximal detection capability. In addition to reconstructive approach, we cast surface anomaly detection primarily as a discriminative problem and propose a discriminatively trained reconstruction anomaly embedding model (DRAEM). The proposed method learns a joint representation of an anomalous image and its anomaly-free reconstruction, while simultaneously learning a decision boundary between normal and anomalous examples. The method enables direct anomaly localization without the need for additional complicated post-processing of the network output and can be trained using simple and general anomaly simulations. On the challenging MVTec anomaly detection dataset, DRAEM outperforms the current state-of-the-art unsupervised methods by a large margin and even delivers detection performance close to the fully-supervised methods on the widely used DAGM surface-defect detection dataset, while substantially outperforming them in localization accuracy.

Authors (3)

Vitjan Zavrtanik (7 papers)
Matej Kristan (39 papers)
Danijel Skočaj (16 papers)

Citations (472)

View on Semantic Scholar

Summary

Discriminatively Trained Reconstruction Embedding for Surface Anomaly Detection

The paper presents the DRÆM, a novel approach for surface anomaly detection utilizing a discriminatively trained reconstruction anomaly embedding model. This method seeks to address limitations in existing surface anomaly detection techniques that primarily rely on generative models to detect anomalies. These conventional methods, which are trained using only anomaly-free images, often necessitate complex, handcrafted post-processing to localize anomalies effectively, thereby limiting their potential for optimal feature extraction.

The DRÆM paradigm shifts from a purely reconstructive to a discriminatively informed approach. It introduces a joint representation mechanism for an anomalous image and its anomaly-free reconstruction while concurrently learning a decision boundary that distinguishes between normal and anomalous examples. This framework allows for the direct localization of anomalies, eliminating the need for supplementary post-processing and thus simplifying the anomaly detection pipeline substantially.

In their empirical evaluation, the authors demonstrate the efficacy of DRÆM on the MVTec anomaly detection dataset. Here, DRÆM surpasses the performance of existing state-of-the-art unsupervised methods by a significant margin, achieving results that approach the efficacy of fully supervised methods on the DAGM surface-defect detection dataset. Notably, DRÆM not only achieves superior detection performance but also boasts a substantial improvement in localization accuracy compared to its predecessors.

The DRÆM model adopts an innovative training technique that leverages synthetic anomaly simulations, thus facilitating a robust model devoid of reliance on real anomalous samples. The architecture consists of two key components: a reconstructive sub-network and a discriminative sub-network. The reconstructive sub-network is responsible for encasing local patterns of an input image into a distribution similar to normal samples, thereby facilitating anomaly inpainting. Simultaneously, the discriminative sub-network exploits the joint appearance of the original and reconstructed images to produce precise anomaly segmentation maps. This joint training of reconstruction and discrimination capitalizes on the recognizable deviations introduced by synthetic anomalies to refine the decision boundary between normal and anomalous appearances.

Additional experiments detail a comprehensive ablation paper, verifying that the joint reconstruction-anomaly embedding intrinsically enhances the anomaly detection process. Various tests were conducted using different anomaly-generation datasets, revealing that DRÆM achieves remarkable detection and localization capabilities, even when using minimal texture variability in synthetic anomaly training. The findings underscore the model's aptitude for generalization across diverse anomaly configurations.

The research presents significant implications for practical and theoretical future advancements in AI-driven anomaly detection. By overcoming the need for real anomaly datasets and heavy manual annotation, DRÆM paves the way for developing more efficient, self-contained anomaly detection frameworks. Furthermore, the paper largely emphasizes the potential for discriminative learning, marking a noteworthy pivot from conventional reconstructive methods. This approach could inform broader applications across domains necessitating refined anomaly localization capabilities, such as in quality control and security monitoring systems.

DRÆM's methodological innovations promise to substantially influence ongoing developments in anomaly detection technology, particularly in how AI systems approach learning and generalizing from synthetic data variations. Subsequent research could explore real-world deployments of DRÆM to refine its practicality and validate its superiority over existing anomaly detection frameworks in applied settings.

PDF Markdown

Related Papers

Find Related Papers