The paper "GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection" explores advancements in using diffusion models for unsupervised anomaly detection. The primary goal is to improve how these models reconstruct images by introducing global and local adaptation mechanisms.
Key Contributions:
- Challenge with Uniform Treatment of Anomalies: Diffusion models traditionally treat all anomalies equally during image reconstruction, which can lead to uneven reconstruction performance across different anomaly types. This uniform approach fails to consider the complexity and variability of anomalies in the data.
- Global Perspective Adaptation: The authors propose customizing the denoising steps for each sample. By predicting a unique denoising step, the model evaluates the discrepancy between the image content and the priors from the diffusion model, allowing for more tailored and accurate reconstruction.
- Local Perspective Adaptation: Within a single image, reconstructing abnormal regions differs from normal regions. The paper suggests that standard Gaussian noise predictions deviate in abnormal areas. To address this, synthetic abnormal samples are introduced during training, encouraging the model to overcome the limitations of standard Gaussian distributions.
- Spatial-Adaptive Feature Fusion: During inference, a spatially adaptive feature fusion scheme is used to enhance performance further, enabling more precise reconstruction by focusing on the discrepancies between anomaly and normal regions within images.
Experimental Validation:
The model, named Global and Local Adaptive Diffusion (GLAD), is extensively tested on several datasets commonly used in anomaly detection:
- MVTec-AD
- MPDD
- VisA
- PCB-Bank: A printed circuit board dataset integrated by the authors.
Results from these experiments demonstrate the effectiveness of the GLAD model in achieving anomaly-free reconstructions while effectively retaining normal image information. This indicates that the proposed global and local adaptations are beneficial in handling diverse and complex anomalies in unsupervised settings.
Overall, the paper introduces a novel approach to enhancing the performance of diffusion models in anomaly detection by tailoring the reconstruction process both at a global and local level. This method provides increased flexibility and promises better handling of anomalies in varied datasets.