Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Dynamic Addition of Noise in a Diffusion Model for Anomaly Detection (2401.04463v2)

Published 9 Jan 2024 in cs.CV

Abstract: Diffusion models have found valuable applications in anomaly detection by capturing the nominal data distribution and identifying anomalies via reconstruction. Despite their merits, they struggle to localize anomalies of varying scales, especially larger anomalies such as entire missing components. Addressing this, we present a novel framework that enhances the capability of diffusion models, by extending the previous introduced implicit conditioning approach Meng et al. (2022) in three significant ways. First, we incorporate a dynamic step size computation that allows for variable noising steps in the forward process guided by an initial anomaly prediction. Second, we demonstrate that denoising an only scaled input, without any added noise, outperforms conventional denoising process. Third, we project images in a latent space to abstract away from fine details that interfere with reconstruction of large missing components. Additionally, we propose a fine-tuning mechanism that facilitates the model to effectively grasp the nuances of the target domain. Our method undergoes rigorous evaluation on prominent anomaly detection datasets VisA, BTAD and MVTec yielding strong performance. Importantly, our framework effectively localizes anomalies regardless of their scale, marking a pivotal advancement in diffusion-based anomaly detection.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. A survey of anomaly detection techniques in financial domain. Future Generation Computer Systems, 55:278–288, 2016. ISSN 0167-739X. doi: https://doi.org/10.1016/j.future.2015.01.001. URL https://www.sciencedirect.com/science/article/pii/S0167739X15000023.
  2. Ganomaly: Semi-supervised anomaly detection via adversarial training. In C. V. Jawahar, Hongdong Li, Greg Mori, and Konrad Schindler (eds.), Computer Vision – ACCV 2018, pp.  622–637, Cham, 2019. Springer International Publishing. ISBN 978-3-030-20893-6.
  3. Anomalib: A deep learning library for anomaly detection, 2022.
  4. Deep autoencoding models for unsupervised anomaly segmentation in brain mr images. In Alessandro Crimi, Spyridon Bakas, Hugo Kuijf, Farahani Keyvan, Mauricio Reyes, and Theo van Walsum (eds.), Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, pp.  161–169, Cham, 2019. Springer International Publishing. ISBN 978-3-030-11723-8.
  5. Mvtec ad — a comprehensive real-world dataset for unsupervised anomaly detection. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  9584–9592, 2019a. doi: 10.1109/CVPR.2019.00982.
  6. Mvtec ad–a comprehensive real-world dataset for unsupervised anomaly detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp.  9592–9600, 2019b.
  7. Improving unsupervised defect segmentation by applying structural similarity to autoencoders. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. SCITEPRESS - Science and Technology Publications, 2019c. doi: 10.5220/0007364503720380. URL https://doi.org/10.5220%2F0007364503720380.
  8. Align your latents: High-resolution video synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  22563–22575, 2023.
  9. Sub-image anomaly detection with deep pyramid correspondences. CoRR, abs/2005.02357, 2020. URL https://arxiv.org/abs/2005.02357.
  10. Padim: a patch distribution modeling framework for anomaly detection and localization. In International Conference on Pattern Recognition, pp. 475–489. Springer, 2021.
  11. H. Deng and X. Li. Anomaly detection via reverse distillation from one-class embedding. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  9727–9736, Los Alamitos, CA, USA, jun 2022. IEEE Computer Society. doi: 10.1109/CVPR52688.2022.00951. URL https://doi.ieeecomputersociety.org/10.1109/CVPR52688.2022.00951.
  12. Diffusion models beat gans on image synthesis. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan (eds.), Advances in Neural Information Processing Systems, volume 34, pp.  8780–8794. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/paper_files/paper/2021/file/49ad23d1ec9fa4bd8d77d02681df5cfa-Paper.pdf.
  13. Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp.  1705–1714, 2019.
  14. Cflow-ad: Real-time unsupervised anomaly detection with localization via conditional normalizing flows. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp.  98–107, 2022.
  15. Anomaly detection using deep learning based image completion. In 2018 17th IEEE international conference on machine learning and applications (ICMLA), pp.  1237–1242. IEEE, 2018.
  16. Classifier-free diffusion guidance. In NeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications, 2021. URL https://openreview.net/forum?id=qw8AKxfYbI.
  17. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
  18. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
  19. Diffwave: A versatile diffusion model for audio synthesis. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=a-xFK8Ymz5J.
  20. Cfa: Coupled-hypersphere-based feature adaptation for target-oriented anomaly localization. IEEE Access, 10:78446–78454, 2022. doi: 10.1109/ACCESS.2022.3193699.
  21. Simplenet: A simple network for image anomaly detection and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  20402–20411, June 2023.
  22. Decoupled weight decay regularization. In International Conference on Learning Representations, 2019. URL https://openreview.net/forum?id=Bkg6RiCqY7.
  23. Sdedit: Guided image synthesis and editing with stochastic differential equations. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. URL https://openreview.net/forum?id=aBsCjcPu_tE.
  24. Vt-adl: A vision transformer network for image anomaly detection and localization. In 2021 IEEE 30th International Symposium on Industrial Electronics (ISIE), pp.  01–06. IEEE, 2021.
  25. Anomaly detection with conditioned denoising diffusion models, 2023.
  26. Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, pp. 8162–8171. PMLR, 2021.
  27. GLIDE: towards photorealistic image generation and editing with text-guided diffusion models. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, and Sivan Sabato (eds.), International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pp.  16784–16804. PMLR, 2022. URL https://proceedings.mlr.press/v162/nichol22a.html.
  28. Dreamfusion: Text-to-3d using 2d diffusion. In The Eleventh International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=FjNys5c7VyY.
  29. Self-supervised predictive convolutional attentive block for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  13576–13586, 2022.
  30. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  10684–10695, 2022.
  31. Towards total recall in industrial anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  14318–14328, 2022.
  32. Same same but differnet: Semi-supervised defect detection with normalizing flows. CoRR, abs/2008.12577, 2020. URL https://arxiv.org/abs/2008.12577.
  33. Adversarially learned one-class classifier for novelty detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.  3379–3388, 2018.
  34. Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. CoRR, abs/1703.05921, 2017. URL http://arxiv.org/abs/1703.05921.
  35. Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pp. 2256–2265. PMLR, 2015.
  36. Denoising diffusion implicit models, 2022.
  37. Revisiting reverse distillation for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  24511–24520, 2023.
  38. Anoddpm: Anomaly detection with denoising diffusion probabilistic models using simplex noise. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.  649–655, 2022. doi: 10.1109/CVPRW56347.2022.00080.
  39. Fastflow: Unsupervised anomaly detection and localization via 2d normalizing flows, 2021.
  40. Dræm - A discriminatively trained reconstruction embedding for surface anomaly detection. CoRR, abs/2108.07610, 2021a. URL https://arxiv.org/abs/2108.07610.
  41. Reconstruction by inpainting for visual anomaly detection. Pattern Recognition, 112:107706, 2021b.
  42. Diffusionad: Denoising diffusion for anomaly detection. arXiv preprint arXiv:2303.08730, 2023.
  43. Spot-the-difference self-supervised pre-training for anomaly detection and segmentation. In ECCV 2022, 2022.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Justin Tebbe (1 paper)
  2. Jawad Tayyub (7 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.