Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection (2403.05897v1)

Published 9 Mar 2024 in cs.CV

Abstract: Self-supervised feature reconstruction methods have shown promising advances in industrial image anomaly detection and localization. Despite this progress, these methods still face challenges in synthesizing realistic and diverse anomaly samples, as well as addressing the feature redundancy and pre-training bias of pre-trained feature. In this work, we introduce RealNet, a feature reconstruction network with realistic synthetic anomaly and adaptive feature selection. It is incorporated with three key innovations: First, we propose Strength-controllable Diffusion Anomaly Synthesis (SDAS), a diffusion process-based synthesis strategy capable of generating samples with varying anomaly strengths that mimic the distribution of real anomalous samples. Second, we develop Anomaly-aware Features Selection (AFS), a method for selecting representative and discriminative pre-trained feature subsets to improve anomaly detection performance while controlling computational costs. Third, we introduce Reconstruction Residuals Selection (RRS), a strategy that adaptively selects discriminative residuals for comprehensive identification of anomalous regions across multiple levels of granularity. We assess RealNet on four benchmark datasets, and our results demonstrate significant improvements in both Image AUROC and Pixel AUROC compared to the current state-o-the-art methods. The code, data, and models are available at https://github.com/cnulab/RealNet.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (55)
  1. Skip-ganomaly: Skip connected and adversarially trained encoder-decoder anomaly detection. In 2019 International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, 2019.
  2. Deep autoencoding models for unsupervised anomaly segmentation in brain mr images. In Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries: 4th International Workshop, BrainLes 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 16, 2018, Revised Selected Papers, Part I 4, pages 161–169. Springer, 2019.
  3. Mvtec-ad: A comprehensive real-world dataset for unsupervised anomaly detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9592–9600, 2019.
  4. Uninformed students: Student-teacher anomaly detection with discriminative latent embeddings. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4183–4192, 2020.
  5. Describing textures in the wild. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3606–3613, 2014.
  6. Sub-image anomaly detection with deep pyramid correspondences. arXiv preprint arXiv:2005.02357, 2020.
  7. Padim: a patch distribution modeling framework for anomaly detection and localization. In Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part IV, pages 475–489. Springer, 2021.
  8. Anomaly detection via reverse distillation from one-class embedding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9737–9746, 2022.
  9. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
  10. Diffusion models beat gans on image synthesis. Advances in Neural Information Processing Systems, 34:8780–8794, 2021.
  11. Few-shot defect image generation via defect-aware feature manipulation. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 571–578, 2023.
  12. Cflow-ad: Real-time unsupervised anomaly detection with localization via conditional normalizing flows. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 98–107, 2022.
  13. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  14. Exploring the importance of pretrained feature extractors for unsupervised anomaly detection and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2916–2925, 2023.
  15. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems, 30, 2017.
  16. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851, 2020.
  17. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning, pages 448–456. pmlr, 2015.
  18. Deep learning-based defect detection of metal parts: evaluating current methods in complex conditions. In 2021 13th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), pages 66–71. IEEE, 2021.
  19. Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8110–8119, 2020.
  20. Cutpaste: Self-supervised learning for anomaly detection and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9664–9674, 2021.
  21. Simplenet: A simple network for image anomaly detection and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20402–20411, 2023.
  22. Explainable deep one-class classification. In International Conference on Learning Representations, 2021.
  23. Removing anomalies as noises for industrial defect localization. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 16166–16175, 2023.
  24. Vt-adl: A vision transformer network for image anomaly detection and localization. In 2021 IEEE 30th International Symposium on Industrial Electronics (ISIE), pages 01–06. IEEE, 2021.
  25. Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, pages 8162–8171. PMLR, 2021.
  26. Poisson image editing. In ACM SIGGRAPH 2003 Papers, pages 313–318. 2003.
  27. Ken Perlin. An image synthesizer. ACM Siggraph Computer Graphics, 19(3):287–296, 1985.
  28. Inpainting transformer for anomaly detection. In Image Analysis and Processing–ICIAP 2022: 21st International Conference, Lecce, Italy, May 23–27, 2022, Proceedings, Part II, pages 394–406. Springer, 2022.
  29. Self-supervised predictive convolutional attentive block for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13576–13586, 2022.
  30. Towards total recall in industrial anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14318–14328, 2022.
  31. Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. In Information Processing in Medical Imaging: 25th International Conference, IPMI 2017, Boone, NC, USA, June 25-30, 2017, Proceedings, pages 146–157. Springer, 2017.
  32. Natural synthetic anomalies for self-supervised anomaly detection and localization. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXI, pages 474–489. Springer, 2022.
  33. Unsupervised anomaly segmentation via deep feature reconstruction. Neurocomputing, 424:9–22, 2021.
  34. Denoising diffusion implicit models. In International Conference on Learning Representations, 2021.
  35. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pages 6105–6114. PMLR, 2019.
  36. Anomaly detection neural network with dual auto-encoders gan and its industrial inspection applications. Sensors, 20(12):3336, 2020.
  37. Unsupervised anomaly detection for surface defects with dual-siamese network. IEEE Transactions on Industrial Informatics, 18(11):7707–7717, 2022.
  38. Revisiting reverse distillation for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 24511–24520, 2023.
  39. Anoddpm: Anomaly detection with denoising diffusion probabilistic models using simplex noise. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 650–656, 2022.
  40. Softpatch: Unsupervised anomaly detection with noisy data. In Advances in Neural Information Processing Systems.
  41. Memseg: A semi-supervised method for image surface defect detection using differences and commonalities. Engineering Applications of Artificial Intelligence, 119:105835, 2023.
  42. Explicit boundary guided semi-push-pull contrastive learning for supervised anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 24490–24499, 2023.
  43. Patch svdd: Patch-level svdd for anomaly detection and segmentation. In Proceedings of the Asian Conference on Computer Vision, 2020.
  44. A unified model for multi-class anomaly detection. In Advances in Neural Information Processing Systems, 2022.
  45. Defect segmentation of hot-rolled steel strip surface by using convolutional auto-encoder and conventional image processing. In 2019 10th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES), pages 1–5. IEEE, 2019.
  46. Fastflow: Unsupervised anomaly detection and localization via 2d normalizing flows. arXiv preprint arXiv:2111.07677, 2021.
  47. Wide residual networks. In Proceedings of the British Machine Vision Conference (BMVC), pages 87.1–87.12. BMVA Press, 2016.
  48. Draem: A discriminatively trained reconstruction embedding for surface anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8330–8339, 2021.
  49. Dsr: A dual subspace re-projection network for surface anomaly detection. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXI, pages 539–554. Springer, 2022.
  50. Prototypical residual networks for anomaly detection and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16281–16291, 2023a.
  51. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018.
  52. Unsupervised surface anomaly detection with diffusion probabilistic model. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6782–6791, 2023b.
  53. Destseg: Segmentation guided denoising student-teacher for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3914–3923, 2023c.
  54. Ying Zhao. Omnial: A unified cnn framework for unsupervised anomaly localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3924–3933, 2023.
  55. Spot-the-difference self-supervised pre-training for anomaly detection and segmentation. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXX, pages 392–408. Springer, 2022.
Citations (15)

Summary

  • The paper introduces RealNet, a framework that integrates diffusion-based anomaly synthesis (SDAS), anomaly-aware feature selection (AFS), and residual selection (RRS) for enhanced detection.
  • RealNet generates realistic synthetic anomalies while reducing feature redundancy to improve both image and pixel-level detection performance.
  • Evaluations on benchmark datasets demonstrate superior anomaly localization accuracy, paving the way for future generative approaches in industrial imaging.

Overview of RealNet: A Feature Selection Network for Anomaly Detection

The paper introduces RealNet, a novel framework for industrial image anomaly detection, focusing on the challenges of realistic anomaly synthesis and efficient feature selection. RealNet advances the field by incorporating Strength-controllable Diffusion Anomaly Synthesis (SDAS), Anomaly-aware Features Selection (AFS), and Reconstruction Residuals Selection (RRS). The paper presents a comprehensive evaluation on benchmark datasets, showcasing RealNet's performance improvements over existing methods.

RealNet's architecture is grounded on three pivotal innovations:

  1. Strength-controllable Diffusion Anomaly Synthesis (SDAS): SDAS leverages a diffusion-based process to generate synthetic anomalies with controllable strengths. This approach addresses the limitations of existing methods that rely on data augmentation or external datasets, offering improved realism and diversity in synthetic anomalies. The implementation of SDAS is grounded in modeling the anomalies as perturbations in low probability density regions of the normal data distribution.
  2. Anomaly-aware Features Selection (AFS): AFS systematically identifies the most relevant subset of pre-trained feature maps, enhancing the discriminatory capacity of RealNet while managing computational costs. This process significantly mitigates the dimensionality and redundancy issues commonly associated with high-dimensional, pre-trained CNN features. The feature selection is conducted in a self-supervised manner, utilizing synthetic anomalies generated by SDAS.
  3. Reconstruction Residuals Selection (RRS): RRS selectively processes reconstruction residuals to enhance localization accuracy across varying levels of granularity. By adaptively discarding less informative residuals, RRS improves RealNet's capability to identify and recall anomalous regions. The integration of RRS into the anomaly detection pipeline contributes to a tangible uplift in both Image AUROC and Pixel AUROC metrics, as evidenced by the experimental results.

The empirical evaluations conducted on datasets such as MVTec-AD, MPDD, BTAD, and VisA, demonstrate the superiority of RealNet in terms of image and pixel-level anomaly detection and localization. The inclusion of the Synthetic Industrial Anomaly Dataset (SIA), generated by SDAS, further facilitates the advancement of self-supervised anomaly detection approaches.

Implications and Future Directions

The research embodies a significant step towards improving the practical applicability of anomaly detection systems in industrial contexts. By addressing both the synthesis of realistic anomalies and the efficient use of pre-trained features, RealNet enhances the generalization capability of detection models to real-world data variations.

From a theoretical perspective, the successful integration of diffusion models in anomaly synthesis highlights the potential for leveraging generative frameworks in anomaly detection tasks. This approach opens up future research avenues exploring different generative architectures and their applications in creating highly plausible anomaly scenarios.

Additionally, the adaptive selection mechanisms developed for feature maps (AFS) and reconstruction residuals (RRS) in RealNet emphasize the importance of targeted, efficient model designs for handling large-scale pre-trained networks, which could inspire further research into optimization techniques within other domains of computer vision.

Conclusively, the RealNet framework is a robust contribution to the field of anomaly detection, with practical implications for improving safety and quality assurance across various industrial sectors. Future work could explore expanding RealNet's applicability to other anomaly detection scenarios outside of industrial imaging, such as cybersecurity and medical imaging.

Youtube Logo Streamline Icon: https://streamlinehq.com