Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation (2403.06247v2)
Abstract: We propose a text-guided variational image generation method to address the challenge of getting clean data for anomaly detection in industrial manufacturing. Our method utilizes text information about the target object, learned from extensive text library documents, to generate non-defective data images resembling the input image. The proposed framework ensures that the generated non-defective images align with anticipated distributions derived from textual and image-based knowledge, ensuring stability and generality. Experimental results demonstrate the effectiveness of our approach, surpassing previous methods even with limited non-defective data. Our approach is validated through generalization tests across four baseline models and three distinct datasets. We present an additional analysis to enhance the effectiveness of anomaly detection models by utilizing the generated images.
- Image anomaly detection and localization with position and neighborhood information. arXiv, 2022.
- Efficientad: Accurate visual anomaly detection at millisecond-level latencies. arXiv, 2023.
- Mvtec ad–a comprehensive real-world dataset for unsupervised anomaly detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9592–9600, 2019.
- Beyond dents and scratches: Logical constraints in unsupervised anomaly detection and localization. International Journal of Computer Vision, 130(4):947–969, 2022.
- Vqgan-clip: Open domain image generation and editing with natural language guidance. In Computer Vision–ECCV 2022, pages 88–105. Springer, 2022.
- Padim: a patch distribution modeling framework for anomaly detection and localization. In Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part IV, pages 475–489. Springer, 2021.
- Anomaly detection via reverse distillation from one-class embedding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9737–9746, 2022.
- Taming transformers for high-resolution image synthesis, 2020.
- Christiane Fellbaum. Wordnet and wordnets. In Encyclopedia of Language and Linguistics, pages 665–670. Elsevier, 2005.
- Generative adversarial nets. Advances in neural information processing systems, 27, 2014.
- Google. Google image search. https://www.google.com/imghp?hl=ko&ogbl, 2023. Accessed on 2023.
- Cflow-ad: Real-time unsupervised anomaly detection with localization via conditional normalizing flows. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 98–107, 2022.
- Winclip: Zero-/few-shot anomaly classification and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19606–19616, 2023.
- Clip-event: Connecting text and images with event structures. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16420–16429, 2022.
- MidJourney. Midjourney home page. https://www.midjourney.com/home, 2023. Accessed on 2023.
- Vt-adl: A vision transformer network for image anomaly detection and localization. In 2021 IEEE 30th International Symposium on Industrial Electronics, pages 01–06. IEEE, 2021.
- Learning transferable visual models from natural language supervision. In International conference on machine learning, pages 8748–8763. Proceedings of Machine Learning Research, 2021.
- Zero-shot text-to-image generation. In International Conference on Machine Learning, pages 8821–8831. PMLR, 2021.
- Towards total recall in industrial anomaly detection, 2021.
- Same same but differnet: Semi-supervised defect detection with normalizing flows. In Proceedings of the IEEE/CVF winter conference on applications of computer vision, pages 1907–1916, 2021.
- Fully convolutional cross-scale-flows for image-based defect detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 1088–1097, 2022.
- Deep one-class classification. In International conference on machine learning, pages 4393–4402. Proceedings of Machine Learning Research, 2018.
- Deep semi-supervised anomaly detection. arXiv, 2019.
- A hierarchical transformation-discriminating generative model for few shot anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8495–8504, 2021.
- Memseg: A semi-supervised method for image surface defect detection using differences and commonalities. Engineering Applications of Artificial Intelligence, 119:105835, 2023.
- Patch svdd: Patch-level svdd for anomaly detection and segmentation. In Proceedings of the Asian Conference on Computer Vision, 2020.
- Fastflow: Unsupervised anomaly detection and localization via 2d normalizing flows. arXiv, 2021.
- Draem-a discriminatively trained reconstruction embedding for surface anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8330–8339, 2021.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.