DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models (2401.01659v1)
Abstract: Object detection models, represented by the YOLO series, are widely used and achieve strong results on high-quality datasets, but not all working conditions are ideal. To address the problem of locating targets in low-quality data, existing methods either train a new object detection network or require a large collection of low-quality data for training. In this paper we instead propose a framework, called DiffYOLO, and apply it to YOLO models. Specifically, we extract feature maps from denoising diffusion probabilistic models to enhance well-trained detectors, which allows us to fine-tune YOLO on high-quality datasets and test on low-quality datasets. The results show that this framework improves not only performance on noisy datasets but also detection results on high-quality test datasets. We will supplement more experiments later (with various datasets and network architectures).
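The abstract does not say how the diffusion feature maps are combined with the detector's features. As a rough illustration only, the sketch below assumes a residual fusion: a feature map from a frozen denoiser is projected to the detector's channel count with a 1x1 convolution and added to the detector's feature map. The function name `fuse_features`, the shapes, and the zero-initialized projection are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def fuse_features(det_feat, diff_feat, proj):
    """Add projected diffusion features to detector features (hypothetical scheme).

    det_feat:  (C_det, H, W) feature map from the detector backbone
    diff_feat: (C_diff, H, W) feature map from a frozen denoiser, same H and W
    proj:      (C_det, C_diff) weights of a 1x1 convolution (assumed adapter)
    """
    if det_feat.shape[1:] != diff_feat.shape[1:]:
        raise ValueError("spatial sizes must match before fusion")
    # A 1x1 convolution is a per-pixel linear map over the channel axis.
    projected = np.einsum("oc,chw->ohw", proj, diff_feat)
    return det_feat + projected

rng = np.random.default_rng(0)
det_feat = rng.standard_normal((8, 4, 4))    # stand-in for a YOLO feature map
diff_feat = rng.standard_normal((16, 4, 4))  # stand-in for a denoiser feature map
proj = np.zeros((8, 16))                     # zero init: output equals det_feat
fused = fuse_features(det_feat, diff_feat, proj)
```

With the projection initialized to zero, the fused map equals the detector's original features, so under this assumption fine-tuning would start from the behavior of the well-trained model and only gradually mix in the diffusion features.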