Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions (2404.11214v2)
Abstract: A significant challenge in the field of object detection lies in the system's performance under non-ideal imaging conditions, such as rain, fog, low illumination, or raw Bayer images that lack ISP processing. Our study introduces "Feature Corrective Transfer Learning", a novel approach that leverages transfer learning and a bespoke loss function to facilitate the end-to-end detection of objects in these challenging scenarios without the need to convert non-ideal images into their RGB counterparts. In our methodology, we first train a comprehensive model on a pristine RGB image dataset. Subsequently, a model is trained on non-ideal images while its feature maps are compared against those of the initial ideal RGB model. This comparison employs the Extended Area Novel Structural Discrepancy Loss (EANSDL), a novel loss function designed to quantify the similarity between the two sets of feature maps and integrate it into the detection loss. This approach refines the model's ability to perform object detection across varying conditions through direct feature map correction, encapsulating the essence of Feature Corrective Transfer Learning. Experimental validation on variants of the KITTI dataset shows a 3.8-8.1% relative improvement in mean Average Precision (mAP) under non-ideal conditions compared to the baseline model, with performance within 1.3% of the mAP@[0.5:0.95] achieved under ideal conditions by the standard Faster RCNN algorithm.
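The training objective described in the abstract, a detection loss plus a feature-map correction term measured against a model trained on ideal RGB images, can be illustrated with a minimal sketch. The abstract does not give the form of EANSDL, so a plain mean-squared discrepancy stands in for it here. The names `feature_discrepancy`, `combined_loss`, and `weight` are hypothetical, and feature maps are simplified to nested lists of floats.

```python
# Hypothetical sketch of "detection loss + feature correction term".
# The mean-squared discrepancy below is a stand-in and is NOT the paper's EANSDL.

def feature_discrepancy(student_map, teacher_map):
    """Mean squared difference between two equally shaped 2-D feature maps."""
    if len(student_map) != len(teacher_map):
        raise ValueError("feature maps must have the same number of rows")
    total, count = 0.0, 0
    for s_row, t_row in zip(student_map, teacher_map):
        if len(s_row) != len(t_row):
            raise ValueError("feature maps must have the same number of columns")
        for s, t in zip(s_row, t_row):
            total += (s - t) ** 2
            count += 1
    if count == 0:
        raise ValueError("feature maps must not be empty")
    return total / count


def combined_loss(detection_loss, student_maps, teacher_maps, weight=1.0):
    """Detection loss plus a weighted mean discrepancy over feature levels.

    teacher_maps: features of the ideal-RGB model (treated as fixed targets).
    student_maps: features of the model being trained on non-ideal images.
    """
    if not student_maps or len(student_maps) != len(teacher_maps):
        raise ValueError("need one teacher map per student map")
    terms = [feature_discrepancy(s, t) for s, t in zip(student_maps, teacher_maps)]
    correction = sum(terms) / len(terms)
    return detection_loss + weight * correction


# One feature level, 2x2 map; the student differs from the teacher in one cell.
teacher = [[[1.0, 2.0], [3.0, 4.0]]]
student = [[[1.0, 2.0], [3.0, 2.0]]]
loss = combined_loss(0.5, student, teacher, weight=0.1)
```

In a real detector the maps would be multi-channel tensors taken from the backbone, and `weight` would be a tuned hyperparameter; neither detail is specified in the abstract.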
Authors: Chuheng Wei, Guoyuan Wu, Matthew J. Barth