Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images (2403.13375v1)
Abstract: Few-shot object detection (FSOD) has garnered significant research attention in remote sensing because it reduces the dependency on large amounts of annotated data. However, two challenges persist: (1) axis-aligned proposals can be misaligned with arbitrarily oriented objects, and (2) the scarcity of annotated data still limits performance on unseen object categories. To address these issues, we propose a novel FSOD method for remote sensing images called Few-shot Oriented object detection with Memorable Contrastive learning (FOMC). Specifically, we employ oriented bounding boxes instead of traditional horizontal bounding boxes to learn a better feature representation for arbitrarily oriented aerial objects, leading to enhanced detection performance. To the best of our knowledge, we are the first to address oriented object detection in the few-shot setting for remote sensing images. To reduce object misclassification, we introduce a supervised contrastive learning module with a dynamically updated memory bank. This module enables the use of large batches of negative samples and enhances the model's ability to learn discriminative features for unseen classes. We conduct comprehensive experiments on the DOTA and HRSC2016 datasets, and our model achieves state-of-the-art performance on the few-shot oriented object detection task. Code and pretrained models will be released.
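To make the memory-bank idea concrete, here is a minimal NumPy sketch of a supervised contrastive loss computed against a dynamically updated feature bank. It is an illustration only, not the authors' implementation: the names `MemoryBank` and `sup_con_loss`, the first-in-first-out update rule, and the temperature value are all assumptions introduced for this example, and the abstract does not specify any of them.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


class MemoryBank:
    """Hypothetical fixed-size FIFO store of (feature, label) pairs."""

    def __init__(self, capacity, dim):
        self.capacity = capacity
        self.feats = np.zeros((0, dim), dtype=np.float64)
        self.labels = np.zeros((0,), dtype=np.int64)

    def enqueue(self, feats, labels):
        # Append the new entries, then keep only the most recent `capacity` rows.
        self.feats = np.concatenate([self.feats, l2_normalize(feats)])[-self.capacity:]
        self.labels = np.concatenate([self.labels, labels])[-self.capacity:]


def sup_con_loss(queries, query_labels, bank, temperature=0.2):
    """Supervised contrastive loss of query features against the bank.

    Bank entries sharing a query's label act as positives; all other
    entries act as negatives. Queries with no positive in the bank are skipped.
    """
    if bank.feats.shape[0] == 0:
        return 0.0
    q = l2_normalize(queries)
    logits = q @ bank.feats.T / temperature           # shape (N, M)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    pos = query_labels[:, None] == bank.labels[None, :]
    n_pos = pos.sum(axis=1)
    valid = n_pos > 0
    if not valid.any():
        return 0.0
    per_query = -(log_prob * pos).sum(axis=1)[valid] / n_pos[valid]
    return float(per_query.mean())


# Usage: fill the bank from earlier batches, then score a new batch against it.
rng = np.random.default_rng(0)
demo_bank = MemoryBank(capacity=256, dim=16)
demo_bank.enqueue(rng.normal(size=(300, 16)), rng.integers(0, 5, size=300))
demo_loss = sup_con_loss(rng.normal(size=(8, 16)), rng.integers(0, 5, size=8), demo_bank)
```

Because the bank persists across iterations, the number of negatives seen by each query is set by the bank capacity rather than by the batch size, which is the property the abstract attributes to the memory bank.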