Few-Shot Object Detection with Sparse Context Transformers (2402.09315v1)
Abstract: Few-shot detection is a major task in pattern recognition which seeks to localize objects using models trained with few labeled data. One of the mainstream few-shot methods is transfer learning which consists in pretraining a detection model in a source domain prior to its fine-tuning in a target domain. However, it is challenging for fine-tuned models to effectively identify new classes in the target domain, particularly when the underlying labeled training data are scarce. In this paper, we devise a novel sparse context transformer (SCT) that effectively leverages object knowledge in the source domain, and automatically learns a sparse context from only few training images in the target domain. As a result, it combines different relevant clues in order to enhance the discrimination power of the learned detectors and reduce class confusion. We evaluate the proposed method on two challenging few-shot object detection benchmarks, and empirical results show that the proposed method obtains competitive performance compared to the related state-of-the-art.
- “Few-shot object detection: A comprehensive survey,” arXiv preprint arXiv:2112.11699, 2021.
- H. Sahbi, D. Geman. A hierarchy of support vector machines for pattern detection. Journal of Machine Learning Research 7.Oct (2006): 2087-2123.
- “A comparative review of recent few-shot object detection algorithms,” arXiv preprint arXiv:2111.00201, 2021.
- H. Sahbi and N. Boujemaa. ”Coarse-to-fine support vector classifiers for face detection.” Object recognition supported by user interaction for service robots. Vol. 3. IEEE, 2002.
- “Defrcn: Decoupled faster r-cnn for few-shot object detection,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 8681–8690.
- “Few-shot object detection via classification refinement and distractor retreatment,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 15395–15403.
- H. Sahbi and N. Boujemaa. ”From coarse to fine skin and face detection.” Proceedings of the eighth ACM international conference on Multimedia. 2000.
- “FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021.
- Ross Girshick, “Fast r-cnn,” in 2015 IEEE International Conference on Computer Vision (ICCV), Dec 2015.
- SSD: Single Shot MultiBox Detector, p. 21–37, Jan 2016.
- “LSTD: A low-shot transfer detector for object detection,” CoRR, vol. abs/1803.01529, 2018.
- “UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021.
- M. Jiu and H. Sahbi. ”Deep representation design from deep kernel networks.” Pattern Recognition 88 (2019): 447-457.
- “ Context-Transformer: Tackling Object Confusion for Few-Shot Detection,” Proceedings of the AAAI Conference on Artificial Intelligence, p. 12653–12660, Jun 2020.
- “Simultaneous Deep Transfer Across Domains and Tasks,” in 2015 IEEE International Conference on Computer Vision (ICCV), Dec 2015.
- “Pyramid Scene Parsing Network,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul 2017.
- H. Sahbi, J.-Y. Audibert, and R. Keriven, “Context-dependent kernels for object classification,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, pp. 699–708, 2011.
- “Incremental Learning of Object Detectors without Catastrophic Forgetting,” in 2017 IEEE International Conference on Computer Vision (ICCV), Oct 2017.
- “Few-shot Object Detection via Feature Reweighting,” in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2019.
- “Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning,” in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2019.
- “Dense relation distillation with context-aware aggregation for few-shot object detection,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021.
- “Cos R-CNN for Online Few-shot Object Detection,” Jul 2023.
- A. Mazari and H. Sahbi. ”MLGCN: Multi-Laplacian graph convolutional networks for human action recognition.” The British Machine Vision Conference (BMVC). 2019.
- Microsoft COCO: Common Objects in Context, p. 740–755, Jan 2014.
- Receptive Field Block Net for Accurate and Fast Object Detection, p. 404–419, Jan 2018.
- “Automatic differentiation in pytorch,” Oct 2017.