Few-Shot Object Detection with Sparse Context Transformers (2402.09315v1)

Published 14 Feb 2024 in cs.CV

Abstract: Few-shot detection is a major task in pattern recognition that seeks to localize objects using models trained on only a few labeled examples. One of the mainstream few-shot methods is transfer learning, which consists of pretraining a detection model in a source domain before fine-tuning it in a target domain. However, it is challenging for fine-tuned models to effectively identify new classes in the target domain, particularly when the underlying labeled training data are scarce. In this paper, we devise a novel sparse context transformer (SCT) that effectively leverages object knowledge in the source domain and automatically learns a sparse context from only a few training images in the target domain. As a result, it combines different relevant clues to enhance the discrimination power of the learned detectors and reduce class confusion. We evaluate the proposed method on two challenging few-shot object detection benchmarks, and empirical results show that it obtains competitive performance compared with related state-of-the-art methods.
