
Adaptive Self-Training for Object Detection (2212.05911v2)

Published 7 Dec 2022 in cs.CV

Abstract: Deep learning has emerged as an effective solution for solving the task of object detection in images, but at the cost of requiring large labeled datasets. To mitigate this cost, semi-supervised object detection methods, which leverage abundant unlabeled data, have been proposed and have already shown impressive results. However, most of these methods require linking a pseudo-label to a ground-truth object by thresholding. In previous works, this threshold value is usually determined empirically, which is time-consuming, and only done for a single data distribution. When the domain, and thus the data distribution, changes, a new and costly parameter search is necessary. In this work, we introduce our method Adaptive Self-Training for Object Detection (ASTOD), which is a simple yet effective teacher-student method. ASTOD determines without cost a threshold value based directly on the ground value of the score histogram. To improve the quality of the teacher predictions, we also propose a novel pseudo-labeling procedure. We use different views of the unlabeled images during the pseudo-labeling step to reduce the number of missed predictions and thus obtain better candidate labels. Our teacher and our student are trained separately, and our method can be used in an iterative fashion by replacing the teacher with the student. On the MS-COCO dataset, our method consistently performs favorably against state-of-the-art methods that do not require a threshold parameter, and shows competitive results with methods that require a parameter sweep search. Additional experiments with respect to a supervised baseline on the DIOR dataset containing satellite images lead to similar conclusions, and prove that it is possible to adapt the score threshold automatically in self-training, regardless of the data distribution. The code is available at https://github.com/rvandeghen/ASTOD
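
The thresholding idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes that the "ground value of the score histogram" means the least-populated bin of a histogram of teacher confidence scores, and that the threshold is taken as the left edge of that bin. The function names `histogram_valley_threshold` and `filter_pseudo_labels`, the bin count, and the `(label, score)` detection format are all hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple


def histogram_valley_threshold(scores: Sequence[float], num_bins: int = 10) -> float:
    """Pick a score threshold from the least-populated histogram bin.

    Assumption (not from the paper): scores lie in [0, 1] and the
    threshold is the left edge of the emptiest bin. Ties are resolved
    in favor of the lowest-scoring bin.
    """
    if not scores:
        raise ValueError("scores must not be empty")
    counts = [0] * num_bins
    for s in scores:
        # Clamp so that a score of exactly 1.0 falls into the last bin.
        idx = min(int(s * num_bins), num_bins - 1)
        counts[idx] += 1
    valley = min(range(num_bins), key=lambda i: counts[i])
    return valley / num_bins


def filter_pseudo_labels(
    detections: Sequence[Tuple[str, float]], threshold: float
) -> List[Tuple[str, float]]:
    """Keep only the detections whose score reaches the threshold."""
    return [d for d in detections if d[1] >= threshold]


# Hypothetical teacher scores: many low, many high, few in between.
teacher_scores = [0.05, 0.08, 0.12, 0.15, 0.22, 0.35, 0.55, 0.85, 0.92, 0.95, 0.97]
threshold = histogram_valley_threshold(teacher_scores, num_bins=5)
pseudo_labels = filter_pseudo_labels([("cat", 0.9), ("dog", 0.3)], threshold)
```

Because the threshold is read off the score distribution itself, no sweep over candidate values is needed when the data distribution changes; only the histogram has to be recomputed on the new unlabeled set.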

Authors (3)
  1. Renaud Vandeghen (7 papers)
  2. Gilles Louppe (68 papers)
  3. Marc Van Droogenbroeck (48 papers)
Citations (6)
