Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Label Matching Semi-Supervised Object Detection (2206.06608v1)

Published 14 Jun 2022 in cs.CV

Abstract: Semi-supervised object detection has made significant progress with the development of mean teacher driven self-training. Despite the promising results, the label mismatch problem is not yet fully explored in the previous works, leading to severe confirmation bias during self-training. In this paper, we delve into this problem and propose a simple yet effective LabelMatch framework from two different yet complementary perspectives, i.e., distribution-level and instance-level. For the former one, it is reasonable to approximate the class distribution of the unlabeled data from that of the labeled data according to Monte Carlo Sampling. Guided by this weakly supervision cue, we introduce a re-distribution mean teacher, which leverages adaptive label-distribution-aware confidence thresholds to generate unbiased pseudo labels to drive student learning. For the latter one, there exists an overlooked label assignment ambiguity problem across teacher-student models. To remedy this issue, we present a novel label assignment mechanism for self-training framework, namely proposal self-assignment, which injects the proposals from student into teacher and generates accurate pseudo labels to match each proposal in the student model accordingly. Experiments on both MS-COCO and PASCAL-VOC datasets demonstrate the considerable superiority of our proposed framework to other state-of-the-arts. Code will be available at https://github.com/hikvision-research/SSOD.

An Expert Overview of Label Matching Semi-Supervised Object Detection

The paper "Label Matching Semi-Supervised Object Detection" presents a sophisticated approach to improving semi-supervised object detection (SSOD) by addressing inherent label mismatch problems within the framework of the mean teacher model. The primary consideration in this research is to mitigate the confirmation bias that arises from the label mismatch issue during the self-training process in SSOD. This paper introduces the LabelMatch framework, which innovatively resolves this problem through both distribution-level and instance-level perspectives.

The central theme of the paper is the breakdown of the label mismatch issue into two interconnected dimensions:

  1. Distribution-level Mismatch: This issue highlights the inconsistencies between the class distribution of pseudo-labels and the ground truth labels when employing a single confidence threshold. The authors propose a re-distribution mean teacher model that utilizes Monte Carlo sampling to align the class distribution in unlabeled data with that of labeled data. An adaptive label-distribution-aware confidence threshold (ACT) is dynamically settled to generate unbiased pseudo-labels. This mechanism adapts according to the current state of the teacher model, driving an iteration process that rejuvenates the thresholds every specified number of iterations.
  2. Instance-level Mismatch: The paper suggests that existing frameworks overlook the ambiguity in label assignment across the teacher-student models during the training process. To tackle this, the paper presents a novel label assignment methodology, dubbed as proposal self-assignment. This involves integrating student-generated proposals into the teacher model to enhance pseudo-label accuracy for each proposal in the student model. This strategy differentiates itself by moving away from the IoU-based conventional assignment methods and instead provides a comprehensive soft learning mechanism with varying degrees of reliability for pseudo labels.

The treatment of the labeled and unlabeled datasets within these methodologies demonstrates a notable impact on standard datasets including MS-COCO and PASCAL-VOC, achieving state-of-the-art results. Particularly, the research showcases the potential in scenarios with scarce labels, outperforming existing methods with significant margins.

This research has broader implications not only for practical implementations but also for theoretical advancements in the field of SSOD. By maintaining class distribution consistency through adaptive thresholds, the authors introduce a layer of robustness to the pseudo-labeling process that could be beneficial for real-world applications where labeled data is limited. Furthermore, the proposal self-assignment approach presents a template for refining label assignments in other semi-supervised and domain-adaptive learning scenarios.

Looking forward, the methodologies proposed by this paper open up several avenues for further research. The reliability of the label-distribution-aware confidence thresholds in dynamic environments as well as exploring the scalability of the proposal self-assignment across different detection frameworks and complex scenarios are key areas of potential exploration. Furthermore, challenging the assumptions regarding the class-distribution similarities in domain-adaptive settings could enhance the applicable scope of LabelMatch, encouraging the development of more universally robust semi-supervised detection models.

In summary, this paper offers substantial advancements to the field of semi-supervised object detection through innovative methodological improvements, thereby setting a precedent for tackling label mismatch challenges that could significantly influence future research and applications in this domain.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Binbin Chen (33 papers)
  2. Weijie Chen (52 papers)
  3. Shicai Yang (18 papers)
  4. Yunyi Xuan (2 papers)
  5. Jie Song (217 papers)
  6. Di Xie (57 papers)
  7. Shiliang Pu (106 papers)
  8. Mingli Song (163 papers)
  9. Yueting Zhuang (164 papers)
Citations (66)