Humble Teachers Teach Better Students for Semi-Supervised Object Detection (2106.10456v1)

Published 19 Jun 2021 in cs.CV

Abstract: We propose a semi-supervised approach for contemporary object detectors following the teacher-student dual model framework. Our method is featured with 1) the exponential moving averaging strategy to update the teacher from the student online, 2) using plenty of region proposals and soft pseudo-labels as the student's training targets, and 3) a light-weighted detection-specific data ensemble for the teacher to generate more reliable pseudo-labels. Compared to the recent state-of-the-art -- STAC, which uses hard labels on sparsely selected hard pseudo samples, the teacher in our model exposes richer information to the student with soft-labels on many proposals. Our model achieves COCO-style AP of 53.04% on VOC07 val set, 8.4% better than STAC, when using VOC12 as unlabeled data. On MS-COCO, it outperforms prior work when only a small percentage of data is taken as labeled. It also reaches 53.8% AP on MS-COCO test-dev with 3.1% gain over the fully supervised ResNet-152 Cascaded R-CNN, by tapping into unlabeled data of a similar size to the labeled data.

PDF Abstract

Semi-Supervised Object Detection via Humble Teacher Approach

In contemporary object detection tasks, the scarcity of annotated data poses significant challenges, particularly for large-scale applications. Building upon a dual-model framework, this paper presents the Humble Teacher methodology as a novel semi-supervised learning approach to enhance object detection systems. Utilizing the teacher-student paradigm, the Humble Teacher introduces several pivotal strategies that differentiate it from prior works and result in considerable improvements in detection performance.

Teacher-Student Model and Methodological Enhancements

The core of the paper's approach is leveraging the teacher-student model for semi-supervised object detection. The teacher model guides the student model using pseudo-labels derived from unlabeled data. This method incorporates the exponential moving averaging (EMA) strategy, assimilating fresh feedback more resiliently, ensuring a steadier transition between model updates compared to fixed-parameter models. The Humble Teacher model differs significantly from STAC, a prior technique, by adopting the use of soft pseudo-labels instead of hard labels. Soft labels provide more nuanced information about object presence and locations within an image, which can enhance the learning process by preserving the distribution over class probabilities and bounding box regression offsets.

Performance and Results

Empirical evidence underscores the superiority of the Humble Teacher model across extensive object detection datasets. On the VOC07 validation set, it achieves a COCO-style Average Precision (AP) of 53.04%, outperforming STAC by 8.4%. Similarly, when tested on the MS-COCO dataset, and leveraging a balance between labeled and unlabeled data, the approach surpasses the performances of state-of-the-art detectors. For instance, when comparing models trained with different percentages of labeled data, Humble Teacher consistently demonstrates a marked improvement, showcasing mAP gains up to 9.23% over the supervised baseline at varying labeling percentages.

Theoretical Implications and Future Directions

The ability of Humble Teacher to leverage weakly labeled data offers significant implications for semi-supervised learning in object detection. It closes the performance gap between fully-supervised models and those using minimal labeled instances. From a theoretical standpoint, the methodology extends the understanding of teacher-student dynamics in the semi-supervised domain, emphasizing the role of data representations that encompass broader informational content. The utilization of task-specific ensembles further enhances reliability and efficacy, suggesting avenues for future exploration in model augmentation and improvement.

Looking forward, there is potential for refining this approach to accommodate continually expanding datasets and novel object classes, further optimizing the balance between labeled and unlabeled data. Additionally, exploring deeper integration with other learning paradigms such as self-supervised learning could provide further gains in performance and applicability, evolutionarily steering the development of autonomous detection systems.

In conclusion, the Humble Teacher emerges as a significant advancement in semi-supervised object detection, providing a robust framework that effectively utilizes unlabeled data to achieve enhanced detection results. The paper enriches the semi-supervised learning landscape, positioning such methodologies as viable contenders against traditional fully-supervised models in resource-constrained environments.

PDF Markdown Bookmark Chat (Pro)

Authors (4)

Yihe Tang (5 papers)
Weifeng Chen (22 papers)
Yijun Luo (4 papers)
Yuting Zhang (30 papers)

Citations (161)

View on Semantic Scholar

Humble Teachers Teach Better Students for Semi-Supervised Object Detection (2106.10456v1)

Semi-Supervised Object Detection via Humble Teacher Approach

Teacher-Student Model and Methodological Enhancements

Performance and Results

Theoretical Implications and Future Directions

Related Papers