Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation (2310.00099v1)
Abstract: We propose a new semi-supervised learning design for human pose estimation that revisits the popular dual-student framework and enhances it in two ways. First, we introduce a denoising scheme to generate reliable pseudo-heatmaps as targets for learning from unlabeled data. This scheme uses multi-view augmentations and a threshold-and-refine procedure to produce a pool of pseudo-heatmaps. Second, we select the learning targets from this pool, guided by the estimated cross-student uncertainty. We evaluate the proposed method under multiple evaluation setups on the COCO benchmark. Our results show that our model outperforms previous state-of-the-art semi-supervised pose estimators, especially in the extreme low-data regime. For example, with only 0.5K labeled images, our method surpasses the best competitor by 7.22 mAP (a +25% relative improvement). We also demonstrate that our model can learn effectively from unlabeled data in the wild to further boost its generalization and performance.
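The two steps named in the abstract can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: the abstract does not specify the threshold-and-refine procedure or how cross-student uncertainty is estimated, so this sketch substitutes a fixed confidence threshold, a plain average over views as the "refine" step, and the distance between the two students' heatmap peaks as an uncertainty proxy. The function names (`denoise_pseudo_heatmaps`, `peak_coords`, `select_targets`) are hypothetical and not taken from the paper.

```python
import numpy as np


def denoise_pseudo_heatmaps(view_heatmaps, threshold=0.5):
    """Build a pool of pseudo-heatmaps from multi-view predictions.

    view_heatmaps: array (V, K, H, W), one prediction per augmented view,
    assumed to be already warped back to the original image frame.
    Returns an array (V + 1, K, H, W): each thresholded view plus their mean.
    """
    views = np.asarray(view_heatmaps, dtype=np.float64)
    # Threshold (assumed form): suppress low-confidence responses.
    thresholded = np.where(views >= threshold, views, 0.0)
    # Refine (assumed form): average the thresholded views into one ensemble.
    ensemble = thresholded.mean(axis=0, keepdims=True)
    return np.concatenate([thresholded, ensemble], axis=0)


def peak_coords(heatmaps):
    """Return the (row, col) of the maximum of each (H, W) heatmap."""
    h, w = heatmaps.shape[-2:]
    flat = heatmaps.reshape(*heatmaps.shape[:-2], h * w).argmax(axis=-1)
    return np.stack([flat // w, flat % w], axis=-1)


def select_targets(pool_a, pool_b):
    """Pick, per keypoint, the pool entry on which the two students agree most.

    pool_a, pool_b: arrays (P, K, H, W) produced by the two students.
    Peak disagreement stands in for cross-student uncertainty (an assumption).
    Returns the selected targets (K, H, W) and the chosen pool indices (K,).
    """
    disagreement = np.linalg.norm(
        peak_coords(pool_a) - peak_coords(pool_b), axis=-1
    )  # shape (P, K)
    best = disagreement.argmin(axis=0)  # shape (K,)
    k = np.arange(pool_a.shape[1])
    # Fuse the two students' heatmaps at the selected pool index.
    targets = 0.5 * (pool_a[best, k] + pool_b[best, k])
    return targets, best
```

In a dual-student training loop, targets selected this way would supervise the students on unlabeled images; an actual implementation would follow the paper's own definitions rather than these stand-ins.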