Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch (2405.16093v1)

Published 25 May 2024 in cs.CV

Abstract: Semi-supervised learning can significantly boost model performance by leveraging unlabeled data, particularly when labeled data is scarce. However, real-world unlabeled data often contain unseen-class samples, which can hinder the classification of seen classes. To address this issue, mainstream safe SSL methods suggest detecting and discarding unseen-class samples from unlabeled data. Nevertheless, these methods typically employ a single-model strategy to simultaneously tackle both the classification of seen classes and the detection of unseen classes. Our research indicates that such an approach may lead to conflicts during training, resulting in suboptimal model optimization. Inspired by this, we introduce a novel framework named Diverse Teacher-Students (\textbf{DTS}), which uniquely utilizes dual teacher-student models to individually and effectively handle these two tasks. DTS employs a novel uncertainty score to softly separate unseen-class and seen-class data from the unlabeled set, and intelligently creates an additional ($K$+1)-th class supervisory signal for training. By training both teacher-student models with all unlabeled samples, DTS can enhance the classification of seen classes while simultaneously improving the detection of unseen classes. Comprehensive experiments demonstrate that DTS surpasses baseline methods across a variety of datasets and configurations. Our code and models can be publicly accessible on the link https://github.com/Zhanlo/DTS.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (49)
  1. Remixmatch: Semi-supervised learning with distribution matching and augmentation anchoring, in: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, OpenReview.net. URL: https://openreview.net/forum?id=HklkeR4KPB.
  2. Mixmatch: A holistic approach to semi-supervised learning, in: Wallach, H.M., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E.B., Garnett, R. (Eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, pp. 5050–5060. URL: https://proceedings.neurips.cc/paper/2019/hash/1cd138d0499a68f4bb72bee04bbec2d7-Abstract.html.
  3. Curriculum labeling: Revisiting pseudo-labeling for semi-supervised learning, in: Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021, AAAI Press. pp. 6912–6920. URL: https://doi.org/10.1609/aaai.v35i8.16852, doi:10.1609/AAAI.V35I8.16852.
  4. Improving the performance of semi-supervised person re-identification by selecting reliable unlabeled samples. Engineering Applications of Artificial Intelligence 133, 108367. URL: https://www.sciencedirect.com/science/article/pii/S0952197624005256, doi:https://doi.org/10.1016/j.engappai.2024.108367.
  5. Semi-supervised learning under class distribution mismatch, in: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020, AAAI Press. pp. 3569–3576. URL: https://doi.org/10.1609/aaai.v34i04.5763, doi:10.1609/AAAI.V34I04.5763.
  6. An analysis of single-layer networks in unsupervised feature learning, in: Gordon, G.J., Dunson, D.B., Dudík, M. (Eds.), Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11-13, 2011, JMLR.org. pp. 215–223. URL: http://proceedings.mlr.press/v15/coates11a/coates11a.pdf.
  7. Imagenet: A large-scale hierarchical image database, in: 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 20-25 June 2009, Miami, Florida, USA, IEEE Computer Society. pp. 248–255. URL: https://doi.org/10.1109/CVPR.2009.5206848, doi:10.1109/CVPR.2009.5206848.
  8. A survey on semi-supervised learning. Mach. Learn. 109, 373–440. URL: https://doi.org/10.1007/s10994-019-05855-6, doi:10.1007/S10994-019-05855-6.
  9. Safe deep semi-supervised learning for unseen-class unlabeled data, in: Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event, PMLR. pp. 3897–3906. URL: http://proceedings.mlr.press/v119/guo20i.html.
  10. Accurate screening of COVID-19 using attention-based deep 3d multiple instance learning. IEEE Trans. Medical Imaging 39, 2584–2594. URL: https://doi.org/10.1109/TMI.2020.2996256, doi:10.1109/TMI.2020.2996256.
  11. Masked autoencoders are scalable vision learners, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, IEEE. pp. 15979–15988. URL: https://doi.org/10.1109/CVPR52688.2022.01553, doi:10.1109/CVPR52688.2022.01553.
  12. Momentum contrast for unsupervised visual representation learning, in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020, Computer Vision Foundation / IEEE. pp. 9726–9735. URL: https://doi.org/10.1109/CVPR42600.2020.00975, doi:10.1109/CVPR42600.2020.00975.
  13. Safe-student for safe deep semi-supervised learning with unseen-class unlabeled data, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, IEEE. pp. 14565–14574. URL: https://doi.org/10.1109/CVPR52688.2022.01418, doi:10.1109/CVPR52688.2022.01418.
  14. SAFER-STUDENT for safe deep semi-supervised learning with unseen-class unlabeled data. IEEE Trans. Knowl. Data Eng. 36, 318–334. URL: https://doi.org/10.1109/TKDE.2023.3279139, doi:10.1109/TKDE.2023.3279139.
  15. They are not completely useless: Towards recycling transferable unlabeled data for class-mismatched semi-supervised learning. IEEE Trans. Multim. 25, 1844–1857. URL: https://doi.org/10.1109/TMM.2022.3179895, doi:10.1109/TMM.2022.3179895.
  16. Label propagation for deep semi-supervised learning, in: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, Computer Vision Foundation / IEEE. pp. 5070–5079. URL: http://openaccess.thecvf.com/content_CVPR_2019/html/Iscen_Label_Propagation_for_Deep_Semi-Supervised_Learning_CVPR_2019_paper.html, doi:10.1109/CVPR.2019.00521.
  17. Learning multiple layers of features from tiny images .
  18. Imagenet classification with deep convolutional neural networks, in: Bartlett, P.L., Pereira, F.C.N., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (Eds.), Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012, Lake Tahoe, Nevada, United States, pp. 1106–1114. URL: https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html.
  19. Temporal ensembling for semi-supervised learning, in: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings, OpenReview.net. URL: https://openreview.net/forum?id=BJ6oOfqge.
  20. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks, in: ICML, Atlanta. p. 896.
  21. Comatch: Semi-supervised learning with contrastive graph regularization, in: 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021, IEEE. pp. 9455–9464. URL: https://doi.org/10.1109/ICCV48922.2021.00934, doi:10.1109/ICCV48922.2021.00934.
  22. Iomatch: Simplifying open-set semi-supervised learning with joint inliers and outliers utilization, in: IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, IEEE. pp. 15824–15833. URL: https://doi.org/10.1109/ICCV51070.2023.01454, doi:10.1109/ICCV51070.2023.01454.
  23. Microsoft COCO: common objects in context, in: Fleet, D.J., Pajdla, T., Schiele, B., Tuytelaars, T. (Eds.), Computer Vision - ECCV 2014 - 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V, Springer. pp. 740–755. URL: https://doi.org/10.1007/978-3-319-10602-1_48, doi:10.1007/978-3-319-10602-1\_48.
  24. Semi-supervised object detection with uncurated unlabeled data for remote sensing images. International Journal of Applied Earth Observation and Geoinformation 129, 103814. URL: https://www.sciencedirect.com/science/article/pii/S1569843224001687, doi:https://doi.org/10.1016/j.jag.2024.103814.
  25. Semi-supervised federated learning on evolving data streams. Information Sciences 643, 119235. URL: https://www.sciencedirect.com/science/article/pii/S0020025523008204, doi:https://doi.org/10.1016/j.ins.2023.119235.
  26. Virtual adversarial training: A regularization method for supervised and semi-supervised learning. IEEE Trans. Pattern Anal. Mach. Intell. 41, 1979–1993. URL: https://doi.org/10.1109/TPAMI.2018.2858821, doi:10.1109/TPAMI.2018.2858821.
  27. Reading digits in natural images with unsupervised feature learning, in: NIPS workshop on deep learning and unsupervised feature learning, Granada, Spain. p. 7.
  28. Realistic evaluation of deep semi-supervised learning algorithms, in: Bengio, S., Wallach, H.M., Larochelle, H., Grauman, K., Cesa-Bianchi, N., Garnett, R. (Eds.), Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, pp. 3239–3250. URL: https://proceedings.neurips.cc/paper/2018/hash/c1fea270c48e8079d8ddf7d06d26ab52-Abstract.html.
  29. Not all unlabeled data are equal: Learning to weight data in semi-supervised learning, in: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M., Lin, H. (Eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. URL: https://proceedings.neurips.cc/paper/2020/hash/f7ac67a9aa8d255282de7d11391e1b69-Abstract.html.
  30. In defense of pseudo-labeling: An uncertainty-aware pseudo-label selection framework for semi-supervised learning, in: 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021, OpenReview.net. URL: https://openreview.net/forum?id=-ODN6SbiUU.
  31. Openmatch: Open-set semi-supervised learning with open-set consistency regularization, in: Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W. (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc.. pp. 25956–25967. URL: https://proceedings.neurips.cc/paper_files/paper/2021/file/da11e8cd1811acb79ccf0fd62cd58f86-Paper.pdf.
  32. Regularization with stochastic transformations and perturbations for deep semi-supervised learning, in: Lee, D.D., Sugiyama, M., von Luxburg, U., Guyon, I., Garnett, R. (Eds.), Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, pp. 1163–1171. URL: https://proceedings.neurips.cc/paper/2016/hash/30ef30b64204a3088a26bc2e6ecf7602-Abstract.html.
  33. Very deep convolutional networks for large-scale image recognition, in: Bengio, Y., LeCun, Y. (Eds.), 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings. URL: http://arxiv.org/abs/1409.1556.
  34. Fixmatch: Simplifying semi-supervised learning with consistency and confidence, in: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M., Lin, H. (Eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. URL: https://proceedings.neurips.cc/paper/2020/hash/06964dce9addb1c5cb5d6e3d9838f733-Abstract.html.
  35. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, in: Guyon, I., von Luxburg, U., Bengio, S., Wallach, H.M., Fergus, R., Vishwanathan, S.V.N., Garnett, R. (Eds.), Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pp. 1195–1204. URL: https://proceedings.neurips.cc/paper/2017/hash/68053af2923e00204c3ca7c6a3150cf7-Abstract.html.
  36. Semi-supervised learning by olivier chapelle, bernhard schölkopf, and alexander zien (review). IEEE Trans. Neural Networks 20, 542. URL: https://doi.org/10.1109/TNN.2009.2015974, doi:10.1109/TNN.2009.2015974.
  37. Imbalanced semi-supervised learning with bias adaptive classifier, in: The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023, OpenReview.net. URL: https://openreview.net/pdf?id=rVM8wD2G7Dy.
  38. Unsupervised selective labeling for more effective semi-supervised learning, in: Avidan, S., Brostow, G.J., Cissé, M., Farinella, G.M., Hassner, T. (Eds.), Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXX, Springer. pp. 427–445. URL: https://doi.org/10.1007/978-3-031-20056-4_25, doi:10.1007/978-3-031-20056-4\_25.
  39. USB: A unified semi-supervised learning benchmark for classification, in: Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K., Oh, A. (Eds.), Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. URL: http://papers.nips.cc/paper_files/paper/2022/hash/190dd6a5735822f05646dc27decff19b-Abstract-Datasets_and_Benchmarks.html.
  40. Freematch: Self-adaptive thresholding for semi-supervised learning, in: The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023, OpenReview.net. URL: https://openreview.net/pdf?id=PDrUPTXJI_A.
  41. Out-of-distributed semantic pruning for robust semi-supervised learning, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023, IEEE. pp. 23849–23858. URL: https://doi.org/10.1109/CVPR52729.2023.02284, doi:10.1109/CVPR52729.2023.02284.
  42. Self-training with noisy student improves imagenet classification, in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020, Computer Vision Foundation / IEEE. pp. 10684–10695. URL: https://openaccess.thecvf.com/content_CVPR_2020/html/Xie_Self-Training_With_Noisy_Student_Improves_ImageNet_Classification_CVPR_2020_paper.html, doi:10.1109/CVPR42600.2020.01070.
  43. Shrinking class space for enhanced certainty in semi-supervised learning, in: IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, IEEE. pp. 16141–16150. URL: https://doi.org/10.1109/ICCV51070.2023.01483, doi:10.1109/ICCV51070.2023.01483.
  44. A survey on deep semi-supervised learning. CoRR abs/2103.00550. URL: https://arxiv.org/abs/2103.00550, arXiv:2103.00550.
  45. Multi-task curriculum framework for open-set semi-supervised learning, in: Vedaldi, A., Bischof, H., Brox, T., Frahm, J. (Eds.), Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XII, Springer. pp. 438–454. URL: https://doi.org/10.1007/978-3-030-58610-2_26, doi:10.1007/978-3-030-58610-2\_26.
  46. Wide residual networks, in: Wilson, R.C., Hancock, E.R., Smith, W.A.P. (Eds.), Proceedings of the British Machine Vision Conference 2016, BMVC 2016, York, UK, September 19-22, 2016, BMVA Press. URL: http://www.bmva.org/bmvc/2016/papers/paper087/index.html.
  47. Flexmatch: Boosting semi-supervised learning with curriculum pseudo labeling, in: Ranzato, M., Beygelzimer, A., Dauphin, Y.N., Liang, P., Vaughan, J.W. (Eds.), Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, pp. 18408–18419. URL: https://proceedings.neurips.cc/paper/2021/hash/995693c15f439e3d189b06e89d145dd5-Abstract.html.
  48. Openood v1.5: Enhanced benchmark for out-of-distribution detection. CoRR abs/2306.09301. URL: https://doi.org/10.48550/arXiv.2306.09301, doi:10.48550/ARXIV.2306.09301, arXiv:2306.09301.
  49. Simmatch: Semi-supervised learning with similarity matching, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, IEEE. pp. 14451–14461. URL: https://doi.org/10.1109/CVPR52688.2022.01407, doi:10.1109/CVPR52688.2022.01407.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com