Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization (2208.05853v3)

Published 11 Aug 2022 in cs.CV

Abstract: Domain generalization (DG) aims at learning a model on source domains to well generalize on the unseen target domain. Although it has achieved great success, most of existing methods require the label information for all training samples in source domains, which is time-consuming and expensive in the real-world application. In this paper, we resort to solving the semi-supervised domain generalization (SSDG) task, where there are a few label information in each source domain. To address the task, we first analyze the theory of the multi-domain learning, which highlights that 1) mitigating the impact of domain gap and 2) exploiting all samples to train the model can effectively reduce the generalization error in each source domain so as to improve the quality of pseudo-labels. According to the analysis, we propose MultiMatch, i.e., extending FixMatch to the multi-task learning framework, producing the high-quality pseudo-label for SSDG. To be specific, we consider each training domain as a single task (i.e., local task) and combine all training domains together (i.e., global task) to train an extra task for the unseen test domain. In the multi-task framework, we utilize the independent BN and classifier for each task, which can effectively alleviate the interference from different domains during pseudo-labeling. Also, most of parameters in the framework are shared, which can be trained by all training samples sufficiently. Moreover, to further boost the pseudo-label accuracy and the model's generalization, we fuse the predictions from the global task and local task during training and testing, respectively. A series of experiments validate the effectiveness of the proposed method, and it outperforms the existing semi-supervised methods and the SSDG method on several benchmark DG datasets.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (68)
  1. Metareg: Towards domain generalization using meta-regularization. In Advances in Neural Information Processing Systems (NeurIPS) (2018), pp. 1006–1016.
  2. A theory of learning from different domains. Machine Learning (ML) 79, 1 (2010), 151–175.
  3. Remixmatch: Semi-supervised learning with distribution matching and augmentation anchoring. In International Conference on Learning Representations (ICLR) (2020).
  4. Mixmatch: A holistic approach to semi-supervised learning. In Advances in Neural Information Processing Systems (NeurIPS) (2019), pp. 5050–5060.
  5. Domain generalization by solving jigsaw puzzles. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019), pp. 2229–2238.
  6. Domain-specific batch normalization for unsupervised domain adaptation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019), pp. 7354–7362.
  7. Imagenet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2009), pp. 248–255.
  8. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
  9. Domain generalization via model-agnostic learning of semantic features. In Advances in Neural Information Processing Systems (NeurIPS) (2019), pp. 6447–6458.
  10. Dynamic graph learning convolutional networks for semi-supervised classification. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17, 1s (2021), 1–13.
  11. Alphamatch: Improving consistency for semi-supervised learning with alpha-divergence. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 13683–13692.
  12. DLOW: domain flow for adaptation and generalization. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019), pp. 2477–2486.
  13. Semi-supervised learning by entropy minimization. In Advances in Neural Information Processing Systems (NeurIPS) (2004), pp. 529–536.
  14. The elements of statistical learning: data mining, inference, and prediction, vol. 2. 2009.
  15. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), pp. 770–778.
  16. Arbitrary style transfer in real-time with adaptive instance normalization. In International Conference on Computer Vision (ICCV) (2017), pp. 1510–1519.
  17. Self-challenging improves cross-domain generalization. In European Conference on Computer Vision (ECCV) (2020), pp. 124–140.
  18. Multi-target domain adaptation with collaborative consistency learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 8187–8196.
  19. Feature stylization and domain-aware contrastive learning for domain generalization. In ACM International Conference on Multimedia (MM) (2021), pp. 22–31.
  20. Style normalization and restitution for domain generalization and adaptation. IEEE Transactions on Multimedia (TMM) 24 (2022), 3636–3651.
  21. Temporal ensembling for semi-supervised learning. In International Conference on Learning Representations (ICLR) (2017).
  22. Learning to generalize: Meta-learning for domain generalization. In AAAI Conference on Artificial Intelligence (AAAI) (2018), pp. 3490–3497.
  23. Deeper, broader and artier domain generalization. In International Conference on Computer Vision (ICCV) (2017), pp. 5543–5551.
  24. Episodic training for domain generalization. In International Conference on Computer Vision (ICCV) (2019), pp. 1446–1455.
  25. Domain generalization with adversarial feature learning. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018), pp. 5400–5409.
  26. Comatch: Semi-supervised learning with contrastive graph regularization. In International Conference on Computer Vision (ICCV) (2021), pp. 9475–9484.
  27. Deep domain generalization via conditional invariant adversarial networks. In European Conference on Computer Vision (ECCV) (2018), pp. 647–663.
  28. Category-stitch learning for union domain generalization. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) (2022).
  29. Domain generalization via encoding and resampling in a unified latent space. IEEE Transactions on Multimedia (TMM) (2021).
  30. A strong baseline and batch normalization neck for deep person re-identification. IEEE Transactions on Multimedia (TMM) 22, 10 (2020), 2597–2609.
  31. Domain generalization via invariant feature representation. In International Conference on Machine Learning (ICML) (2013), pp. 10–18.
  32. Reducing domain gap by reducing style bias. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 8690–8699.
  33. All labels are not created equal: Enhancing semi-supervision via label grouping and co-training. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 7241–7250.
  34. Daso: Distribution-aware semantics-oriented pseudo-label for imbalanced semi-supervised learning. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 9786–9796.
  35. Moment matching for multi-source domain adaptation. In International Conference on Computer Vision (ICCV) (2019), pp. 1406–1415.
  36. Greyreid: A novel two-stream deep framework with rgb-grey information for person re-identification. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17, 1 (2021), 27:1–27:22.
  37. A novel mix-normalization method for generalizable multi-source person re-identification. IEEE Transactions on Multimedia (TMM) (2022).
  38. Multi-component image translation for deep domain generalization. In IEEE Winter Conference on Applications of Computer Vision (WACV) (2019), pp. 579–588.
  39. Correlation-aware adversarial domain adaptation and generalization. Pattern Recognition (PR) 100 (2020), 107124.
  40. Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 39, 6 (2017), 1137–1149.
  41. Learning to optimize domain specific normalization for domain generalization. In European Conference on Computer Vision (ECCV) (2020), pp. 68–83.
  42. Generalizing across domains via cross-gradient training. In International Conference on Learning Representations (ICLR) (2018).
  43. Fixmatch: Simplifying semi-supervised learning with consistency and confidence. In Advances in Neural Information Processing Systems (NeurIPS) (2020).
  44. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems (NeurIPS) (2017), pp. 1195–1204.
  45. Visualizing data using t-sne. Journal of machine learning research (JMLR) 9, 11 (2008), 2579–2605.
  46. Deep hashing network for unsupervised domain adaptation. In CVPR (2017), pp. 5018–5027.
  47. Generalizing to unseen domains: A survey on domain generalization. In International Joint Conference on Artificial Intelligence (IJCAI) (2021), pp. 4627–4635.
  48. Debiased learning from naturally imbalanced pseudo-labels. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 14647–14657.
  49. Pointwise manifold regularization for semi-supervised learning. Frontiers of Computer Science (FCS) 15 (2021), 1–8.
  50. Variational disentanglement for domain generalization. arXiv preprint arXiv:2109.05826 (2021).
  51. Feature-based style randomization for domain generalization. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) 32, 8 (2022), 5495–5509.
  52. Domain-specific feature elimination: multi-source domain adaptation for image classification. Frontiers of Computer Science (FCS) 17, 4 (2023), 174705.
  53. A fourier-based framework for domain generalization. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 14383–14392.
  54. Towards corruption-agnostic robust domain adaptation. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 18, 4 (2022), 99:1–99:16.
  55. Class-aware contrastive semi-supervised learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2022), pp. 14421–14430.
  56. Domain randomization and pyramid consistency: Simulation-to-real generalization without accessing target domain data. In International Conference on Computer Vision (ICCV) (2019), pp. 2100–2110.
  57. Flexmatch: Boosting semi-supervised learning with curriculum pseudo labeling. In Advances in Neural Information Processing Systems (NeurIPS) (2021), pp. 18408–18419.
  58. Generalizable model-agnostic semantic segmentation via target-specific normalization. Pattern Recognition (PR) 122 (2022), 108292.
  59. Balanced and accurate pseudo-labels for semi-supervised image classification. ACM Transactions on Multimidia Computing Communications and Applications (TOMM) (2022).
  60. Domain generalization via entropy regularization. In Advances in Neural Information Processing Systems (NeurIPS) (2020).
  61. Dc-ssl: Addressing mismatched class distribution in semi-supervised learning. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 9757–9765.
  62. Simmatch: Semi-supervised learning with similarity matching. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 14471–14481.
  63. Domain generalization: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2022).
  64. Semi-supervised domain generalization with stochastic stylematch. arXiv preprint arXiv:2106.00592 (2021).
  65. Deep domain-adversarial image generation for domain generalisation. In AAAI Conference on Artificial Intelligence (AAAI) (2020), pp. 13025–13032.
  66. Domain adaptive ensemble learning. IEEE Transactions Image Process (TIP) 30 (2021), 8008–8018.
  67. Generalizable cross-modality medical image segmentation via style augmentation and dual normalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2022), pp. 20856–20865.
  68. Unpaired image-to-image translation using cycle-consistent adversarial networks. In International Conference on Computer Vision (ICCV) (2017), pp. 2242–2251.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Lei Qi (84 papers)
  2. Hongpeng Yang (3 papers)
  3. Yinghuan Shi (79 papers)
  4. Xin Geng (90 papers)
Citations (8)

Summary

We haven't generated a summary for this paper yet.