NormAUG: Normalization-guided Augmentation for Domain Generalization (2307.13492v2)
Abstract: Deep learning has advanced supervised learning significantly. However, models trained in this setting often suffer from domain shift between the training and test sets, which causes a marked drop in test performance. To address this issue, several domain generalization methods have been developed to learn robust, domain-invariant features from multiple training domains that generalize well to unseen test domains. Data augmentation plays a crucial role in this goal by increasing the diversity of the training data. In this paper, inspired by the observation that normalizing an image with statistics computed from batches drawn from different domains perturbs its features, we propose a simple yet effective method called NormAUG (Normalization-guided Augmentation). Our method has two paths: the main path and the auxiliary (augmented) path. During training, the auxiliary path comprises multiple sub-paths, each corresponding to batch normalization over a single domain or a random combination of multiple domains. This introduces diverse information at the feature level and improves the generalization of the main path. Moreover, from a theoretical perspective, NormAUG reduces the existing upper bound on generalization. At test time, we use an ensemble strategy to combine the predictions from the auxiliary path of our model, further boosting performance. Extensive experiments on multiple benchmark datasets validate the effectiveness of the proposed method.
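The observation behind the auxiliary path can be illustrated with a minimal sketch. This is not the authors' implementation: the domain names, the toy one-dimensional feature values, the helper names, and the choice of exactly one random multi-domain combination per step are all assumptions made for illustration. The sketch only shows that the same feature value, normalized with statistics from different domain groupings, yields different normalized values.

```python
import math
import random


def batch_stats(values):
    """Mean and (biased) variance of a batch, as used by batch normalization."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, var


def normalize(x, mean, var, eps=1e-5):
    """Normalize a scalar feature with the given batch statistics."""
    return (x - mean) / math.sqrt(var + eps)


def sub_path_groups(domains, rng):
    """Hypothetical grouping: one sub-path per single domain, plus one
    randomly chosen combination of two or more domains."""
    groups = [(d,) for d in domains]
    k = rng.randint(2, len(domains))
    groups.append(tuple(sorted(rng.sample(domains, k))))
    return groups


# Toy per-domain feature batches (made-up numbers, one scalar per sample).
features = {
    "photo": [0.9, 1.1, 1.0, 1.2],
    "sketch": [-2.0, -1.5, -2.5, -2.0],
    "cartoon": [4.0, 5.0, 6.0, 5.0],
}

rng = random.Random(0)
groups = sub_path_groups(sorted(features), rng)

x = 1.0  # the single feature value that each sub-path normalizes
views = {}
for group in groups:
    # Pool the samples of every domain in the group, then normalize x
    # with the pooled statistics.
    pooled = [v for d in group for v in features[d]]
    mean, var = batch_stats(pooled)
    views[group] = normalize(x, mean, var)

for group, value in views.items():
    print(group, round(value, 3))
```

Each entry of `views` is a differently perturbed version of the same feature. In a real network these statistics would come from separate batch normalization layers applied to feature maps rather than scalars.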