Invariant Representation via Decoupling Style and Spurious Features from Images (2312.06226v2)
Abstract: This paper considers the out-of-distribution (OOD) generalization problem under the setting that both style distribution shift and spurious features exist and domain labels are missing. This setting frequently arises in real-world applications and is underlooked because previous approaches mainly handle either of these two factors. The critical challenge is decoupling style and spurious features in the absence of domain labels. To address this challenge, we first propose a structural causal model (SCM) for the image generation process, which captures both style distribution shift and spurious features. The proposed SCM enables us to design a new framework called IRSS, which can gradually separate style distribution and spurious features from images by introducing adversarial neural networks and multi-environment optimization, thus achieving OOD generalization. Moreover, it does not require additional supervision (e.g., domain labels) other than the images and their corresponding labels. Experiments on benchmark datasets demonstrate that IRSS outperforms traditional OOD methods and solves the problem of Invariant risk minimization (IRM) degradation, enabling the extraction of invariant features under distribution shift.
- Invariant risk minimization. arXiv preprint arXiv:1907.02893.
- Decaug: Out-of-distribution generalization via decomposed feature representation and semantic augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, 6705–6713.
- Domain generalization by solving jigsaw puzzles. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2229–2238.
- Invariant rationalization. In International Conference on Machine Learning, 1448–1458. PMLR.
- Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks. In 2018 IEEE winter conference on applications of computer vision (WACV), 839–847. IEEE.
- Pareto invariant risk minimization. arXiv preprint arXiv:2206.07766.
- Domain generalization with domain-specific aggregation modules. In Pattern Recognition: 40th German Conference, GCPR 2018, Stuttgart, Germany, October 9-12, 2018, Proceedings 40, 187–198. Springer.
- Adversarially adaptive normalization for single domain generalization. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, 8208–8217.
- Unbiased metric learning: On the utilization of multiple datasets and web images for softening bias. In Proceedings of the IEEE International Conference on Computer Vision, 1657–1664.
- Unsupervised domain adaptation by backpropagation. In International conference on machine learning, 1180–1189. PMLR.
- Domain-adversarial training of neural networks. The journal of machine learning research, 17(1): 2096–2030.
- In search of lost domain generalization. arXiv preprint arXiv:2007.01434.
- Towards non-iid image classification: A dataset and baselines. Pattern Recognition, 110: 107383.
- Segment anything. arXiv preprint arXiv:2304.02643.
- Out-of-distribution generalization via risk extrapolation (rex). In International Conference on Machine Learning, 5815–5826. PMLR.
- Invariant information bottleneck for domain generalization. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, 7399–7407.
- Deeper, broader and artier domain generalization. In Proceedings of the IEEE international conference on computer vision, 5542–5550.
- Demystifying neural style transfer. arXiv preprint arXiv:1701.01036.
- Bayesian invariant risk minimization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16021–16030.
- An empirical study of invariant risk minimization on deep models. In ICML 2021 Workshop on Uncertainty and Robustness in Deep Learning, volume 1, 7.
- Learning transferable features with deep adaptation networks. In International conference on machine learning, 97–105. PMLR.
- Towards recognizing unseen categories in unseen domains. In European Conference on Computer Vision, 466–483. Springer.
- Best sources forward: domain generalization through source-specific nets. In 2018 25th IEEE international conference on image processing (ICIP), 1353–1357. IEEE.
- Domain generalization using a mixture of multiple latent domains. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, 11749–11756.
- Causal inference by using invariant prediction: identification and confidence intervals. Journal of the Royal Statistical Society Series B: Statistical Methodology, 78(5): 947–1012.
- The risks of invariant risk minimization. arXiv preprint arXiv:2010.05761.
- Unbiased look at dataset bias. In CVPR 2011, 1521–1528. IEEE.
- Adversarial discriminative domain adaptation. In Proceedings of the IEEE conference on computer vision and pattern recognition, 7167–7176.
- Deep hashing network for unsupervised domain adaptation. In Proceedings of the IEEE conference on computer vision and pattern recognition, 5018–5027.
- Deep Stable Learning for Out-Of-Distribution Generalization. arXiv:2104.07876.
- Deep domain-adversarial image generation for domain generalisation. In Proceedings of the AAAI conference on artificial intelligence, volume 34, 13025–13032.
- Sparse invariant risk minimization. In International Conference on Machine Learning, 27222–27244. PMLR.