Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation (2402.18919v3)
Abstract: While standard Empirical Risk Minimization (ERM) training is proven effective for image classification on in-distribution data, it fails to perform well on out-of-distribution samples. One of the main sources of distribution shift for image classification is the compositional nature of images. Specifically, in addition to the main object or component(s) determining the label, some other image components usually exist, which may lead to the shift of input distribution between train and test environments. More importantly, these components may have spurious correlations with the label. To address this issue, we propose Decompose-and-Compose (DaC), which improves robustness to correlation shift by a compositional approach based on combining elements of images. Based on our observations, models trained with ERM usually highly attend to either the causal components or the components having a high spurious correlation with the label (especially in datapoints on which models have a high confidence). In fact, according to the amount of spurious correlation and the easiness of classification based on the causal or non-causal components, the model usually attends to one of these more (on samples with high confidence). Following this, we first try to identify the causal components of images using class activation maps of models trained with ERM. Afterward, we intervene on images by combining them and retraining the model on the augmented data, including the counterfactual ones. Along with its high interpretability, this work proposes a group-balancing method by intervening on images without requiring group labels or information regarding the spurious features during training. The method has an overall better worst group accuracy compared to previous methods with the same amount of supervision on the group labels in correlation shift.
- Systematic generalisation with group invariant predictions. In International Conference on Learning Representations, 2021.
- Invariant risk minimization. ArXiv, abs/1907.02893, 2020.
- Masktune: Mitigating spurious correlations by forcing to explore. In Advances in Neural Information Processing Systems, 2022.
- Recognition in terra incognita. In Computer Vision – ECCV 2018: 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part XVI, page 472–489, Berlin, Heidelberg, 2018. Springer-Verlag.
- Simple data balancing achieves competitive worst-group-accuracy. In Proceedings of the First Conference on Causal Learning and Reasoning, pages 336–351. PMLR, 2022.
- Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 14992–15001, 2021.
- Last layer re-training is sufficient for robustness to spurious correlations. ArXiv, abs/2204.02937, 2022.
- Out-of-distribution generalization via risk extrapolation (rex). In Proceedings of the 38th International Conference on Machine Learning, pages 5815–5826. PMLR, 2021.
- Dropout disagreement: A recipe for group robustness with fewer annotations. In NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications, 2022.
- Tell me where to look: Guided attention inference network. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9215–9223, 2018.
- Just train twice: Improving group robustness without training group information. In Proceedings of the 38th International Conference on Machine Learning, pages 6781–6792. PMLR, 2021.
- Decoupled mixup for generalized visual recognition, 2022.
- Deep learning face attributes in the wild. In 2015 IEEE International Conference on Computer Vision (ICCV), pages 3730–3738, 2015.
- Shortcut learning through the lens of early training dynamics, 2023.
- Learning from failure: Training debiased classifier from biased classifier. In Proceedings of the 34th International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 2020. Curran Associates Inc.
- Agree to disagree: Diversity through disagreement for better transferability. In The Eleventh International Conference on Learning Representations, 2023.
- Judea Pearl. Causality. Cambridge University Press, Cambridge, UK, 2 edition, 2009.
- Simple and fast group robustness by automatic feature reweighting. ICML 2023.
- From fake to real: Pretraining on balanced synthetic images to prevent bias. ArXiv, abs/2308.04553, 2023.
- Fair attribute classification through latent space de-biasing. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021, pages 9301–9310. Computer Vision Foundation / IEEE, 2021.
- Fishr: Invariant gradient variances for out-of-distribution generalization. In Proceedings of the 39th International Conference on Machine Learning, pages 18347–18377. PMLR, 2022.
- Distributionally robust neural networks. In International Conference on Learning Representations, 2020.
- An investigation of why overparameterization exacerbates spurious correlations. In Proceedings of the 37th International Conference on Machine Learning, pages 8346–8356. PMLR, 2020.
- Finding a ”kneedle” in a haystack: Detecting knee points in system behavior. 2011 31st International Conference on Distributed Computing Systems Workshops, pages 166–171, 2011.
- Grad-cam: Visual explanations from deep networks via gradient-based localization. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 618–626, 2017.
- The pitfalls of simplicity bias in neural networks. Advances in Neural Information Processing Systems, 33, 2020.
- Unbiased look at dataset bias. In CVPR 2011, pages 1521–1528, 2011.
- A closer look at model adaptation using feature distortion and simplicity bias. In The Eleventh International Conference on Learning Representations, 2023.
- The caltech-ucsd birds-200-2011 dataset. Technical Report CNS-TR-2011-001, California Institute of Technology, 2011.
- Score-cam: Score-weighted visual explanations for convolutional neural networks. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 111–119, Los Alamitos, CA, USA, 2020a. IEEE Computer Society.
- Causal attention for unbiased visual recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 3091–3100, 2021.
- Deep generative model for robust imbalance classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020b.
- Discover and cure: Concept-aware mitigation of spurious correlation. arXiv preprint arXiv:2305.00650, 2023.
- Masked images are counterfactual samples for robust fine-tuning. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 20301–20310, 2023.
- Adversarial domain adaptation with domain mixup. Proceedings of the AAAI Conference on Artificial Intelligence, 34:6502–6509, 2020.
- Chroma-vae: Mitigating shortcut learning with generative classifiers. In Advances in Neural Information Processing Systems, pages 20351–20365. Curran Associates, Inc., 2022.
- Improving out-of-distribution robustness via selective augmentation. In International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, pages 25407–25437. PMLR, 2022.
- Ood-bench: Quantifying and understanding two dimensions of out-of-distribution generalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 7947–7958, 2022.
- mixup: Beyond empirical risk minimization. In International Conference on Learning Representations, 2018.
- Correct-n-contrast: a contrastive approach for improving robustness to spurious correlations. In Proceedings of the 39th International Conference on Machine Learning, pages 26484–26516. PMLR, 2022.
- Learning multi-attention convolutional neural network for fine-grained image recognition. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 5219–5227, 2017.
- Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.
- Fahimeh Hosseini Noohdani (3 papers)
- Parsa Hosseini (4 papers)
- Mahdieh Soleymani Baghshah (50 papers)
- Aryan Yazdan Parast (2 papers)
- HamidReza Yaghoubi Araghi (3 papers)