Tackling Shortcut Learning in Deep Neural Networks: An Iterative Approach with Interpretable Models (2302.10289v9)
Published 20 Feb 2023 in cs.LG and cs.CV
Abstract: We use concept-based interpretable models to mitigate shortcut learning. Existing methods lack interpretability. Beginning with a Blackbox (BB), we iteratively carve out a mixture of interpretable experts (MoIE) and a residual network. Each expert explains a subset of the data using First Order Logic (FOL). While explaining a sample, the FOL from the MoIE derived from the biased BB detects the shortcut effectively. Finetuning the BB with Metadata Normalization (MDN) eliminates the shortcut. The FOLs from the MoIE derived from the finetuned BB verify that the shortcut has been eliminated. Our experiments show that MoIE does not hurt the accuracy of the original BB and eliminates shortcuts effectively.
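The shortcut-removal step named in the abstract can be illustrated with a minimal sketch of the idea behind Metadata Normalization: regress a feature on a metadata variable and subtract the fitted metadata component. This is a simplified, assumed version (one feature column, one metadata variable, closed-form least squares), not the authors' implementation; the names `residualize`, `covariance`, `shortcut`, and `feature` are hypothetical and the numbers are made up for illustration.

```python
from statistics import mean


def residualize(features, metadata):
    """Remove the linear effect of one metadata variable from one feature column.

    Assumption: a single feature and a single metadata variable, fitted by
    ordinary least squares with an intercept. The feature mean is preserved.
    """
    m_bar = mean(metadata)
    f_bar = mean(features)
    var_m = sum((m - m_bar) ** 2 for m in metadata)
    if var_m == 0:
        # Constant metadata carries no information to remove.
        return list(features)
    cov_mf = sum((m - m_bar) * (f - f_bar) for m, f in zip(metadata, features))
    beta = cov_mf / var_m  # least-squares slope of feature on metadata
    return [f - beta * (m - m_bar) for f, m in zip(features, metadata)]


def covariance(a, b):
    """Population covariance of two equally long sequences."""
    a_bar, b_bar = mean(a), mean(b)
    return sum((x - a_bar) * (y - b_bar) for x, y in zip(a, b)) / len(a)


# Hypothetical data: a binary shortcut indicator and a feature that tracks it.
shortcut = [1, 1, 1, 0, 0, 0]
feature = [2.1, 1.9, 2.0, 0.4, 0.6, 0.5]

cleaned = residualize(feature, shortcut)
```

After residualization the cleaned feature is linearly uncorrelated with the shortcut indicator, which is the property the finetuning step relies on in this simplified picture. In a network this operation would be applied to intermediate activations per batch; that part is omitted here.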