Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion (2402.09530v2)
Abstract: Convolutional neural networks (CNNs) for image processing tend to focus on localized texture patterns, commonly referred to as texture bias. While most of the previous works in the literature focus on the task of image classification, we go beyond this and study the texture bias of CNNs in semantic segmentation. In this work, we propose to train CNNs on pre-processed images with less texture to reduce the texture bias. Therein, the challenge is to suppress image texture while preserving shape information. To this end, we utilize edge enhancing diffusion (EED), an anisotropic image diffusion method initially introduced for image compression, to create texture reduced duplicates of existing datasets. Extensive numerical studies are performed with both CNNs and vision transformer models trained on original data and EED-processed data from the Cityscapes dataset and the CARLA driving simulator. We observe strong texture-dependence of CNNs and moderate texture-dependence of transformers. Training CNNs on EED-processed images enables the models to become completely ignorant with respect to texture, demonstrating resilience with respect to texture re-introduction to any degree. Additionally we analyze the performance reduction in depth on a level of connected components in the semantic segmentation and study the influence of EED pre-processing on domain generalization as well as adversarial robustness.
- On the robustness of semantic segmentation models to adversarial attacks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
- The vulnerability of semantic segmentation networks to adversarial attacks in autonomous driving: Enhancing extensive environment sensing. IEEE Signal Processing Magazine, 2021.
- Network dissection: Quantifying interpretability of deep visual representations. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 6541–6549, 2017.
- Deterministic edge-preserving regularization in computed imaging. IEEE Transactions on image processing, 6(2):298–311, 1997.
- Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European conference on computer vision (ECCV), pages 801–818, 2018.
- MMSegmentation Contributors. MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark. https://github.com/open-mmlab/mmsegmentation, 2020.
- The cityscapes dataset. In CVPR Workshop on the Future of Datasets in Vision, volume 2. sn, 2015.
- Rethinking the image feature biases exhibited by deep convolutional neural network models in image recognition. CAAI Transactions on Intelligence Technology, 7(4):721–731, 2022.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Carla: An open urban driving simulator. In Conference on robot learning, pages 1–16. PMLR, 2017.
- Image compression with anisotropic diffusion. Journal of Mathematical Imaging and Vision, 31:255–269, 2008.
- Robust contrastive learning using negative samples with diminished semantics. Advances in Neural Information Processing Systems, 34:27356–27368, 2021.
- Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. arXiv preprint arXiv:1811.12231, 2018.
- Explaining and harnessing adversarial examples. In Yoshua Bengio and Yann LeCun, editors, International Conference on Learning Representations (ICLR), 2015.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- Shift from texture-bias to shape-bias: Edge deformation-based augmentation for robust object recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 1526–1535, 2023.
- Qualitative similarities and differences in visual object representations between brains and deep networks. Nature communications, 12(1):1872, 2021.
- Towards domain adversarial methods to mitigate texture bias. In ICML 2022: Workshop on Spurious Correlations, Invariance and Stability, 2022.
- Supervised contrastive learning. arXiv preprint arXiv:2004.11362, 2020.
- Learning texture invariant representation for domain adaptation of semantic segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12975–12984, 2020.
- Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25, 2012.
- Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- Improving robustness to texture bias via shape-focused augmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4323–4331, 2022.
- Uncertainty-based detection of adversarial attacks in semantic segmentation. ArXiv, 2023.
- Uncertainty-weighted loss functions for improved adversarial attacks on semantic segmentation. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 3906–3914, January 2024.
- Learning visual representations for transfer learning by suppressing texture. arXiv preprint arXiv:2011.01901, 2020.
- Scale-space and edge detection using anisotropic diffusion. IEEE Transactions on pattern analysis and machine intelligence, 12(7):629–639, 1990.
- Anisotropic diffusion. Geometry-driven diffusion in computer vision, pages 73–92, 1994.
- Image segmentation and edge enhancement with stabilized inverse diffusion equations. IEEE transactions on image processing, 9(2):256–266, 2000.
- Proximal splitting adversarial attacks for semantic segmentation, 2022.
- Prediction error meta classification in semantic segmentation: Detection via aggregated dispersion measures of softmax probabilities. In 2020 International Joint Conference on Neural Networks (IJCNN), pages 1–9. IEEE, 2020.
- Are convolutional neural networks or transformers more like human vision? arXiv preprint arXiv:2105.07197, 2021.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- Joachim Weickert. Theoretical foundations of anisotropic diffusion in image processing. Springer, 1996.
- Joachim Weickert. A review of nonlinear diffusion filtering. In International conference on scale-space theories in computer vision, pages 1–28. Springer, 1997.
- Joachim Weickert. Anisotropic diffusion in image processing, volume 1. Teubner Stuttgart, 1998.
- Joachim Weickert. Coherence-enhancing diffusion filtering. International journal of computer vision, 31:111–127, 1999.
- Tensor field interpolation with pdes. In Visualization and processing of tensor fields, pages 315–325. Springer, 2006.
- L 2-stable nonstandard finite differences for anisotropic diffusion. In Scale Space and Variational Methods in Computer Vision: 4th International Conference, SSVM 2013, Schloss Seggau, Leibnitz, Austria, June 2-6, 2013. Proceedings 4, pages 380–391. Springer, 2013.
- Characterizing adversarial examples based on spatial consistency information for semantic segmentation. In European Conference on Computer Vision (ECCV), 2018.
- Segformer: Simple and efficient design for semantic segmentation with transformers. Advances in Neural Information Processing Systems, 34:12077–12090, 2021.
- Dynamic divide-and-conquer adversarial training for robust semantic segmentation. In IEEE International Conference on Computer Vision (ICCV), 2021.
- Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part I 13, pages 818–833. Springer, 2014.
- Delving deep into the generalization of vision transformers under distribution shifts. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 7267–7276, 2021.