Infrastructure Crack Segmentation: Boundary Guidance Method and Benchmark Dataset (2306.09196v1)
Abstract: Cracks provide an essential indicator of infrastructure performance degradation, and achieving high-precision pixel-level crack segmentation is an issue of concern. Unlike the common research paradigms that adopt novel AI methods directly, this paper examines the inherent characteristics of cracks so as to introduce boundary features into crack identification and then builds a boundary guidance crack segmentation model (BGCrack) with targeted structures and modules, including a high frequency module, global information modeling module, joint optimization module, etc. Extensive experimental results verify the feasibility of the proposed designs and the effectiveness of the edge information in improving segmentation results. In addition, considering that notable open-source datasets mainly consist of asphalt pavement cracks because of ease of access, there is no standard and widely recognized dataset yet for steel structures, one of the primary structural forms in civil infrastructure. This paper provides a steel crack dataset that establishes a unified and fair benchmark for the identification of steel cracks.
- Deep learning-based crack damage detection using convolutional neural networks. Computer-Aided Civil and Infrastructure Engineering, 32(5):361–378, 2017.
- Autonomous structural visual inspection using region-based deep learning for detecting multiple damage types. Computer-Aided Civil and Infrastructure Engineering, 33(9):731–747, 2018.
- Transunet: Transformers make strong encoders for medical image segmentation, 2021.
- Rethinking atrous convolution for semantic image segmentation, 2017.
- Encoder-decoder with atrous separable convolution for semantic image segmentation. In Computer Vision – ECCV 2018, pages 833–851, Cham, 2018. Springer International Publishing.
- Fast fourier convolution. In Proceedings of the 34th International Conference on Neural Information Processing Systems, NIPS’20, Red Hook, NY, USA, 2020. Curran Associates Inc.
- Sddnet: Real-time crack segmentation. IEEE Transactions on Industrial Electronics, 67(9):8016–8025, 2020.
- Anomaly detection of defects on concrete structures with the convolutional autoencoder. Advanced Engineering Informatics, 45:101105, 2020.
- Artificial intelligence-empowered pipeline for image-based inspection of concrete structures. Automation in Construction, 120:103372, 2020.
- Automatic tunnel lining crack evaluation and measurement using deep learning. Tunnelling and Underground Space Technology, 124:104472, 2022.
- Review on computer vision-based crack detection and quantification methodologies for civil structures. Construction and Building Materials, 356:129238, 2022.
- Learning to predict crisp boundaries. In Computer Vision – ECCV 2018, pages 570–586, Cham, 2018. Springer International Publishing.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics.
- An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR, 2021.
- Ce-net: Context encoder network for 2d medical image segmentation. IEEE Transactions on Medical Imaging, 38(10):2281–2292, 2019.
- Pavement crack detection based on transformer network. Automation in Construction, 145:104646, 2023.
- Semi-supervised learning based on convolutional neural network and uncertainty filter for façade defects classification. Computer-Aided Civil and Infrastructure Engineering, 36(3):302–317, 2021.
- Multi-scale hybrid vision transformer and sinkhorn tokenizer for sewer defect classification. Automation in Construction, 144:104614, 2022.
- Automatic damage detection using anchor-free method and unmanned surface vessel. Automation in Construction, 133:104017, 2022.
- Searching for mobilenetv3. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 1314–1324, 2019.
- Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(8):2011–2023, 2020.
- Deep laplacian pyramid networks for fast and accurate super-resolution. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5835–5843, 2017.
- Long-distance precision inspection method for bridge cracks with image processing. Automation in Construction, 41:83–95, 2014.
- Improving semantic segmentation via decoupled body and edge supervision. In Computer Vision – ECCV 2020, pages 435–452, Cham, 2020. Springer International Publishing.
- Deepcrack: A deep hierarchical feature learning architecture for crack segmentation. Neurocomputing, 338:139–153, 2019.
- Swin transformer: Hierarchical vision transformer using shifted windows. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 9992–10002, 2021.
- A convnet for the 2020s. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11966–11976, 2022.
- Intriguing findings of frequency selection for image deblurring, 2022.
- Mobilevit: Light-weight, general-purpose, and mobile-friendly vision transformer. In International Conference on Learning Representations, 2022.
- A generative adversarial learning strategy for enhanced lightweight crack delineation networks. Advanced Engineering Informatics, 52:101575, 2022.
- Zernike-moment measurement of thin-crack width in images enabled by dual-scale deep learning. Computer-Aided Civil and Infrastructure Engineering, 34(5):367–384, 2019.
- Attention u-net: Learning where to look for the pancreas, 2018.
- Basnet: Boundary-aware salient object detection. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 7471–7481, 2019.
- Fcanet: Frequency channel attention networks. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 763–772, 2021.
- On the spectral bias of neural networks. In Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pages 5301–5310. PMLR, 09–15 Jun 2019.
- Vision transformers for dense prediction. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 12159–12168, 2021.
- U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pages 234–241, Cham, 2015. Springer International Publishing.
- Grad-cam: Visual explanations from deep networks via gradient-based localization. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 618–626, 2017.
- Automatic road crack detection using random structured forests. IEEE Transactions on Intelligent Transportation Systems, 17(12):3434–3445, 2016.
- Resolution-robust large mask inpainting with fourier convolutions. In 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 3172–3182, 2022.
- Training data-efficient image transformers & distillation through attention. In Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pages 10347–10357. PMLR, 18–24 Jul 2021.
- Attention is all you need. In NeurIPS, 2017.
- Cbam: Convolutional block attention module. In Computer Vision – ECCV 2018, pages 3–19, Cham, 2018. Springer International Publishing.
- Stacked cross refinement network for edge-aware salient object detection. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 7263–7272, 2019.
- Early convolutions help transformers see better, 2021.
- Attribute-based structural damage identification by few-shot meta learning with inter-class knowledge transfer. Structural Health Monitoring, 20(4):1494–1517, 2021.
- Automatic seismic damage identification of reinforced concrete columns from images by a region-based deep convolutional neural network. Structural Control and Health Monitoring, 26(3):e2313, 2019. e2313 STC-18-0081.R1.
- Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Transactions on Intelligent Transportation Systems, 21(4):1525–1535, 2020.
- Automatic pixel-level crack detection and measurement using fully convolutional network. Computer-Aided Civil and Infrastructure Engineering, 33(12):1090–1109, 2018.
- Metaformer is actually what you need for vision. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10809–10819, 2022.
- Exploiting the complementary strengths of multi-layer cnn features for image retrieval. Neurocomputing, 237:235–241, 2017.
- A real-time detection approach for bridge cracks based on yolov4-fpm. Automation in Construction, 122:103514, 2021.
- Unifying transformer and convolution for dam crack detection. Automation in Construction, 147:104712, 2023.
- Egnet: Edge guidance network for salient object detection. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 8778–8787, 2019.
- Deep learning-based crack segmentation for civil infrastructure: data types, architectures, and benchmarked performance. Automation in Construction, 146:104678, 2023.
- Unet++: Redesigning skip connections to exploit multiscale features in image segmentation. IEEE Transactions on Medical Imaging, 39(6):1856–1867, 2020.
- Learning statistical texture for semantic segmentation. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 12532–12541, 2021.
- Cracktree: Automatic crack detection from pavement images. Pattern Recognition Letters, 33(3):227–238, 2012.
- Deepcrack: Learning hierarchical convolutional features for crack detection. IEEE Transactions on Image Processing, 28(3):1498–1512, 2019.