Optimized Vectorizing of Building Structures with Switch: High-Efficiency Convolutional Channel-Switch Hybridization Strategy (2306.15035v2)
Abstract: The building planar graph reconstruction, a.k.a. footprint reconstruction, which lies in the domain of computer vision and geoinformatics, has been long afflicted with the challenge of redundant parameters in conventional convolutional models. Therefore, in this letter, we proposed an advanced and adaptive shift architecture, namely the Switch operator, which incorporates non-exponential growth parameters while retaining analogous functionalities to integrate local feature spatial information, resembling a high-dimensional convolution operation. The Switch operator, cross-channel operation, architecture implements the XOR operation to alternately exchange adjacent or diagonal features, and then blends alternating channels through a 1x1 convolution operation to consolidate information from different channels. The SwitchNN architecture, on the other hand, incorporates a group-based parameter-sharing mechanism inspired by the convolutional neural network process and thereby significantly reducing the number of parameters. We validated our proposed approach through experiments on the SpaceNet corpus, a publicly available dataset annotated with 2,001 buildings across the cities of Los Angeles, Las Vegas, and Paris. Our results demonstrate the effectiveness of this innovative architecture in building planar graph reconstruction from 2D building images.
- M.-H. Guo, T.-X. Xu, J.-J. Liu, Z.-N. Liu, P.-T. Jiang, T.-J. Mu, S.-H. Zhang, R. R. Martin, M.-M. Cheng, and S.-M. Hu, “Attention mechanisms in computer vision: A survey,” Computational Visual Media, vol. 8, no. 3, pp. 331–368, 2022.
- V. Yordanov, L. Biagi, X. Truong, V. Tran, and M. Brovelli, “An overview of geoinformatics state-of-the-art techniques for landslide monitoring and mapping,” The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. 46, pp. 205–212, 2021.
- B. Ewenstein and J. Whyte, “Knowledge practices in design: the role of visual representations asepistemic objects’,” Organization studies, vol. 30, no. 1, pp. 07–30, 2009.
- M. Handana, R. Karolina et al., “Performance evaluation of existing building structure with pushover analysis,” in IOP Conference Series: Materials Science and Engineering, vol. 309, no. 1. IOP Publishing, 2018, p. 012039.
- M. Del Carpio Ramos, G. Mosqueda, and M. J. Hashemi, “Large-scale hybrid simulation of a steel moment frame building structure through collapse,” Journal of Structural Engineering, vol. 142, no. 1, p. 04015086, 2016.
- C. T. Boyko, M. R. Gaterell, A. R. Barber, J. Brown, J. R. Bryson, D. Butler, S. Caputo, M. Caserio, R. Coles, R. Cooper et al., “Benchmarking sustainability in cities: The role of indicators and future scenarios,” Global Environmental Change, vol. 22, no. 1, pp. 245–254, 2012.
- N. Nauata and Y. Furukawa, “Vectorizing world buildings: Planar graph reconstruction by primitive detection and relationship inference,” in Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VIII 16. Springer, 2020, pp. 711–726.
- F. Zhang, N. Nauata, and Y. Furukawa, “Conv-mpn: Convolutional message passing neural network for structured outdoor architecture reconstruction,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 2798–2807.
- S. Guo, X. Yang, J. Ma, G. Ren, and L. Zhang, “A differentiable two-stage alignment scheme for burst image reconstruction with large shift,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 17 472–17 481.
- Z. Zhang, Z. Li, N. Bi, J. Zheng, J. Wang, K. Huang, W. Luo, Y. Xu, and S. Gao, “Ppgnet: Learning point-pair graph for line segment detection,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 7105–7114.
- Y. Zhou, H. Qi, and Y. Ma, “End-to-end wireframe parsing,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 962–971.
- F. Zhang, X. Xu, N. Nauata, and Y. Furukawa, “Structured outdoor architecture reconstruction by exploration and classification,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 12 427–12 435.
- J. Chen, Y. Qian, and Y. Furukawa, “Heat: Holistic edge attention transformer for structured reconstruction,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 3866–3875.
- W. Zhao, C. Persello, X. Lv, A. Stein, and M. Vergauwen, “Vectorizing planar roof structure from very high resolution remote sensing images using transformers,” International Journal of Digital Earth, vol. 17, no. 1, pp. 1–15, 2024.
Collections
Sign up for free to add this paper to one or more collections.