Optimized Vectorizing of Building Structures with Switch: High-Efficiency Convolutional Channel-Switch Hybridization Strategy (2306.15035v2)

Published 26 Jun 2023 in cs.AI and cs.CV

Abstract: Building planar graph reconstruction, a.k.a. footprint reconstruction, lies in the domain of computer vision and geoinformatics and has long been afflicted by redundant parameters in conventional convolutional models. In this letter, we therefore propose an advanced and adaptive shift architecture, the Switch operator, whose parameters do not grow exponentially while it retains analogous functionality for integrating local spatial feature information, resembling a high-dimensional convolution operation. The Switch operator is a cross-channel operation that uses XOR to alternately exchange adjacent or diagonal features, and then blends the alternating channels through a 1x1 convolution to consolidate information from different channels. The SwitchNN architecture, in turn, incorporates a group-based parameter-sharing mechanism inspired by the convolutional neural network process, thereby significantly reducing the number of parameters. We validate the proposed approach through experiments on the SpaceNet corpus, a publicly available dataset annotated with 2,001 buildings across the cities of Los Angeles, Las Vegas, and Paris. Our results demonstrate the effectiveness of this architecture for building planar graph reconstruction from 2D building images.
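
The abstract does not give the operator's exact definition, so the following is only a minimal sketch of one possible reading: an index XOR pairs each position with its adjacent or diagonal neighbour, the exchange is applied to every other channel, and a 1x1 convolution then mixes the channels. The function names `switch` and `pointwise_conv`, the choice of odd channels, and the fixed weights are all assumptions made for illustration, not the paper's implementation.

```python
def switch(x, di, dj):
    """Exchange features on odd channels with the partner at (i ^ di, j ^ dj).

    x is a list of C channels, each an H x W list of lists.
    (di, dj) = (0, 1) pairs horizontal neighbours, (1, 0) vertical
    neighbours, and (1, 1) diagonal neighbours.
    Assumption: H is even when di == 1 and W is even when dj == 1,
    so that every XOR partner index stays in range.
    """
    out = []
    for c, ch in enumerate(x):
        if c % 2 == 0:
            # Even channels pass through unchanged (illustrative choice).
            out.append([row[:] for row in ch])
        else:
            h, w = len(ch), len(ch[0])
            out.append([[ch[i ^ di][j ^ dj] for j in range(w)]
                        for i in range(h)])
    return out


def pointwise_conv(x, weights):
    """1x1 convolution: weights[o][c] mixes channel c into output channel o."""
    h, w = len(x[0]), len(x[0][0])
    return [[[sum(weights[o][c] * x[c][i][j] for c in range(len(x)))
              for j in range(w)]
             for i in range(h)]
            for o in range(len(weights))]


# Two 2x2 channels; the second one is exchanged diagonally.
features = [[[1, 2], [3, 4]],
            [[5, 6], [7, 8]]]
switched = switch(features, 1, 1)
mixed = pointwise_conv(switched, [[1, 1]])  # hypothetical fixed weights
print(switched[1])  # [[8, 7], [6, 5]]
print(mixed)        # [[[9, 9], [9, 9]]]
```

In this reading the exchange itself has no learnable parameters; only the 1x1 mixing weights would be trained, which is consistent with the abstract's emphasis on reducing parameter count.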
