3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction (2307.05409v1)
Abstract: Reconstructing urban areas in 3D out of satellite raster images has been a long-standing and challenging goal of both academical and industrial research. The rare methods today achieving this objective at a Level Of Details $2$ rely on procedural approaches based on geometry, and need stereo images and/or LIDAR data as input. We here propose a method for urban 3D reconstruction named KIBS(\textit{Keypoints Inference By Segmentation}), which comprises two novel features: i) a full deep learning approach for the 3D detection of the roof sections, and ii) only one single (non-orthogonal) satellite raster image as model input. This is achieved in two steps: i) by a Mask R-CNN model performing a 2D segmentation of the buildings' roof sections, and after blending these latter segmented pixels within the RGB satellite raster image, ii) by another identical Mask R-CNN model inferring the heights-to-ground of the roof sections' corners via panoptic segmentation, unto full 3D reconstruction of the buildings and city. We demonstrate the potential of the KIBS method by reconstructing different urban areas in a few minutes, with a Jaccard index for the 2D segmentation of individual roof sections of $88.55\%$ and $75.21\%$ on our two data sets resp., and a height's mean error of such correctly segmented pixels for the 3D reconstruction of $1.60$ m and $2.06$ m on our two data sets resp., hence within the LOD2 precision range.
- TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL https://www.tensorflow.org/. Software available from tensorflow.org.
- Planeformers: From sparse view planes to 3d reconstruction. In ECCV, 2022.
- J.-P. Bauchet and F. Lafarge. City reconstruction from airborne lidar: A computational geometry approach. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2019.
- CGAL. Computational geometry algorithms library, 2022. URL www.cgal.org. Accessed: 2022-11-18.
- B. Chatterjee. Urban feature classification from remote sensor imagery using deep neural networks. Master’s thesis, Concordia University, November 2019. URL https://spectrum.library.concordia.ca/id/eprint/986139/. Unpublished.
- Learning to predict 3d objects with an interpolation- based differentiable renderer. In NeurIPS, 2019.
- 3d-r2n2: A unified approach for single and multi-view 3d object reconstruction. Computer Vision – ECCV 2016, 2016.
- 3d buildings from imagery with ai. part 1: From elevation rasters, 2021. URL https://medium.com/geoai/3d-buildings-from-imagery-with-ai-fbbc1852e4dd. Accessed: 2022-11-16.
- L. Duan and F. Lafarge. Towards large-scale city reconstruction from satellites. In Proc. of the European Conference on Computer Vision (ECCV), 2016.
- Convolutional neural networks for large-scale remote-sensing image classification. In IEEE Transactions on geoscience and remote sensing, volume 55(2), pages 645–657, 2016.
- Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark. In 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pages 3226–3229, 2017.
- FAIR. Detectron2 code repository, 2022. URL https://github.com/facebookresearch/detectron2. Accessed: 2022-11-18.
- Planar Shape Detection at Structural Scales. In CVPR, 2018.
- Single image 3d object reconstruction based on deep learning: A review. Multimedia Tools and Applications, 80:463–498, 2021.
- M. B. Gaetan Bahl and F. Lafarge. Single-shot end-to-end road graph extraction. CVPR 2022 : IEEE Conference on Computer Vision and Pattern Recognition EarthVision Workshop, 2022.
- New frontiers in spectral-spatial hyperspectral image classification: The latest advances based on mathematical morphology, markov random fields, segmentation, sparse representation, and deep learning. IEEE Geoscience and Remote Sensing Magazine, 6(3):10–43, 2018. doi: 10.1109/MGRS.2018.2854840.
- A benchmark for building footprint classification using orthorectified rgb imagery and digital surface models from commercial satellites. In 2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), pages 1–7, Los Alamitos, CA, USA, oct 2017. IEEE Computer Society. doi: 10.1109/AIPR.2017.8457973. URL https://doi.ieeecomputersociety.org/10.1109/AIPR.2017.8457973.
- S. Gui and R. Qin. Automated lod-2 model reconstruction from very-high-resolution satellite-derived digital surface model and orthophoto. ISPRS Journal of Photogrammetry and Remote Sensing, 181, 2021.
- Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, Los Alamitos, CA, USA, jun 2016. IEEE Computer Society. doi: 10.1109/CVPR.2016.90. URL https://doi.ieeecomputersociety.org/10.1109/CVPR.2016.90.
- Mask r-cnn. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 2980–2988, 2017. doi: 10.1109/ICCV.2017.322.
- Urban land use and land cover classification with interpretable machine learning: A case study using sentinel-2 and auxiliary data. Remote Sensing Applications: Society and Environment, 28:100843, 2022. ISSN 2352-9385. doi: https://doi.org/10.1016/j.rsase.2022.100843. URL https://www.sciencedirect.com/science/article/pii/S2352938522001513.
- Primitivenet: Primitive instance segmentation with local primitive embedding under adversarial metric. In ICCV, 2021.
- D. Huh. Curvature-corrected learning dynamics in deep neural networks. In H. D. III and A. Singh, editors, Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pages 4552–4560. PMLR, 13–18 Jul 2020. URL http://proceedings.mlr.press/v119/huh20a.html.
- Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Transactions on Geoscience and Remote Sensing, 57(1):574–586, 2019. doi: 10.1109/TGRS.2018.2858817.
- Super-resolution land cover mapping based on the convolutional neural network. Remote Sensing, 11(15), 2019. ISSN 2072-4292. doi: 10.3390/rs11151815. URL https://www.mdpi.com/2072-4292/11/15/1815.
- F. Lafarge and G. Bahl. Scanner Neural Network for On-board Segmentation of Satellite Images. In IGARSS 2022 – IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, July 2022. URL https://hal.inria.fr/hal-03664644.
- Persistence analysis of multi-scale planar structure graph in point clouds. Computer Graphics Forum, 39(2), 2020.
- Deep iterative surface normal estimation. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11244–11253, 2020.
- Urban semantic 3d reconstruction from multiview satellite imagery. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2019.
- Supervised fitting of geometric primitives to 3d point clouds. In CVPR, 2019a.
- Topological map extraction from overhead images. In ICCV, 2019b.
- Geometric object based building reconstruction from satellite imagery derived point clouds. ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLII-2/W13:73–78, 06 2019c. doi: 10.5194/isprs-archives-XLII-2-W13-73-2019.
- Microsoft coco: Common objects in context. In D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars, editors, Computer Vision – ECCV 2014, pages 740–755, Cham, 2014. Springer International Publishing.
- Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
- Planenet: Piece-wise planar reconstruction from a single rgb image. In CVPR, 2018.
- Planercnn: 3d plane detection and reconstruction from a single image. In CVPR, 2019.
- J. Lussange. Biyolo code repository, 2022. URL https://github.com/johannlussange/biyolo. Accessed: 2022-11-18.
- LuxCarta. Procédé de reconstruction d’un modèle 3d d’un toit d’un bâtiment par analyse d’images acquises par télédétection, 2022. URL https://data.inpi.fr/brevets/FR3123753?q=#FR3123753. Accessed: 2023-03-18.
- Cpfn: Cascaded primitive fitting networks for high-resolution point clouds. In ICCV, 2021.
- M. S. Minhas. Transfer learning for semantic segmentation using pytorch deeplab v3, Sep 2019. URL https://github.com/msminhas93/DeepLabv3FineTuning.
- Rapter: rebuilding man-made scenes with regular arrangements of planes. Trans. on Graphics, 34(4), 2015.
- Polygonal building extraction by frame field learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5891–5900, 2021.
- Automatic 3-d building model reconstruction from very high resolution stereo satellite imagery. Remote Sensing, 11(14), 2019.
- J.-S. Proulx-Bourque and M. Turgeon-Pelchat. Toward the use of deep learning for topographic feature extraction from high resolution optical satellite imagery. In IGARSS 2018, pages 3441–3444, 07 2018. doi: 10.1109/IGARSS.2018.8519171.
- Roof-gan: Learning to generate roof geometry and relations for residential houses. In CVPR, 2021.
- Segmentation of point clouds using smoothness constraint. International archives of photogrammetry, remote sensing and spatial information sciences, 36(5), 2006.
- Intuitive and efficient roof modeling for reconstruction and synthesis. ACM Trans. on Graphics, 40(6), 2021.
- Efficient ransac for point-cloud shape detection. Computer graphics forum, 26(2), 2007.
- Parsenet: A parametric surface fitting network for 3d point clouds. In ECCV, 2020.
- ImpliCity: City modeling from satellite images with deep implicit occupancy fields. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2022.
- B. Sun and P. Mordohai. Oriented point sampling for plane detection in unorganized point clouds. In ICRA, 2019.
- Automated chain for large-scale 3d reconstruction of urban scenes from satellite images. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2019.
- Deep learning for land use and land cover classification based on hyperspectral and multispectral earth observation data: A review. Remote Sensing, 12(15), 2020. ISSN 2072-4292. doi: 10.3390/rs12152495. URL https://www.mdpi.com/2072-4292/12/15/2495.
- Octree-based region growing for point cloud segmentation. ISPRS Journal of Photogrammetry and Remote Sensing, 104, 2015.
- waspinator. pycococreator code repository, 2021. URL https://github.com/waspinator/pycococreator. Accessed: 2022-11-18.
- Regnet: Self-regulated network for image classification. IEEE Transactions on Neural Networks and Learning Systems, pages 1–6, 2022. doi: 10.1109/TNNLS.2022.3158966.
- Hpnet: Deep primitive segmentation using hybrid representations. In ICCV, 2021.
- F. Yang and Z. Zhou. Recovering 3d planes from a single image via convolutional neural networks. In ECCV, 2018.
- Automatic 3d building reconstruction from multi-view aerial images with deep learning. ISPRS Journal of Photogrammetry and Remote Sensing, 171, 2021.
- M. Yu and F. Lafarge. Finding Good Configurations of Planar Primitives in Unorganized Point Clouds. In Proc. of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, US, 2022.
- Single-image piece-wise planar 3d reconstruction via associative embedding. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 1029–1037, 2019.
- Neural procedural reconstruction for residential buildings. In ECCV, 2018.
- Extracting planar roof structures from very high resolution images using graph neural networks. ISPRS Journal of Photogrammetry and Remote Sensing, 187, 2022.
- Deep learning in remote sensing: A review. IEEE Geoscience and Remote Sensing Magazine (GRSM), 10 2017.
- Polyworld: Polygonal building extraction with graph neural networks in satellite images. In CVPR, 2022.