Surface Normal Estimation with Transformers (2401.05745v1)
Abstract: We propose the use of a Transformer to accurately predict normals from point clouds with noise and density variations. Previous learning-based methods utilize PointNet variants to explicitly extract multi-scale features at different input scales, then focus on a surface fitting method by which local point cloud neighborhoods are fitted to a geometric surface approximated by either a polynomial function or a multi-layer perceptron (MLP). However, fitting surfaces to fixed-order polynomial functions can suffer from overfitting or underfitting, and learning MLP-represented hyper-surfaces requires pre-generated per-point weights. To avoid these limitations, we first unify the design choices in previous works and then propose a simplified Transformer-based model to extract richer and more robust geometric features for the surface normal estimation task. Through extensive experiments, we demonstrate that our Transformer-based method achieves state-of-the-art performance on both the synthetic shape dataset PCPNet, and the real-world indoor scene dataset SceneNN, exhibiting more noise-resilient behavior and significantly faster inference. Most importantly, we demonstrate that the sophisticated hand-designed modules in existing works are not necessary to excel at the task of surface normal estimation.
- Point set surfaces. In Proceedings Visualization, 2001. VIS’01., pages 21–29. IEEE, 2001.
- L1-sparse reconstruction of sharp point set surfaces. ACM Transactions on Graphics, 29(5):1–12, 2010.
- DeepFit: 3D surface fitting via neural network weighted least squares. In European Conference on Computer Vision, pages 20–34. Springer, 2020.
- Nesti-Net: Normal estimation for unstructured 3D point clouds using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 10112–10120, 2019.
- James F Blinn. Simulation of wrinkled surfaces. ACM SIGGRAPH Computer Graphics, 12(3):286–292, 1978.
- Fast and robust normal estimation for point clouds with sharp features. In Computer Graphics Forum, pages 1765–1774. Wiley Online Library, 2012.
- Deep learning for robust normal estimation in unstructured point clouds. In Computer Graphics Forum, pages 281–290. Wiley Online Library, 2016.
- Estimating differential quantities using polynomial fitting of osculating jets. Computer Aided Geometric Design, 22(2):121–146, 2005a.
- Estimating differential quantities using polynomial fitting of osculating jets. Computer Aided Geometric Design, 22(2):121–146, 2005b.
- Unit3d: A unified transformer for 3d dense captioning and visual grounding, 2022.
- An image is worth 16x16 words: Transformers for image recognition at scale, 2021.
- Robust moving least-squares fitting with sharp features. 24(3), 2005.
- Henri Gouraud. Continuous shading of curved surfaces. IEEE Transactions on Computers, 100(6):623–629, 1971.
- Algebraic point set surfaces. In ACM SIGGRAPH 2007 papers. 2007.
- PCPNet: learning local shape properties from raw point clouds. In Computer Graphics Forum, pages 75–85. Wiley Online Library, 2018.
- SEMANTIC3D.NET: A new large-scale point cloud classification benchmark. In ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, pages 91–98, 2017.
- Surface reconstruction from unorganized points. In Proceedings of the 19th Annual Conference on Computer Graphics and Interactive Techniques, pages 71–78, 1992.
- Scenenn: A scene meshes dataset with annotations. In International Conference on 3D Vision (3DV), 2016.
- Consolidation of unorganized point clouds for surface reconstruction. ACM Transactions on Graphics, 28(5):1–7, 2009.
- Adaptive mixtures of local experts. Neural Computation, 3(1):79–87, 1991.
- Poisson surface reconstruction. In Proceedings of the fourth Eurographics Symposium on Geometry Processing, 2006.
- Anisotropic smoothing of point sets. Computer Aided Geometric Design, 22(7):680–692, 2005.
- Deep iterative surface normal estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11247–11256, 2020.
- Graphfit: Learning multi-scale graph-convolutional representation for point cloud normal estimation. In Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXII, page 651–667, Berlin, Heidelberg, 2022a. Springer-Verlag.
- Hsurf-net: Normal estimation for 3d point clouds by learning hyper surfaces. In Advances in Neural Information Processing Systems, pages 4218–4230. Curran Associates, Inc., 2022b.
- Neaf: Learning neural angle fields for point normal estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1):1396–1404, 2023.
- Deep feature-preserving normal estimation for point cloud filtering. Computer-Aided Design, 125:102860, 2020a.
- Low rank matrix approximation for 3D geometry filtering. IEEE Transactions on Visualization and Computer Graphics, 2020b.
- An End-to-End Transformer Model for 3D Object Detection. In ICCV, 2021.
- Estimating surface normals in noisy point cloud data. In Proceedings of the Nineteenth Annual Symposium on Computational Geometry, pages 322–328, 2003.
- Efficient simplification of point-sampled surfaces. In IEEE Visualization, 2002. VIS 2002., pages 163–170. IEEE, 2002.
- Bui Tuong Phong. Illumination for computer generated pictures. Communications of the ACM, 18(6):311–317, 1975.
- PointNet: Deep learning on point sets for 3D classification and segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 652–660, 2017a.
- PointNet++: Deep hierarchical feature learning on point sets in a metric space. Advances in Neural Information Processing Systems, 30:5099–5108, 2017b.
- You only look once: Unified, real-time object detection, 2015. cite arxiv:1506.02640.
- Relationformer: A unified framework for image-to-graph generation, 2022.
- Gilbert W Stewart. On the early history of the singular value decomposition. SIAM Review, 35(4):551–566, 1993.
- Denoising point sets via l0 minimization. Computer Aided Geometric Design, 35:2–15, 2015.
- Attention is all you need. Advances in Neural Information Processing Systems, 30, 2017.
- Dynamic graph CNN for learning on point clouds. CoRR, abs/1801.07829, 2018.
- Pointr: Diverse point cloud completion with geometry-aware transformers, 2021.
- Geometry guided deep surface normal estimation. Computer-Aided Design, 142:103119, 2022a.
- Geometry guided deep surface normal estimation. Computer-Aided Design, 142:103119, 2022b.
- Point transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 16259–16268, 2021a.
- 3DVG-Transformer: Relation modeling for visual grounding on point clouds. In ICCV, pages 2928–2937, 2021b.
- Refine-Net: Normal refinement neural network for noisy point clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
- Normal estimation for 3D point clouds via local plane constraint and multi-scale selection. Computer-Aided Design, 129:102916, 2020.
- Improvement of normal estimation for point clouds via simplifying surface fitting. arXiv preprint arXiv:2104.10369, 2021.
- AdaFit: Rethinking learning-based normal estimation on point clouds. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 6118–6127, 2021a.
- Deformable detr: Deformable transformers for end-to-end object detection, 2021b.
- Barry Shichen Hu (1 paper)
- Siyun Liang (3 papers)
- Johannes Paetzold (12 papers)
- Huy H. Nguyen (36 papers)
- Isao Echizen (83 papers)
- Jiapeng Tang (18 papers)