
NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views (2312.13977v2)

Published 21 Dec 2023 in cs.CV

Abstract: Recently, neural implicit functions have demonstrated remarkable results in the field of multi-view reconstruction. However, most existing methods are tailored for dense views and perform unsatisfactorily on sparse views. Several recent methods generalize implicit reconstruction to the sparse-view setting, but they still suffer from high training costs and are valid only under carefully selected viewpoints. In this paper, we propose a novel sparse-view reconstruction framework that leverages on-surface priors to achieve highly faithful surface reconstruction. Specifically, we design several constraints on global geometry alignment and local geometry refinement to jointly optimize coarse shapes and fine details. To achieve this, we train a neural network to learn a global implicit field from the on-surface points obtained from SfM and then leverage it as a coarse geometric constraint. To exploit local geometric consistency, we project on-surface points onto seen and unseen views and treat the consistency loss of the projected features as a fine geometric constraint. Experimental results on the DTU and BlendedMVS datasets in two prevalent sparse settings demonstrate significant improvements over state-of-the-art methods.
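
The local constraint described in the abstract (projecting on-surface points into views and comparing the features found there) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the pinhole model `x = K (R X + t)`, the nearest-neighbour feature lookup, the L1 distance, and all function names (`project_points`, `sample_nearest`, `feature_consistency_loss`) are assumptions chosen for illustration. A trainable version would presumably need differentiable (e.g. bilinear) sampling instead.

```python
import numpy as np

def project_points(points, K, R, t):
    """Project (N, 3) world points to (N, 2) pixel coordinates.

    Assumes a pinhole camera with intrinsics K and world-to-camera pose (R, t).
    """
    cam = points @ R.T + t            # world frame -> camera frame
    uvw = cam @ K.T                   # camera frame -> homogeneous pixels
    return uvw[:, :2] / uvw[:, 2:3]   # perspective divide

def sample_nearest(feature_map, uv):
    """Look up features in an (H, W, C) map at pixel coordinates uv = (x, y).

    Nearest-neighbour sampling, clamped to the image border (an assumption).
    """
    h, w = feature_map.shape[:2]
    cols = np.clip(np.rint(uv[:, 0]).astype(int), 0, w - 1)
    rows = np.clip(np.rint(uv[:, 1]).astype(int), 0, h - 1)
    return feature_map[rows, cols]

def feature_consistency_loss(points, feat_a, cam_a, feat_b, cam_b):
    """Mean L1 distance between features of the same 3D points in two views.

    cam_a and cam_b are hypothetical (K, R, t) tuples.
    """
    fa = sample_nearest(feat_a, project_points(points, *cam_a))
    fb = sample_nearest(feat_b, project_points(points, *cam_b))
    return float(np.mean(np.abs(fa - fb)))
```

If a point truly lies on the surface and is visible in both views, its projections should land on matching features, so the loss is small. A point off the surface projects to unrelated pixels and is penalized.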
