NoPose-NeuS: Jointly Optimizing Camera Poses with Neural Implicit Surfaces for Multi-view Reconstruction (2312.15238v1)
Abstract: Learning neural implicit surfaces from volume rendering has become popular for multi-view reconstruction. Neural surface reconstruction approaches can recover complex 3D geometry that are difficult for classical Multi-view Stereo (MVS) approaches, such as non-Lambertian surfaces and thin structures. However, one key assumption for these methods is knowing accurate camera parameters for the input multi-view images, which are not always available. In this paper, we present NoPose-NeuS, a neural implicit surface reconstruction method that extends NeuS to jointly optimize camera poses with the geometry and color networks. We encode the camera poses as a multi-layer perceptron (MLP) and introduce two additional losses, which are multi-view feature consistency and rendered depth losses, to constrain the learned geometry for better estimated camera poses and scene surfaces. Extensive experiments on the DTU dataset show that the proposed method can estimate relatively accurate camera poses, while maintaining a high surface reconstruction quality with 0.89 mean Chamfer distance.
- Structure-from-motion revisited. In Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Pixelwise view selection for unstructured multi-view stereo. In European Conference on Computer Vision (ECCV), 2016.
- Nerf: Representing scenes as neural radiance fields for view synthesis, 2020.
- Unisurf: Unifying neural implicit surfaces and radiance fields for multi-view reconstruction, 2021.
- Volume rendering of neural implicit surfaces, 2021.
- Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction, 2023.
- Nerf–: Neural radiance fields without known camera parameters, 2022.
- Barf: Bundle-adjusting neural radiance fields, 2021.
- Self-calibrating neural radiance fields, 2021.
- Nope-nerf: Optimising neural radiance field with no pose prior, 2023.
- Nerftrinsic four: An end-to-end trainable nerf jointly optimizing diverse intrinsic and extrinsic camera parameters, 2023.
- Learning signed distance field for multi-view surface reconstruction, 2021.
- Recovering fine details for neural implicit surface reconstruction, 2022.
- Large-scale data for multiple-view stereopsis. International Journal of Computer Vision, pages 1–16, 2016.
- Monosdf: Exploring monocular geometric cues for neural implicit surface reconstruction, 2022.
- Accurate, dense, and robust multiview stereopsis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(8):1362–1376, 2010.
- Gipuma: Massively parallel multi-view stereo reconstruction. 2016.
- Screened poisson surface reconstruction. ACM Trans. Graph., 32(3), jul 2013.
- Mvsnet: Depth inference for unstructured multi-view stereo, 2018.
- Patchmatchnet: Learned multi-view patchmatch stereo, 2020.
- Visibility-aware multi-view stereo network, 2020.
- Transmvsnet: Global context-aware multi-view stereo network with transformers, 2021.
- Mvsformer: Multi-view stereo by learning robust image features and temperature-based depth, 2022.
- Differentiable volumetric rendering: Learning implicit 3d representations without 3d supervision, 2020.
- Multiview neural surface reconstruction by disentangling geometry and appearance, 2020.
- Marching cubes: A high resolution 3d surface construction algorithm. SIGGRAPH Comput. Graph., 21(4):163–169, aug 1987.
- Zoedepth: Zero-shot transfer by combining relative and metric depth, 2023.
- Fourier features let networks learn high frequency functions in low dimensional domains, 2020.
- Adam: A method for stochastic optimization, 2017.