LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes (2405.00900v2)
Abstract: Photorealistic simulation plays a crucial role in applications such as autonomous driving, where advances in neural radiance fields (NeRFs) may allow better scalability through the automatic creation of digital 3D assets. However, reconstruction quality suffers on street scenes because camera motion is largely collinear and sampling becomes sparser at higher speeds. At the same time, the application often demands rendering from camera views that deviate from the inputs in order to accurately simulate behaviors such as lane changes. In this paper, we present several insights that allow better utilization of Lidar data to improve NeRF quality on street scenes. First, our framework learns a geometric scene representation from Lidar, which is fused with the implicit grid-based representation for radiance decoding, thereby supplying the stronger geometric information offered by the explicit point cloud. Second, we put forth a robust occlusion-aware depth supervision scheme, which allows utilizing Lidar points densified by accumulation. Third, we generate augmented training views from Lidar points for further improvement. Our insights translate to substantially improved novel view synthesis on real driving scenes.