NeuRAD: Neural Rendering for Autonomous Driving (2311.15260v3)
Abstract: Neural radiance fields (NeRFs) have gained popularity in the autonomous driving (AD) community. Recent methods show NeRFs' potential for closed-loop simulation, enabling testing of AD systems, and as an advanced training data augmentation technique. However, existing methods often require long training times, dense semantic supervision, or lack generalizability. This, in turn, hinders the application of NeRFs for AD at scale. In this paper, we propose NeuRAD, a robust novel view synthesis method tailored to dynamic AD data. Our method features simple network design, extensive sensor modeling for both camera and lidar -- including rolling shutter, beam divergence and ray dropping -- and is applicable to multiple datasets out of the box. We verify its performance on five popular AD datasets, achieving state-of-the-art performance across the board. To encourage further development, we will openly release the NeuRAD source code. See https://github.com/georghess/NeuRAD .
- Zenseact open dataset: A large-scale and diverse multimodal dataset for autonomous driving. In Int. Conf. Comput. Vis., pages 20178–20188, 2023.
- Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields. In Int. Conf. Comput. Vis., pages 5855–5864, 2021.
- Mip-nerf 360: Unbounded anti-aliased neural radiance fields. In IEEE Conf. Comput. Vis. Pattern Recog., pages 5470–5479, 2022.
- Zip-nerf: Anti-aliased grid-based neural radiance fields. In Int. Conf. Comput. Vis., pages 19697–19705, 2023.
- nuscenes: A multimodal dataset for autonomous driving. In IEEE Conf. Comput. Vis. Pattern Recog., pages 11621–11631, 2020.
- Tensorf: Tensorial radiance fields. In Eur. Conf. Comput. Vis., pages 333–350. Springer, 2022.
- Carla: An open urban driving simulator. In Conference on robot learning, pages 1–16. PMLR, 2017.
- Plenoxels: Radiance fields without neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5501–5510, 2022.
- Panoptic nerf: 3d-to-2d label transfer for panoptic urban scene segmentation. In 2022 International Conference on 3D Vision (3DV), pages 1–11. IEEE, 2022.
- Vision meets robotics: The kitti dataset. The International Journal of Robotics Research, 32(11):1231–1237, 2013.
- Instruct-nerf2nerf: Editing 3d scenes with instructions. In Int. Conf. Comput. Vis., 2023.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv. Neural Inform. Process. Syst., 30, 2017.
- Tri-miprf: Tri-mip representation for efficient anti-aliasing neural radiance fields. In Int. Conf. Comput. Vis., pages 19774–19783, 2023.
- Neural lidar fields for novel view synthesis. In Int. Conf. Comput. Vis., 2023.
- Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1125–1134, 2017.
- 3d gaussian splatting for real-time radiance field rendering. ACM Trans. Graph., 42(4):1–14, 2023.
- Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015.
- Panoptic neural fields: A semantic object-aware neural scene representation. In IEEE Conf. Comput. Vis. Pattern Recog., pages 12871–12881, 2022.
- Nerfacc: Efficient sampling accelerates nerfs. arXiv preprint arXiv:2305.04966, 2023a.
- Neuralangelo: High-fidelity neural surface reconstruction. In IEEE Conf. Comput. Vis. Pattern Recog., pages 8456–8465, 2023b.
- Towards zero domain gap: A comprehensive study of realistic lidar simulation for autonomy testing. In Int. Conf. Comput. Vis., pages 8272–8282, 2023.
- Nerf in the wild: Neural radiance fields for unconstrained photo collections. In IEEE Conf. Comput. Vis. Pattern Recog., pages 7210–7219, 2021.
- Occupancy networks: Learning 3d reconstruction in function space. In IEEE Conf. Comput. Vis. Pattern Recog., pages 4460–4470, 2019.
- Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Trans. Graph., 38(4):1–14, 2019.
- Nerf: Representing scenes as neural radiance fields for view synthesis. In Eur. Conf. Comput. Vis., pages 405–421, Cham, 2020. Springer International Publishing.
- Thomas Müller. tiny-cuda-nn, 2021.
- Instant neural graphics primitives with a multiresolution hash encoding. ACM Trans. Graph., 41(4):1–15, 2022.
- Unisurf: Unifying neural implicit surfaces and radiance fields for multi-view reconstruction. In Int. Conf. Comput. Vis., pages 5589–5599, 2021.
- Neural scene graphs for dynamic scenes. In IEEE Conf. Comput. Vis. Pattern Recog., pages 2856–2865, 2021.
- Urban radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12932–12942, 2022.
- Airsim: High-fidelity visual and physical simulation for autonomous vehicles. In Field and Service Robotics: Results of the 11th International Conference, pages 621–635. Springer, 2018.
- Block-nerf: Scalable large scene neural view synthesis. In IEEE Conf. Comput. Vis. Pattern Recog., pages 8248–8258, 2022.
- Nerfstudio: A modular framework for neural radiance field development. In ACM SIGGRAPH 2023 Conference Proceedings, pages 1–12, 2023.
- Suds: Scalable urban dynamic scenes. In IEEE Conf. Comput. Vis. Pattern Recog., pages 12375–12385, 2023.
- Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. In Adv. Neural Inform. Process. Syst., pages 27171–27183, 2021a.
- Immortal tracker: Tracklet never dies. arXiv preprint arXiv:2111.13672, 2021b.
- High-resolution image synthesis and semantic manipulation with conditional gans. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 8798–8807, 2018a.
- High-resolution image synthesis and semantic manipulation with conditional gans. In IEEE Conf. Comput. Vis. Pattern Recog., 2018b.
- Neus2: Fast learning of neural implicit surfaces for multi-view reconstruction. In Int. Conf. Comput. Vis., pages 3295–3306, 2023.
- Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process., 13(4):600–612, 2004.
- Nerf–: Neural radiance fields without known camera parameters. arXiv preprint arXiv:2102.07064, 2021c.
- Bundlesdf: Neural 6-dof tracking and 3d reconstruction of unknown objects. In IEEE Conf. Comput. Vis. Pattern Recog., pages 606–617, 2023.
- Argoverse 2: Next generation datasets for self-driving perception and forecasting. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2021), 2021.
- Mars: An instance-aware, modular and realistic simulator for autonomous driving. CICAI, 2023.
- Pandaset: Advanced sensor suite dataset for autonomous driving. In 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), pages 3095–3101, 2021.
- S-neRF: Neural radiance fields for street views. In The Eleventh International Conference on Learning Representations, 2023.
- Unisim: A neural closed-loop sensor simulator. In IEEE Conf. Comput. Vis. Pattern Recog., pages 1389–1399, 2023a.
- Reconstructing objects in-the-wild for realistic sensor simulation. In 2023 IEEE International Conference on Robotics and Automation (ICRA), pages 11661–11668, 2023b.
- The unreasonable effectiveness of deep features as a perceptual metric. In IEEE Conf. Comput. Vis. Pattern Recog., pages 586–595, 2018.
- On the continuity of rotation representations in neural networks. In IEEE Conf. Comput. Vis. Pattern Recog., pages 5745–5753, 2019.