Neural Radiance Fields with Torch Units (2404.02617v1)
Abstract: Neural Radiance Fields (NeRF) give rise to learning-based 3D reconstruction methods widely used in industrial applications. Although prevalent methods achieve considerable improvements in small-scale scenes, accomplishing reconstruction in complex and large-scale scenes is still challenging. First, the background in complex scenes shows a large variance among different views. Second, the current inference pattern, $i.e.$, a pixel only relies on an individual camera ray, fails to capture contextual information. To solve these problems, we propose to enlarge the ray perception field and build up the sample points interactions. In this paper, we design a novel inference pattern that encourages a single camera ray possessing more contextual information, and models the relationship among sample points on each camera ray. To hold contextual information,a camera ray in our proposed method can render a patch of pixels simultaneously. Moreover, we replace the MLP in neural radiance field models with distance-aware convolutions to enhance the feature propagation among sample points from the same camera ray. To summarize, as a torchlight, a ray in our proposed method achieves rendering a patch of image. Thus, we call the proposed method, Torch-NeRF. Extensive experiments on KITTI-360 and LLFF show that the Torch-NeRF exhibits excellent performance.
- Vision-only robot navigation in a neural radiance world. IEEE Robotics and Automation Letters 7, 4606–4613.
- Nerf-tex: Neural reflectance field textures, in: Computer Graphics Forum, Wiley Online Library. pp. 287–301.
- Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5855–5864.
- Mip-nerf 360: Unbounded anti-aliased neural radiance fields, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5470–5479.
- Scenerf: Self-supervised monocular 3d scene reconstruction with radiance fields, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9387–9398.
- pi-gan: Periodic implicit generative adversarial networks for 3d-aware image synthesis, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 5799–5809.
- Gm-nerf: Learning generalizable model-based neural radiance fields from multi-view images, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 20648–20658.
- Learning continuous image representation with local implicit image function, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8628–8638.
- Mobilenerf: Exploiting the polygon rasterization pipeline for efficient neural field rendering on mobile architectures, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16569–16578.
- Lisa: Learning implicit shape and appearance of hands, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 20533–20543.
- Neural knitworks: Patched neural implicit representation networks. arXiv preprint arXiv:2109.14406 .
- Depth-supervised nerf: Fewer views and faster training for free, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12882–12891.
- Plenoxels: Radiance fields without neural networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5501–5510.
- Dynamic neural radiance fields for monocular 4d facial avatar reconstruction, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8649–8658.
- Fastnerf: High-fidelity neural rendering at 200fps, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14346–14355.
- Analyzing and improving the image quality of stylegan, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8110–8119.
- Layered neural atlases for consistent video editing. ACM Transactions on Graphics (TOG) 40, 1–12.
- Point-based neural rendering with per-view optimization, in: Computer Graphics Forum, Wiley Online Library. pp. 29–43.
- Panoptic neural fields: A semantic object-aware neural scene representation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12871–12881.
- Steernerf: Accelerating nerf rendering via smooth viewpoint trajectory, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 20701–20711.
- 3d neural scene representations for visuomotor control, in: Conference on Robot Learning, PMLR. pp. 112–123.
- Neural scene flow fields for space-time view synthesis of dynamic scenes, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6498–6508.
- Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 3292–3310.
- Autoint: Automatic integration for fast neural volume rendering, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14556–14565.
- Nerf-in: Free-form nerf inpainting with rgb-d priors. arXiv preprint arXiv:2206.04901 .
- Neural actor: Neural free-view synthesis of human actors with pose control. ACM transactions on graphics (TOG) 40, 1–16.
- Editing conditional radiance fields, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 5773–5783.
- Urban radiance field representation with deformable neural mesh primitives, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 465–476.
- Nerf in the wild: Neural radiance fields for unconstrained photo collections, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7210–7219.
- Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Transactions on Graphics (TOG) 38, 1–14.
- Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM 65, 99–106.
- Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 .
- Autorf: Learning 3d object radiance fields from single view observations, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3971–3980.
- Instant neural graphics primitives with a multiresolution hash encoding. ACM Transactions on Graphics (ToG) 41, 1–15.
- Donerf: Towards real-time rendering of compact neural radiance fields using depth oracle networks, in: Computer Graphics Forum, Wiley Online Library. pp. 45–59.
- Giraffe: Representing scenes as compositional generative neural feature fields, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11453–11464.
- Animatable neural radiance fields for modeling dynamic human bodies, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14314–14323.
- Neural body: Implicit neural representations with structured latent codes for novel view synthesis of dynamic humans, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9054–9063.
- D-nerf: Neural radiance fields for dynamic scenes, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10318–10327.
- Derf: Decomposed radiance fields, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14153–14161.
- Kilonerf: Speeding up neural radiance fields with thousands of tiny mlps, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14335–14345.
- Urban radiance fields, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12932–12942.
- Sharf: Shape-conditioned radiance fields from a single view. arXiv preprint arXiv:2102.08860 .
- Free view synthesis, in: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIX 16, Springer. pp. 623–640.
- Graf: Generative radiance fields for 3d-aware image synthesis. Advances in Neural Information Processing Systems 33, 20154–20166.
- Nerp: implicit neural representation learning with prior embedding for sparsely sampled image reconstruction. IEEE Transactions on Neural Networks and Learning Systems .
- Light field networks: Neural scene representations with single-evaluation rendering. Advances in Neural Information Processing Systems 34, 19313–19325.
- Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5459–5469.
- Block-nerf: Scalable large scene neural view synthesis, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8248–8258.
- Grf: Learning a general radiance field for 3d scene representation and rendering .
- Clip-nerf: Text-and-image driven manipulation of neural radiance fields, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3835–3844.
- Nerf-sr: High quality neural radiance fields using supersampling, in: Proceedings of the 30th ACM International Conference on Multimedia, pp. 6445–6454.
- Ibrnet: Learning multi-view image-based rendering, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690–4699.
- Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing 13, 600–612.
- Humannerf: Free-viewpoint rendering of moving people from monocular video, in: Proceedings of the IEEE/CVF conference on computer vision and pattern Recognition, pp. 16210–16220.
- Space-time neural irradiance fields for free-viewpoint video, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9421–9431.
- Plenvdb: Memory efficient vdb-based radiance fields for fast training and rendering, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 88–96.
- Learning object-compositional neural radiance field for editable scene rendering, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 13779–13788.
- inerf: Inverting neural radiance fields for pose estimation, in: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE. pp. 1323–1330.
- pixelnerf: Neural radiance fields from one or few images, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4578–4587.
- Nerf++: Analyzing and improving neural radiance fields. arXiv preprint arXiv:2010.07492 .
- The unreasonable effectiveness of deep features as a perceptual metric, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 586–595.
- Pyramid scene parsing network, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881–2890.