The Sky's the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility (2311.16937v2)
Abstract: Inverse rendering of outdoor scenes from unconstrained image collections is a challenging task, particularly illumination/albedo ambiguities and occlusion of the illumination environment (shadowing) caused by geometry. However, there are many cues in an image that can aid in the disentanglement of geometry, albedo and shadows. Whilst sky is frequently masked out in state-of-the-art methods, we exploit the fact that any sky pixel provides a direct observation of distant lighting in the corresponding direction and, via a neural illumination prior, a statistical cue to derive the remaining illumination environment. The incorporation of our illumination prior is enabled by a novel `outside-in' method for computing differentiable sky visibility based on a neural directional distance function. This is highly efficient and can be trained in parallel with the neural scene representation, allowing gradients from appearance loss to flow from shadows to influence the estimation of illumination and geometry. Our method estimates high-quality albedo, geometry, illumination and sky visibility, achieving state-of-the-art results on the NeRF-OSR relighting benchmark. Our code and models can be found at https://github.com/JADGardner/neusky
- Representing 3D Shapes With Probabilistic Directed Distance Fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 19343–19354, June 2022.
- Shape, Illumination, and Reflectance from Shading. TPAMI, 2015.
- Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. CVPR, 2022.
- Intrinsic images in the wild. ACM Transactions on Graphics (TOG), 33(4):1–12, 2014.
- Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.
- Two-shot spatially-varying brdf and shape estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3982–3991, 2020.
- User-assisted intrinsic images. In ACM SIGGRAPH Asia 2009 papers, pages 1–10. Association for Computing Machinery, New York, NY, United States, 2009.
- Pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5799–5809, June 2021.
- Vision Transformer Adapter for Dense Predictions. In The Eleventh International Conference on Learning Representations, 2023.
- Text2Light: Zero-Shot Text-Driven HDR Panorama Generation. ACM Trans. Graph., 41(6), Nov. 2022. Number of pages: 16 Place: New York, NY, USA Publisher: Association for Computing Machinery tex.articleno: 195 tex.issue_date: December 2022.
- The Cityscapes Dataset for Semantic Urban Scene Understanding. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- EverLight: Indoor-Outdoor Editable HDR Lighting Estimation, 2023. arXiv: 2304.13207 [cs.CV].
- PANDORA: Polarization-Aided Neural Decomposition of Radiance. In Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner, editors, Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part VII, volume 13667 of Lecture Notes in Computer Science, pages 538–556. Springer, 2022. tex.bibsource: dblp computer science bibliography, https://dblp.org tex.biburl: https://dblp.org/rec/conf/eccv/DaveZV22.bib tex.timestamp: Mon, 05 Dec 2022 13:35:31 +0100.
- Statistical characterization of real-world illumination. Journal of Vision, 4(9):11–11, Sept. 2004.
- Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors, Advances in Neural Information Processing Systems, 2022.
- Reni++ a rotation-equivariant, scale-invariant, natural illumination prior, 2023.
- Ground truth dataset and baseline evaluations for intrinsic image algorithms. In 2009 IEEE 12th International Conference on Computer Vision, pages 2335–2342. IEEE, 2009.
- Denoising Diffusion Probabilistic Models. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 6840–6851. Curran Associates, Inc., 2020.
- Adam: A Method for Stochastic Optimization. In Yoshua Bengio and Yann LeCun, editors, 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015. tex.bibsource: dblp computer science bibliography, https://dblp.org tex.timestamp: Thu, 25 Jul 2019 14:25:37 +0200.
- Shading annotations in the wild. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 6998–7007, 2017.
- NeuLighting: Neural Lighting for Free Viewpoint Outdoor Scene Relighting with Unconstrained Photo Collections. In SIGGRAPH Asia 2022 Conference Papers, SA ’22, New York, NY, USA, 2022. Association for Computing Machinery. Number of pages: 9 Place: Daegu, Republic of Korea tex.articleno: 13.
- RayDF: Neural Ray-surface Distance Fields with Multi-view Consistency. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- SGDR: Stochastic Gradient Descent with Warm Restarts. In International Conference on Learning Representations, 2017.
- Efficient and differentiable shadow computation for inverse problems. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 13107–13116, 2021.
- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. In ECCV, 2020.
- Instant Neural Graphics Primitives with a Multiresolution Hash Encoding. ACM Trans. Graph., 41(4):102:1–102:15, July 2022. Number of pages: 15 Place: New York, NY, USA Publisher: ACM tex.articleno: 102 tex.issue_date: July 2022.
- Fast Python sampler for the von Mises Fisher distribution. tex.hal_id: hal-04004568 tex.hal_version: v3, Aug. 2023.
- A versatile scene model with differentiable visibility applied to generative pose estimation. In Proceedings of the 2015 International Conference on Computer Vision (ICCV 2015), 2015.
- NeRF for Outdoor Scene Relighting. In European Conference on Computer Vision (ECCV), 2022.
- Structure-from-motion revisited. In Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Pixelwise view selection for unstructured multi-view stereo. In European Conference on Computer Vision (ECCV), 2016.
- Implicit Neural Representations with Periodic Activation Functions. In Proc. NeurIPS, 2020.
- Hdr environment map estimation for real-time augmented reality. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11298–11306, June 2021.
- NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 7495–7504, June 2021.
- Nerfstudio: A Modular Framework for Neural Radiance Field Development. arXiv preprint arXiv:2302.04264, 2023.
- Neural Density-Distance Fields. In Proceedings of the European Conference on Computer Vision, 2022.
- NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.
- Neural Fields meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2023.
- Lance Williams. Casting curved shadows on curved surfaces. In Proceedings of the 5th annual conference on Computer graphics and interactive techniques, pages 270–274, 1978.
- Differentiable shadow mapping for efficient inverse graphics. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 142–153, 2023.
- NeILF: Neural Incident Light Field for Physically-based Material Estimation. In Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner, editors, Computer Vision – ECCV 2022, pages 700–716, Cham, 2022. Springer Nature Switzerland.
- FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions, 2022. arXiv: 2203.16284 [cs.CV].
- Ye Yu and William A. P. Smith. InverseRenderNet: Learning Single Image Inverse Rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
- SDFStudio: A Unified Framework for Surface Reconstruction, 2022.
- MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems, volume 35, pages 25018–25032. Curran Associates, Inc., 2022.
- Physg: Inverse rendering with spherical gaussians for physics-based material editing and relighting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5453–5462, 2021.
- NeRFactor: Neural Factorization of Shape and Reflectance under an Unknown Illumination. ACM Trans. Graph., 40(6), Dec. 2021. Number of pages: 18 Place: New York, NY, USA Publisher: Association for Computing Machinery tex.articleno: 237 tex.issue_date: December 2021.
- Multi-View Reconstruction using Signed Ray Distance Functions (SRDF), 2023. arXiv: 2209.00082 [cs.CV].
- A Deep Signed Directional Distance Function for Object Shape Representation. CoRR, abs/2107.11024, 2021. arXiv: 2107.11024 tex.bibsource: dblp computer science bibliography, https://dblp.org tex.timestamp: Fri, 04 Aug 2023 08:25:46 +0200.