HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting (2405.15125v4)

Published 24 May 2024 in cs.CV

Abstract: High dynamic range (HDR) novel view synthesis (NVS) aims to create photorealistic images from novel viewpoints using HDR imaging techniques. The rendered HDR images capture a wider range of brightness levels containing more details of the scene than normal low dynamic range (LDR) images. Existing HDR NVS methods are mainly based on NeRF. They suffer from long training time and slow inference speed. In this paper, we propose a new framework, High Dynamic Range Gaussian Splatting (HDR-GS), which can efficiently render novel HDR views and reconstruct LDR images with a user input exposure time. Specifically, we design a Dual Dynamic Range (DDR) Gaussian point cloud model that uses spherical harmonics to fit HDR color and employs an MLP-based tone-mapper to render LDR color. The HDR and LDR colors are then fed into two Parallel Differentiable Rasterization (PDR) processes to reconstruct HDR and LDR views. To establish the data foundation for the research of 3D Gaussian splatting-based methods in HDR NVS, we recalibrate the camera parameters and compute the initial positions for Gaussian point clouds. Experiments demonstrate that our HDR-GS surpasses the state-of-the-art NeRF-based method by 3.84 and 1.91 dB on LDR and HDR NVS while enjoying 1000x inference speed and only requiring 6.3% training time. Code and recalibrated data will be publicly available at https://github.com/caiyuanhao1998/HDR-GS . A brief video introduction of our work is available at https://youtu.be/wtU7Kcwe7ck

Summary

  • The paper introduces HDR-GS, which leverages a Dual Dynamic Range (DDR) Gaussian point cloud model to render both HDR and LDR images at roughly 1000x the inference speed of NeRF-based baselines.
  • It employs Parallel Differentiable Rasterization to achieve high-quality image synthesis, surpassing the best NeRF-based method by 3.84 dB in LDR PSNR and 1.91 dB in HDR PSNR.
  • The framework recalibrates camera parameters using structure-from-motion, enabling efficient and versatile HDR novel view synthesis for real-time applications.

High Dynamic Range Novel View Synthesis Using Gaussian Splatting

The paper addresses the challenges inherent in high dynamic range (HDR) novel view synthesis (NVS) and introduces a new framework, High Dynamic Range Gaussian Splatting (HDR-GS). HDR NVS aims to render photorealistic images from new viewpoints using HDR imaging techniques, which capture a wider range of luminance levels than traditional low dynamic range (LDR) imaging. Existing HDR NVS methods, built mainly on Neural Radiance Fields (NeRF), suffer from long training times and slow inference. HDR-GS instead uses a Dual Dynamic Range (DDR) Gaussian point cloud model to significantly improve efficiency while matching or exceeding the quality of the synthesized images.

Methodological Innovations

The HDR-GS framework introduces several innovative components that collectively address the deficiencies of current HDR NVS methods:

  1. Dual Dynamic Range (DDR) Gaussian Point Clouds: The DDR model uses spherical harmonics (SH) to represent HDR colors and an MLP-based tone-mapper to render LDR colors for a user-specified exposure time. This dual representation lets HDR-GS reconstruct both HDR and LDR views from a single set of Gaussians (see the illustrative sketch after this list).
  2. Parallel Differentiable Rasterization (PDR): The framework implements two parallel PDR processes to individually reconstruct HDR and LDR images from the Gaussian point clouds. This dual-path approach ensures that HDR-GS can generate images with controllable exposure levels while maintaining high fidelity.
  3. Recalibration of Camera Parameters: To adapt the 3D Gaussian splatting technique for HDR imaging, the authors recalibrate camera parameters and employ structure-from-motion (SfM) techniques to compute initial positions for Gaussian point clouds. This recalibration is pivotal in overcoming the limitations of the normalized device coordinate (NDC) system when applying 3D Gaussian splatting techniques.
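Below is a minimal, illustrative PyTorch sketch of the DDR idea, not the authors' implementation: per-Gaussian HDR color is evaluated from spherical harmonics, and a small MLP tone-mapper per channel, conditioned on log exposure time in the classic camera-response formulation, maps it to LDR color. The names (eval_sh_deg1, ToneMapperMLP, ddr_colors), the SH degree, and the MLP architecture are assumptions made for illustration; in HDR-GS the two color sets would then be splatted by two parallel differentiable rasterization passes sharing the same Gaussian geometry.

```python
import torch
import torch.nn as nn

def eval_sh_deg1(sh: torch.Tensor, dirs: torch.Tensor) -> torch.Tensor:
    """Degree-1 real spherical harmonics: sh has shape (N, 4, 3), dirs (N, 3) unit vectors."""
    C0, C1 = 0.28209479177387814, 0.4886025119029199
    x, y, z = dirs.unbind(-1)
    rgb = (C0 * sh[:, 0]
           - C1 * y[:, None] * sh[:, 1]
           + C1 * z[:, None] * sh[:, 2]
           - C1 * x[:, None] * sh[:, 3])
    return torch.relu(rgb)  # HDR radiance is non-negative

class ToneMapperMLP(nn.Module):
    """Hypothetical learned response curve: log-HDR radiance + log exposure -> LDR in [0, 1]."""
    def __init__(self, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(1, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1), nn.Sigmoid(),
        )

    def forward(self, hdr: torch.Tensor, exposure: torch.Tensor) -> torch.Tensor:
        # LDR = g(ln(HDR) + ln(delta_t)): the standard camera-response formulation.
        x = torch.log(hdr.clamp_min(1e-6)) + torch.log(exposure)
        return self.net(x.unsqueeze(-1)).squeeze(-1)

def ddr_colors(sh, view_dirs, exposure, tone_mappers):
    """Per-Gaussian HDR and LDR colors, ready for two parallel rasterization passes."""
    hdr_rgb = eval_sh_deg1(sh, view_dirs)                      # (N, 3) HDR color
    ldr_rgb = torch.stack(
        [tone_mappers[c](hdr_rgb[:, c], exposure) for c in range(3)], dim=-1
    )                                                          # (N, 3) LDR color
    return hdr_rgb, ldr_rgb

# Toy usage: 1,000 Gaussians viewed under a 1/8 s exposure.
N = 1000
sh = torch.randn(N, 4, 3)
dirs = nn.functional.normalize(torch.randn(N, 3), dim=-1)
mappers = nn.ModuleList([ToneMapperMLP() for _ in range(3)])
hdr, ldr = ddr_colors(sh, dirs, torch.tensor(0.125), mappers)
```

Keeping the tone-mapper as a small MLP makes exposure a user-controllable input at render time, which is what allows LDR views to be synthesized at arbitrary exposure levels while the same Gaussians supply the HDR rendering.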

Experimental Results

The empirical evaluation shows that HDR-GS significantly outperforms state-of-the-art NeRF-based methods in both LDR and HDR novel view synthesis tasks:

  • Performance Metrics: HDR-GS improves peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and learned perceptual image patch similarity (LPIPS) scores. Specifically, HDR-GS surpasses the best-performing NeRF-based method by 3.84 dB in PSNR for LDR rendering and 1.91 dB for HDR rendering (what these dB gains imply is sketched after this list).
  • Efficiency: HDR-GS is dramatically more efficient, achieving approximately 1000 times faster inference while requiring only 6.3% of the training time of the best-performing NeRF-based method.
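For context on the reported gains, PSNR is a logarithmic function of mean squared error, so a fixed-dB improvement corresponds to a multiplicative reduction in reconstruction error. The figures below are simple arithmetic applied to the paper's reported deltas, not additional results.

```latex
% PSNR definition (MAX_I is the peak signal value, e.g. 1.0 or 255):
\mathrm{PSNR} = 10 \log_{10}\!\left(\frac{\mathrm{MAX}_I^{2}}{\mathrm{MSE}}\right)
% At fixed MAX_I, a gain of +3.84 dB implies the MSE shrinks by
% 10^{3.84/10} \approx 2.4\times, and +1.91 dB implies roughly 1.55\times lower MSE.
```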

Implications and Future Directions

The proposed HDR-GS framework offers several significant implications for the broader field of computer vision and generative AI:

  1. Practical Applicability: By significantly reducing both the training and inference times, HDR-GS enables real-time applications in augmented reality (AR), virtual reality (VR), and other interactive systems requiring dynamic scene rendering with HDR imagery.
  2. Advancement in HDR Imaging: The dual dynamic range approach provides a new paradigm for HDR image synthesis, which could inspire further research into more efficient and versatile HDR reconstruction methods.
  3. Adaptability: The recalibration of camera parameters for initial Gaussian point cloud positions establishes a robust foundation for extending Gaussian splatting to various imaging scenarios.

Future research could build upon the HDR-GS framework by exploring more advanced model architectures for DDR and improving the efficiency of the PDR processes. Additionally, further studies could investigate the applicability of this framework in other domains such as 3D scene reconstruction, video synthesis, and beyond.

In conclusion, HDR-GS represents a significant advancement in the efficient rendering of HDR images from novel viewpoints, enhancing both the quality and speed of HDR NVS. This framework not only addresses the limitations of existing NeRF-based methods but also sets the stage for further innovations in the field of HDR imaging.
