Generating Content for HDR Deghosting from Frequency View (2404.00849v1)
Abstract: Recovering ghost-free High Dynamic Range (HDR) images from multiple Low Dynamic Range (LDR) images becomes challenging when the LDR images exhibit saturation and significant motion. Recent Diffusion Models (DMs) have been introduced in HDR imaging field, demonstrating promising performance, particularly in achieving visually perceptible results compared to previous DNN-based methods. However, DMs require extensive iterations with large models to estimate entire images, resulting in inefficiency that hinders their practical application. To address this challenge, we propose the Low-Frequency aware Diffusion (LF-Diff) model for ghost-free HDR imaging. The key idea of LF-Diff is implementing the DMs in a highly compacted latent space and integrating it into a regression-based model to enhance the details of reconstructed images. Specifically, as low-frequency information is closely related to human visual perception we propose to utilize DMs to create compact low-frequency priors for the reconstruction process. In addition, to take full advantage of the above low-frequency priors, the Dynamic HDR Reconstruction Network (DHRNet) is carried out in a regression-based manner to obtain final HDR images. Extensive experiments conducted on synthetic and real-world benchmark datasets demonstrate that our LF-Diff performs favorably against several state-of-the-art methods and is 10$\times$ faster than previous DM-based methods.
- Attention-guided progressive neural texture fusion for high dynamic range image restoration. IEEE Transactions on Image Processing, 31:2661–2672, 2022.
- Wavegrad: Estimating gradients for waveform generation. In International Conference on Learning Representations, 2020.
- Hierarchical integration diffusion model for realistic image deblurring. Advances in Neural Information Processing Systems, 36, 2024.
- Diffusion models beat gans on image synthesis. In Advances in Neural Information Processing Systems, pages 8780–8794. Curran Associates, Inc., 2021.
- Glgnet: light field angular superresolution with arbitrary interpolation rates. Visual Intelligence, 2(1):6, 2024.
- Generative adversarial networks. Communications of the ACM, 63(11):139–144, 2020.
- Ghost-free high dynamic range imaging. In IEEE Conference on Asian Conference on Computer Vision (ACCV), pages 486–500, 2011.
- Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems, pages 6840–6851. Curran Associates, Inc., 2020.
- HDR deghosting: How to deal with saturation? In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1163–1170, 2013.
- Sensor-realistic synthetic data engine for multi-frame high dynamic range photography. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 516–517, 2020.
- Deep high dynamic range imaging of dynamic scenes. ACM Transactions on Graphics, 36(4):1–12, 2017.
- Denoising diffusion restoration models. Advances in Neural Information Processing Systems, 35:23593–23606, 2022.
- Srdiff: Single image super-resolution with diffusion probabilistic models. Neurocomputing, 479:47–59, 2022a.
- Uphdr-gan: Generative adversarial network for high dynamic range imaging with unpaired data. IEEE Transactions on Circuits and Systems for Video Technology, 32(11):7532–7546, 2022b.
- Adnet: Attention-guided deformable convolutional network for high dynamic range imaging. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 463–470, 2021.
- Ghost-free high dynamic range imaging with context-aware transformer. pages 344–360, 2022.
- Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps. Advances in Neural Information Processing Systems, 35:5775–5787, 2022a.
- Transformer for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 457–466, 2022b.
- Refusion: Enabling large-size realistic image restoration with latent-space diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1680–1691, 2023.
- HDR-VDP-2:a calibrated visual metric for visibility and quality predictions in all luminance conditions. In ACM Siggraph, pages 1–14, 2011.
- Hdr-gan: Hdr image reconstruction from multi-exposed ldr images with large motions. IEEE Transactions on Image Processing, 30:3885–3896, 2021.
- Deep hdr reconstruction of dynamic scenes. In 2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC), pages 347–351. IEEE, 2018.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022.
- Image super-resolution via iterative refinement. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
- Robust patch-based hdr reconstruction of dynamic scenes. ACM Trans. Graph., 31(6):203–1, 2012.
- Deep unsupervised learning using nonequilibrium thermodynamics. In Proceedings of the 32nd International Conference on Machine Learning, pages 2256–2265, 2015.
- Denoising diffusion implicit models. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021.
- Selective transhdr: Transformer-based selective hdr imaging using ghost region mask. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XVII, pages 288–304. Springer, 2022.
- Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020.
- Alignment-free hdr deghosting with semantics consistent transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 12836–12845, 2023.
- Image registration for multi-exposure high dynamic range image acquisition. In International Conference in Central Europe on Computer Graphics and Visualization, WSCG’07, 2007.
- An objective deghosting quality metric for HDR images. Comput. Graph. Forum, 35(2):139–152, 2016a.
- An objective deghosting quality metric for hdr images. In Computer Graphics Forum, pages 139–152. Wiley Online Library, 2016b.
- High-frequency component helps explain the generalization of convolutional neural networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8684–8694, 2020.
- Zero-shot image restoration using denoising diffusion null-space model. In The Eleventh International Conference on Learning Representations, 2022.
- Greg Ward. Fast, robust image registration for compositing high dynamic range photographs from hand-held exposures. Journal of Graphics Tools, 8, 2012.
- Deblurring via stochastic refinement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16293–16303, 2022.
- Deep high dynamic range imaging with large foreground motions. In European Conference on Computer Vision (ECCV), 2018.
- Diffir: Efficient diffusion model for image restoration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 13095–13105, 2023.
- High dynamic range imaging by sparse representation. Neurocomputing, 269:160–169, 2017.
- Attention-guided network for ghost-free high dynamic range imaging. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1751–1760, 2019.
- Deep hdr imaging via a non-local network. IEEE Transactions on Image Processing, 29:4308–4322, 2020.
- A unified hdr imaging method with pixel and patch level. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22211–22220, 2023a.
- Towards high-quality hdr deghosting with conditional diffusion models. IEEE Transactions on Circuits and Systems for Video Technology, pages 1–1, 2023b.
- Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5728–5739, 2022.
- Efficient content reconstruction for high dynamic range imaging. In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 7660–7664, 2024a.
- Eiffhdr: An efficient network for multi-exposure high dynamic range imaging. In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 6560–6564, 2024b.