Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network (2405.00244v1)

Published 30 Apr 2024 in cs.CV

Abstract: As an important and practical way to obtain high dynamic range (HDR) video, HDR video reconstruction from sequences with alternating exposures is still less explored, mainly due to the lack of large-scale real-world datasets. Existing methods are mostly trained on synthetic datasets, which perform poorly in real scenes. In this work, to facilitate the development of real-world HDR video reconstruction, we present Real-HDRV, a large-scale real-world benchmark dataset for HDR video reconstruction, featuring various scenes, diverse motion patterns, and high-quality labels. Specifically, our dataset contains 500 LDRs-HDRs video pairs, comprising about 28,000 LDR frames and 4,000 HDR labels, covering daytime, nighttime, indoor, and outdoor scenes. To our best knowledge, our dataset is the largest real-world HDR video reconstruction dataset. Correspondingly, we propose an end-to-end network for HDR video reconstruction, where a novel two-stage strategy is designed to perform alignment sequentially. Specifically, the first stage performs global alignment with the adaptively estimated global offsets, reducing the difficulty of subsequent alignment. The second stage implicitly performs local alignment in a coarse-to-fine manner at the feature level using the adaptive separable convolution. Extensive experiments demonstrate that: (1) models trained on our dataset can achieve better performance on real scenes than those trained on synthetic datasets; (2) our method outperforms previous state-of-the-art methods. Our dataset is available at https://github.com/yungsyu99/Real-HDRV.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. User generated hdr gaming video streaming: Dataset, codec comparison, and challenges. IEEE Transactions on Circuits and Systems for Video Technology, 32(3):1236–1249, 2021.
  2. Flexhdr: Modeling alignment and exposure uncertainties for flexible hdr imaging. IEEE Transactions on Image Processing, 31:5923–5935, 2022.
  3. Hdr video reconstruction: A coarse-to-fine network and a real-world benchmark dataset. In IEEE International Conference on Computer Vision (ICCV), pages 2502–2511, 2021a.
  4. Hdrunet: Single image hdr reconstruction with denoising and dequantization. In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 354–363, 2021b.
  5. Lan-hdr: Luminance-based alignment network for high dynamic range video reconstruction. In IEEE International Conference on Computer Vision (ICCV), pages 12760–12769, 2023.
  6. Recovering high dynamic range radiance maps from photographs. In ACM International Conferenceon Computer Graphics and Interactive Techniques (SIGGRAPH), pages 369–378, 1997.
  7. Creating cinematic wide gamut hdr-video for the evaluation of tone mapping operators and hdr-displays. In Digital photography X, pages 279–288, 2014.
  8. Locally non-rigid registration for mobile hdr photography. In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 48–55, 2015.
  9. Motion aware exposure bracketing for hdr video. Computer Graphics Forum, 34(4):119–130, 2015.
  10. Learning a practical sdr-to-hdrtv up-conversion using new dataset and degradation models. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 22231–22241, 2023.
  11. Hybrid high dynamic range imaging fusing neuromorphic and conventional images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023.
  12. Multiple view geometry in computer vision. Cambridge university press, 2003.
  13. Measuring colorfulness in natural images. In Human vision and electronic imaging VIII, pages 87–95, 2003.
  14. Hdr deghosting: How to deal with saturation? In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1163–1170, 2013.
  15. La-hdr: Light adaptive hdr reconstruction framework for single ldr image considering varied light conditions. IEEE Transactions on Multimedia, pages 1–16, 2022.
  16. A dual-conversion-gain video sensor with dewarping and overlay on a single chip. In IEEE International Solid-State Circuits Conference (ISSCC), pages 52–53, 2009.
  17. Deep hdr video from sequences with alternating exposures. Computer Graphics Forum, 38(2):193–205, 2019.
  18. Patch-based high dynamic range video. ACM Transactions on Graphics, 32(202):1–8, 2013.
  19. High dynamic range video. In ACM International Conferenceon Computer Graphics and Interactive Techniques (SIGGRAPH), pages 319––325, 2003.
  20. A unified framework for multi-sensor hdr video reconstruction. Signal Processing: Image Communication, 29(2):203–215, 2014.
  21. Single-image hdr reconstruction by multi-exposure generation. In IEEE Winter Conference on Applications of Computer Vision (WACV), pages 4052–4061, 2023.
  22. Ghost-free high dynamic range imaging with context-aware transformer. In European Conference on Computer Vision (ECCV), pages 344–360, 2022.
  23. High dynamic range video with ghost removal. In Applications of Digital Image Processing XXXIII, pages 307–314, 2010.
  24. Hdr-vdp-2: a calibrated visual metric for visibility and quality predictions in all luminance conditions. ACM Transactions on graphics, 30(4):1–14, 2011.
  25. Pu21: A novel perceptually uniform encoding for adapting existing quality metrics for hdr. In 2021 Picture Coding Symposium (PCS), pages 1–5, 2021.
  26. Optical splitting trees for high-precision monocular imaging. IEEE Computer Graphics and Applications, 27(2):32–42, 2007.
  27. Exposure fusion. In Proceedings of 15th Pacific Conference on Computer Graphics and Applications (PG), pages 382–390, 2007.
  28. Hdr-vqm: An objective quality measure for high dynamic range video. Signal Processing: Image Communication, 35:46–60, 2015.
  29. High dynamic range imaging: Spatially varying pixel exposures. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 472–479, 2000.
  30. Video frame interpolation via adaptive separable convolution. In IEEE International Conference on Computer Vision (ICCV), pages 261–270, 2017.
  31. High speed and high dynamic range video with an event camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(6):1964–1980, 2021.
  32. Photographic tone reproduction for digital images. ACM Transactions on Graphics, 21(3):267–276, 2002.
  33. Ldr2hdr: on-the-fly reverse tone mapping of legacy video and photographs. In ACM International Conferenceon Computer Graphics and Interactive Techniques (SIGGRAPH), pages 39–es, 2007.
  34. Robust patch-based hdr reconstruction of dynamic scenes. ACM Transactions on Graphics, 31(6):203–1, 2012.
  35. Raft: Recurrent all-pairs field transforms for optical flow. In European Conference on Computer Vision (ECCV), pages 402–419, 2020.
  36. Laurens vd Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(11):2579–2605, 2008.
  37. Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment. In IEEE International Conference on Computer Vision (ICCV), pages 9700–9709, 2021.
  38. Efficient sparse-to-dense optical flow estimation using a learned basis and layers. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 120–130, 2015.
  39. Hdrflow: Real-time hdr video reconstruction with large motions, 2024.
  40. Video enhancement with task-oriented flow. International Journal of Computer Vision, 127:1106–1125, 2019.
  41. Attention-guided network for ghost-free high dynamic range imaging. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1751–1760, 2019.
  42. A unified hdr imaging method with pixel and patch level. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 22211–22220, 2023.
  43. Motion basis learning for unsupervised deep homography estimation with subspace projection. In IEEE International Conference on Computer Vision (ICCV), pages 13097–13105, 2021.
  44. Hdr video reconstruction with a large dynamic dataset in raw and srgb domains, 2023.
  45. Rawhdr: High dynamic range image reconstruction from a single raw image. In IEEE International Conference on Computer Vision (ICCV), pages 12334–12344, 2023.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com