Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Semantic Aware Diffusion Inverse Tone Mapping (2405.15468v1)

Published 24 May 2024 in cs.CV and cs.GR

Abstract: The range of real-world scene luminance is larger than the capture capability of many digital camera sensors which leads to details being lost in captured images, most typically in bright regions. Inverse tone mapping attempts to boost these captured Standard Dynamic Range (SDR) images back to High Dynamic Range (HDR) by creating a mapping that linearizes the well exposed values from the SDR image, and provides a luminance boost to the clipped content. However, in most cases, the details in the clipped regions cannot be recovered or estimated. In this paper, we present a novel inverse tone mapping approach for mapping SDR images to HDR that generates lost details in clipped regions through a semantic-aware diffusion based inpainting approach. Our method proposes two major contributions - first, we propose to use a semantic graph to guide SDR diffusion based inpainting in masked regions in a saturated image. Second, drawing inspiration from traditional HDR imaging and bracketing methods, we propose a principled formulation to lift the SDR inpainted regions to HDR that is compatible with generative inpainting methods. Results show that our method demonstrates superior performance across different datasets on objective metrics, and subjective experiments show that the proposed method matches (and in most cases outperforms) state-of-art inverse tone mapping operators in terms of objective metrics and outperforms them for visual fidelity.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (47)
  1. Ahmet Oguz Akyüz and Erik Reinhard. 2007. Noise reduction in high dynamic range imaging. J. Vis. Commun. Image Represent. 18, 5 (2007), 366–376. https://doi.org/10.1016/j.jvcir.2007.04.001
  2. NoR-VDPNet++: Real-Time No-Reference Image Quality Metrics. IEEE Access 11 (2023), 34544–34553. https://doi.org/10.1109/ACCESS.2023.3263496
  3. Inverse Tone Mapping. In GRAPHITE ’06 (Kuala Lumpur, Malaysia). ACM, New York, NY, USA, 349–356. https://doi.org/10.1145/1174429.1174489
  4. Self-supervised High Dynamic Range Imaging: What Can Be Learned from a Single 8-bit Video? ACM Trans. Graph. 43, 2, Article 24 (mar 2024), 16 pages. https://doi.org/10.1145/3648570
  5. Tone expansion using lighting style aesthetics. Comput. Graph. 62 (2017), 77–86. https://doi.org/10.1016/j.cag.2016.12.006
  6. Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets. arXiv:2311.15127 [cs.CV]
  7. Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs. In The Twenty-Fourth IEEE Conference on Computer Vision and Pattern Recognition. IEEE.
  8. Text2Light: Zero-Shot Text-Driven HDR Panorama Generation. ACM Trans. Graph. 41, 6, Article 195 (nov 2022), 16 pages. https://doi.org/10.1145/3550454.3555447
  9. Single image ldr to hdr conversion using conditional diffusion. In 2023 IEEE International Conference on Image Processing (ICIP). IEEE, 3533–3537.
  10. Paul E. Debevec and Jitendra Malik. 1997. Recovering High Dynamic Range Radiance Maps from Photographs. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH ’97). ACM Press/Addison-Wesley Publishing Co., USA, 369–378. https://doi.org/10.1145/258734.258884
  11. Prafulla Dhariwal and Alexander Nichol. 2021. Diffusion models beat gans on image synthesis. Advances in neural information processing systems 34 (2021), 8780–8794.
  12. Enhancement of Bright Video Features for HDR Displays. Computer Graphics Forum 27, 4 (2008), 1265–1274.
  13. HDR image reconstruction from a single exposure using deep CNNs. ACM Trans. Graph. 36, 6 (2017), 178:1–178:15. https://doi.org/10.1145/3130800.3130816
  14. Image inpainting: A review. Neural Processing Letters 51 (2020), 2007–2028.
  15. Deep Reverse Tone Mapping. ACM Trans. Graph. 36, 6, Article 177 (2017), 10 pages. https://doi.org/10.1145/3130800.3130834
  16. Mark D Fairchild. 2007. The HDR photographic survey. In Color and imaging conference, Vol. 15. Society of Imaging Science and Technology, 233–238.
  17. PyMatting: A Python Library for Alpha Matting. Journal of Open Source Software 5, 54 (2020), 2481. https://doi.org/10.21105/joss.02481
  18. G-SemTMO: Tone Mapping with a Trainable Semantic Graph. arXiv preprint arXiv:2208.14113 (2022).
  19. Tone mapping operators: Progressing towards semantic-awareness. In 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 1–6.
  20. Comparison of Single Image HDR Reconstruction Methods — the Caveats of Quality Assessment. In ACM SIGGRAPH 2022 Conference Proceedings (Vancouver, BC, Canada) (SIGGRAPH ’22). Association for Computing Machinery, New York, NY, USA, Article 1, 8 pages. https://doi.org/10.1145/3528233.3530729
  21. Burst Photography for High Dynamic Range and Low-Light Imaging on Mobile Cameras. ACM Trans. Graph. 35, 6, Article 192 (nov 2016), 12 pages. https://doi.org/10.1145/2980179.2980254
  22. Deep Arbitrary HDRI: Inverse Tone Mapping With Controllable Exposure Changes. IEEE Trans. Multim. 24 (2022), 2713–2726. https://doi.org/10.1109/TMM.2021.3087034
  23. Segment Anything. arXiv preprint arXiv:2304.02643 (2023).
  24. Rafael Pacheco Kovaleski and Manuel M. Oliveira. 2014. High-Quality Reverse Tone Mapping for a Wide Range of Exposures. In 27th SIBGRAPI Conference on Graphics, Patterns and Images. IEEE Computer Society, New York, 49–56.
  25. Deep recursive hdri: Inverse tone mapping using generative adversarial networks. In proceedings of the European Conference on Computer Vision (ECCV). 596–611.
  26. Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 1648–1657.
  27. HDR-VDP-3: A multi-metric for predicting image differences, quality and contrast distortions in high dynamic range and regular content. arXiv:2304.13625 [eess.IV]
  28. ExpandNet: A Deep Convolutional Neural Network for High Dynamic Range Expansion from Low Dynamic Range Content. Comput. Graph. Forum 37, 2 (2018), 37–49. https://doi.org/10.1111/cgf.13340
  29. Evaluation of reverse tone mapping through varying exposure conditions. In ACM SIGGRAPH Asia 2009 Papers (Yokohama, Japan) (SIGGRAPH Asia ’09). Association for Computing Machinery, New York, NY, USA, Article 160, 8 pages. https://doi.org/10.1145/1661412.1618506
  30. The Reproduction of Specular Highlights on High Dynamic Range Displays. In 14th Color and Imaging Conference, CIC 2006, Scottsdale, Arizona, USA, November 6-10, 2006. Society for Imaging Science and Technology, 333–338. https://doi.org/10.2352/CIC.2006.14.1.ART00061
  31. LDR2HDR: On-the-Fly Reverse Tone Mapping of Legacy Video and Photographs. ACM Trans. Graph. 26, 3 (2007), 39. https://doi.org/10.1145/1276377.1276426
  32. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10684–10695.
  33. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 - 18th International Conference Munich, Germany, October 5 - 9, 2015, Proceedings, Part III (Lecture Notes in Computer Science, Vol. 9351), Nassir Navab, Joachim Hornegger, William M. Wells III, and Alejandro F. Frangi (Eds.). Springer, 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
  34. Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems 35 (2022), 36479–36494.
  35. Single Image HDR Reconstruction Using a CNN with Masked Features and Perceptual Loss. ACM Trans. Graph. 39, 4, Article 80 (2020), 10 pages. https://doi.org/10.1145/3386569.3392403
  36. High dynamic range display systems. ACM Trans. Graph. 23, 3 (aug 2004), 760–768. https://doi.org/10.1145/1015706.1015797
  37. Blind image quality evaluation using perception based features. In Twenty First National Conference on Communications, NCC 2015, Mumbai, India, February 27 - March 1, 2015. IEEE, 1–6. https://doi.org/10.1109/NCC.2015.7084843
  38. GlowGAN: Unsupervised Learning of HDR Images from LDR Images in the Wild. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10509–10519.
  39. High Dynamic Range Image Hallucination. In SIGGRAPH ’07: ACM SIGGRAPH 2007 Sketches (San Diego, California). ACM, New York, NY, USA, 72. https://doi.org/10.1145/1278780.1278867
  40. Fastfcn: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation. arXiv preprint arXiv:1903.11816 (2019).
  41. SegFormer: Simple and efficient design for semantic segmentation with transformers. Advances in Neural Information Processing Systems 34 (2021), 12077–12090.
  42. Luminance Attentive Networks for HDR Image and Panorama Reconstruction. Computer Graphics Forum 40, 7 (2021), 181–192.
  43. Ye Yu and William AP Smith. 2019. Inverserendernet: Learning single image inverse rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3155–3164.
  44. Semantic Segmentation with Extended DeepLabv3 Architecture. In 2019 27th Signal Processing and Communications Applications Conference (SIU). IEEE, 1–4.
  45. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 3836–3847.
  46. Scene Parsing through ADE20k Dataset. In Proceedings of the IEEE conference on computer vision and pattern recognition. 633–641.
  47. RawHDR: High Dynamic Range Image Reconstruction from a Single Raw Image. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, 12334–12344.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets