Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction (2309.00872v2)

Published 2 Sep 2023 in cs.CV

Abstract: Photographs taken with less-than-ideal exposure settings often display poor visual quality. Since the correction procedures vary significantly, it is difficult for a single neural network to handle all exposure problems. Moreover, the inherent limitations of convolutions, hinder the models ability to restore faithful color or details on extremely over-/under- exposed regions. To overcome these limitations, we propose a Macro-Micro-Hierarchical transformer, which consists of a macro attention to capture long-range dependencies, a micro attention to extract local features, and a hierarchical structure for coarse-to-fine correction. In specific, the complementary macro-micro attention designs enhance locality while allowing global interactions. The hierarchical structure enables the network to correct exposure errors of different scales layer by layer. Furthermore, we propose a contrast constraint and couple it seamlessly in the loss function, where the corrected image is pulled towards the positive sample and pushed away from the dynamically generated negative samples. Thus the remaining color distortion and loss of detail can be removed. We also extend our method as an image enhancer for low-light face recognition and low-light semantic segmentation. Experiments demonstrate that our approach obtains more attractive results than state-of-the-art methods quantitatively and qualitatively.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (71)
  1. Learning multi-scale photo exposure correction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9157–9167.
  2. Learning photographic global tonal adjustment with a database of input/output image pairs. In CVPR 2011. IEEE, 97–104.
  3. A joint intrinsic-extrinsic prior model for retinex. In Proceedings of the IEEE international conference on computer vision. 4000–4009.
  4. Bilateral guided upsampling. ACM Transactions on Graphics (TOG) 35, 6 (2016), 1–8.
  5. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597–1607.
  6. Deep photo enhancer: Unpaired learning for image enhancement from photographs with gans. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6306–6314.
  7. Illumination Adaptive Transformer. arXiv preprint arXiv:2205.14871 (2022).
  8. Multitask aet with orthogonal tangent regularity for dark object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 2553–2562.
  9. Coatnet: Marrying convolution and attention for all data sizes. Advances in Neural Information Processing Systems 34 (2021), 3965–3977.
  10. HDR image reconstruction from a single exposure using deep CNNs. ACM transactions on graphics (TOG) 36, 6 (2017), 1–15.
  11. Exposure Correction Model to Enhance Image Quality. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 676–686.
  12. A weighted variational model for simultaneous reflectance and illumination estimation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2782–2790.
  13. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1780–1789.
  14. Xiaojie Guo. 2016. LIME: A method for low-light image enhancement. In Proceedings of the 24th ACM international conference on Multimedia. 87–91.
  15. David Hasler and Sabine E Suesstrunk. 2003. Measuring colorfulness in natural images. In Human vision and electronic imaging VIII, Vol. 5007. SPIE, 87–95.
  16. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30 (2017).
  17. Exposure: A white-box photo post-processing framework. ACM Transactions on Graphics (TOG) 37, 2 (2018), 1–17.
  18. Exposure Normalization and Compensation for Multiple-Exposure Correction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 6043–6052.
  19. Learning Sample Relationship for Exposure Correction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 9904–9913.
  20. Exposure-Consistency Representation Learning for Exposure Correction. In Proceedings of the 30th ACM International Conference on Multimedia. 6309–6317.
  21. Dslr-quality photos on mobile devices with deep convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision. 3277–3285.
  22. Target oriented perceptual adversarial fusion network for underwater image enhancement. IEEE Transactions on Circuits and Systems for Video Technology 32, 10 (2022), 6584–6598.
  23. Feng Zhao Keyu Yan Jinghao Zhang Yukun Huang Man Zhou Zhiwei Xiong Jie Huang, Yajing Liu. 2022. Deep Fourier-based Exposure Correction Network with Spatial-Frequency Interaction. In Proceedings of the European Conference on Computer Vision (ECCV).
  24. Representative color transform for image enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 4459–4468.
  25. DALE: Dark region-aware low-light image enhancement. arXiv preprint arXiv:2008.12493 (2020).
  26. GPS-GLASS: Learning Nighttime Semantic Segmentation Using Daytime Video and GPS data. arXiv preprint arXiv:2207.13297 (2022).
  27. Zero-shot day-night domain adaptation with a physics prior. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 4399–4409.
  28. Learning a coordinated network for detail-refinement multiexposure image fusion. IEEE Transactions on Circuits and Systems for Video Technology 33, 2 (2022), 713–727.
  29. Recurrent exposure generation for low-light face detection. IEEE Transactions on Multimedia 24 (2021), 1609–1621.
  30. Target-aware dual adversarial learning and a multi-scenario multi-modality benchmark to fuse infrared and visible for object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5802–5811.
  31. Learning a deep multi-scale feature ensemble and an edge-attention guidance for image fusion. IEEE Transactions on Circuits and Systems for Video Technology 32, 1 (2021), 105–119.
  32. Attention-guided global-local adversarial learning for detail-preserving multi-exposure image fusion. IEEE Transactions on Circuits and Systems for Video Technology 32, 8 (2022), 5026–5040.
  33. HoLoCo: Holistic and local contrastive learning network for multi-exposure image fusion. Information Fusion 95 (2023), 237–249.
  34. Smoa: Searching a modality-oriented architecture for infrared and visible image fusion. IEEE Signal Processing Letters 28 (2021), 1818–1822.
  35. Real-world underwater enhancement: Challenges, benchmarks, and solutions under natural light. IEEE transactions on circuits and systems for video technology 30, 12 (2020), 4861–4875.
  36. Twin adversarial contrastive learning for underwater image enhancement and beyond. IEEE Transactions on Image Processing 31 (2022), 4922–4936.
  37. Fixed-rank representation for unsupervised visual learning. In 2012 ieee conference on computer vision and pattern recognition. IEEE, 598–605.
  38. A bilevel integrated model with data-driven layer ensemble for multi-modality image fusion. IEEE Transactions on Image Processing 30 (2020), 1261–1274.
  39. Learning with nested scene modeling and cooperative architecture search for low-light vision. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 5 (2022), 5953–5969.
  40. Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10561–10570.
  41. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10012–10022.
  42. MBLLEN: Low-Light Image/Video Enhancement Using CNNs.. In BMVC, Vol. 220. 4.
  43. Bilevel Fast Scene Adaptation for Low-Light Image Enhancement. arXiv preprint arXiv:2306.01343 (2023).
  44. Low-light image enhancement via self-reinforced retinex projection model. IEEE Transactions on Multimedia (2022).
  45. Practical Exposure Correction: Great Truths Are Always Simple. arXiv preprint arXiv:2212.14245 (2022).
  46. Deeplpf: Deep local parametric filters for image enhancement. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 12826–12835.
  47. Depth-induced multi-scale recurrent attention network for saliency detection. In Proceedings of the IEEE/CVF international conference on computer vision. 7254–7263.
  48. A2dele: Adaptive and attentive depth distiller for efficient RGB-D salient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9060–9069.
  49. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234–241.
  50. ACDC: The adverse conditions dataset with correspondences for semantic driving scene understanding. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10765–10775.
  51. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1874–1883.
  52. Local color distributions prior for image enhancement. In European Conference on Computer Vision. Springer, 343–359.
  53. Underexposed photo enhancement using deep illumination estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6849–6857.
  54. Unsupervised face detection in the dark. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 1 (2022), 1250–1266.
  55. Uformer: A general u-shaped transformer for image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 17683–17693.
  56. Deep retinex decomposition for low-light enhancement. arXiv preprint arXiv:1808.04560 (2018).
  57. Cvt: Introducing convolutions to vision transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 22–31.
  58. Essential tensor learning for multi-view spectral clustering. IEEE Transactions on Image Processing 28, 12 (2019), 5910–5922.
  59. Star: A structure and texture aware retinex model. IEEE Transactions on Image Processing 29 (2020), 5022–5037.
  60. From fidelity to perceptual quality: A semi-supervised approach for low-light image enhancement. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3063–3072.
  61. Advancing image understanding in poor visibility environments: A collective benchmark study. IEEE Transactions on Image Processing 29 (2020), 5737–5752.
  62. Incorporating convolution designs into visual transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 579–588.
  63. Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5728–5739.
  64. EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers. arXiv preprint arXiv:2203.03952 (2022).
  65. Structure- and Texture-Aware Learning for Low-Light Image Enhancement. In Proceedings of the 30th ACM International Conference on Multimedia. 6483–6492.
  66. Select, supplement and focus for RGB-D saliency detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3472–3481.
  67. High-quality exposure correction of underexposed photos. In Proceedings of the 26th ACM international conference on Multimedia. 582–590.
  68. Beyond brightening low-light images. International Journal of Computer Vision 129 (2021), 1013–1037.
  69. Kindling the darkness: A practical low-light image enhancer. In Proceedings of the 27th ACM international conference on multimedia. 1632–1640.
  70. Star: A structure-aware lightweight transformer for real-time image enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 4106–4115.
  71. Unsupervised Underexposed Image Enhancement via Self-Illuminated and Perceptual Guidance. IEEE Transactions on Multimedia (2022), 1–16. https://doi.org/10.1109/TMM.2022.3193059
Citations (4)

Summary

We haven't generated a summary for this paper yet.