Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
121 tokens/sec
GPT-4o
9 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Specularity Factorization for Low-Light Enhancement (2404.01998v1)

Published 2 Apr 2024 in cs.CV and cs.LG

Abstract: We present a new additive image factorization technique that treats images to be composed of multiple latent specular components which can be simply estimated recursively by modulating the sparsity during decomposition. Our model-driven {\em RSFNet} estimates these factors by unrolling the optimization into network layers requiring only a few scalars to be learned. The resultant factors are interpretable by design and can be fused for different image enhancement tasks via a network or combined directly by the user in a controllable fashion. Based on RSFNet, we detail a zero-reference Low Light Enhancement (LLE) application trained without paired or unpaired supervision. Our system improves the state-of-the-art performance on standard benchmarks and achieves better generalization on multiple other datasets. We also integrate our factors with other task specific fusion networks for applications like deraining, deblurring and dehazing with negligible overhead thereby highlighting the multi-domain and multi-task generalizability of our proposed RSFNet. The code and data is released for reproducibility on the project homepage.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (108)
  1. Sparse coding with anomaly detection. In 2013 IEEE MLSP, 2013.
  2. Adobe Inc. Adobe photoshop, 2023.
  3. Learning multi-scale photo exposure correction. In CVPR, 2021.
  4. Semantic soft segmentation. ACM ToG (SIGGRAPH), 37(4), 2018.
  5. Shadingnet: Image intrinsics by fine-grained shading decomposition. IJCV, 129(8), 2021.
  6. Intrinsic images in the wild. ACM Trans. on Graphics (SIGGRAPH), 33(4), 2014.
  7. Deep burst super-resolution. CVPR, 2021a.
  8. Deep reparametrization of multi-frame super-resolution and denoising. ICCV, 2021b.
  9. Intrinsic decompositions for image editing. Computer Graphics Forum (Eurographics State of the Art Reports), 36(2), 2017.
  10. Convex Optimization. Cambridge University Press, 2004.
  11. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1), 2011.
  12. Learned robust pca: A scalable deep unfolding approach for high-dimensional outlier detection. NeurIPS, 34, 2021.
  13. Retinexformer: One-stage retinex-based transformer for low-light image enhancement. In ICCV, 2023.
  14. Emerging properties in self-supervised vision transformers. In Proceedings of the International Conference on Computer Vision (ICCV), 2021.
  15. Contextual and variational contrast enhancement. IEEE TIP, 20(12), 2011.
  16. Theoretical linear convergence of unfolded ista and its practical weights and thresholds. In NeurIPS, 2018.
  17. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Communications on Pure and Applied Mathematics, 57, 2003.
  18. Half wavelet attention on m-net+ for low-light image enhancement. In IEEE ICIP, 2022.
  19. A general decoupled learning framework for parameterized image operators. 2019.
  20. Generative diffusion prior for unified image restoration and enhancement. In CVPR, 2023.
  21. Specular highlight removal for real-world images. Computer Graphics Forum, 38(7), 2019.
  22. A fusion-based enhancing method for weakly illuminated images. Signal Processing, 129, 2016a.
  23. A weighted variational model for simultaneous reflectance and illumination estimation. In CVPR, 2016b.
  24. Learning a simple low-light image enhancer from paired low-light instances. In CVPR, 2023.
  25. A survey on intrinsic images: Delving deep into lambert and beyond. IJCV, 2022.
  26. Learning fast approximations of sparse coding. In ICML, 2010.
  27. Zero-reference deep curve estimation for low-light image enhancement. CVPR, 2020.
  28. Single image highlight removal with a sparse and low-rank reflection model. In ECCV, 2018.
  29. Lime: Low-light image enhancement via illumination map estimation. IEEE TIP, 26(2), 2016.
  30. Interpreting intrinsic image decomposition using concept activations. In ACM ICVGIP, 2022.
  31. Low-light image enhancement with semi-decoupled decomposition. IEEE TMM, 22(12), 2020.
  32. Charles Hessel. Simulated Exposure Fusion. Image Processing On Line, 9, 2019.
  33. An extended exposure fusion and its application to single image contrast enhancement. In WACV, 2020.
  34. Revisiting shadow detection: A new benchmark dataset for complex world. IEEE TIP, 30, 2021.
  35. Deep fourier-based exposure correction network with spatial-frequency interaction. In ECCV, 2022.
  36. Enlightengan: Deep light enhancement without paired supervision. IEEE TIP, 30, 2021.
  37. Transweather: Transformer-based restoration of images degraded by adverse weather conditions. In CVPR, 2022.
  38. Johann Heinrich Lambert. Photometria sive de mensura et gradibus luminis, colorum et umbrae. Klett, 1760.
  39. Edwin Herbert Land. The retinex theory of color vision. Scientific American, 237 6, 1977.
  40. Designing and learning trainable priors with non-cooperative games. NeurIPS, 2020a.
  41. Fully trainable and interpretable non-local sparse models for image restoration. ECCV, 2020b.
  42. Contrast enhancement based on layered difference representation of 2d histograms. IEEE TIP, 22(12), 2013a.
  43. Contrast enhancement based on layered difference representation of 2d histograms. IEEE TIP, 22(12), 2013b.
  44. Benchmarking single-image dehazing and beyond. IEEE TIP, 28(1), 2019.
  45. All-In-One Image Restoration for Unknown Corruption. In CVPR, 2022.
  46. Low-light image and video enhancement using deep learning: A survey. IEEE TPAMI, 2021a.
  47. Learning to enhance low-light image via zero-reference deep curve estimation. IEEE TPAMI, 2021b.
  48. Self-supervised low-light image enhancement using discrepant untrained network priors. IEEE TCSVT, 32(11), 2022.
  49. Iterative prompt learning for unsupervised backlit image enhancement. In ICCV, 2023.
  50. Dslr: Deep stacked laplacian restorer for low-light image enhancement. IEEE TMM, 23, 2021.
  51. ALISTA: Analytic weights are as good as learned weights in LISTA. In ICLR, 2019a.
  52. Benchmarking low-light image enhancement and beyond. IJCV, 129, 2021.
  53. Tape: Task-agnostic prior embedding for image restoration. In ECCV, 2022.
  54. Learning deep priors for image dehazing. In ICCV, 2019b.
  55. Getting to know low-light images with the exclusively dark dataset. CVIU, 178, 2019.
  56. Perceptual quality assessment for multi-exposure image fusion. IEEE TIP, 24(11), 2015.
  57. Toward fast, flexible, and robust low-light image enhancement. In CVPR, 2022.
  58. John J. McCann. Retinex at 50: color theory and spatial algorithms, a review. Journal of Electronic Imaging, 26, 2017.
  59. Exposure fusion: A simple and practical alternative to high dynamic range photography. Computer Graphics Forum, 28, 2009.
  60. Making a “completely blind” image quality analyzer. IEEE Signal Processing Letters, 20(3), 2013.
  61. Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing. IEEE Signal Processing Magazine, 38(2), 2021.
  62. Deep multi-scale convolutional neural network for dynamic scene deblurring. In CVPR, 2017.
  63. Psenet: Progressive self-enhancement network for unsupervised extreme-light image enhancement. In WACV, 2023.
  64. Towards unsupervised deep image enhancement with generative adversarial network. IEEE TIP, 29, 2020.
  65. Proximal algorithms. Foundations and Trends in Optimization, 1(3), 2014.
  66. Adaptive histogram equalization and its variations. CVGIP, 39(3), 1987.
  67. Wdrn: A wavelet decomposed relightnet for image relighting. In ECCV workshop, 2020.
  68. Enhanced latent space blind model for real image denoising via alternative optimization. In NeurIPS, 2022.
  69. Lr3m: Robust low-light enhancement via low-rank regularized retinex model. IEEE TIP, 29, 2020.
  70. Ali M. Reza. Realization of the contrast limited adaptive histogram equalization (clahe) for real-time image enhancement. J. VLSI Signal Process. Syst., 38(1), 2004.
  71. Kornia: an open source differentiable computer vision library for pytorch. In WACV, 2020.
  72. Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In CVPR, 2021.
  73. Hybridnet: Classification and reconstruction cooperation for semi-supervised learning. In ECCV, 2018.
  74. Semantic priors for intrinsic image decomposition. In BMVC, 2018.
  75. Semantic hierarchical priors for intrinsic image decomposition. ArXiv, abs/1902.03830, 2019.
  76. Quaternion factorized simulated exposure fusion. In ACM ICVGIP, 2023.
  77. Intrinsic image decomposition using focal stacks. In ACM ICVGIP, 2016.
  78. Nighttime visibility enhancement by increasing the dynamic range and suppression of light effects. CVPR, 2021.
  79. Interactive photo editing on smartphones via intrinsic decomposition. Computer Graphics Forum, 40(2), 2021.
  80. The GIMP Development Team. Gimp, 2023.
  81. Shoji Tominaga. Dichromatic reflection models for a variety of materials. Color Research and Application, 19, 1994.
  82. Vassilios Vonikakis. Busting image enhancement and tone-mapping algorithms. https://sites.google.com/site/vonikakis/datasets/, 2007. [Online; accessed 26-Oct-2023].
  83. A model-driven deep neural network for single image rain removal. 2020.
  84. Naturalness preserved enhancement algorithm for non-uniform illumination images. IEEE TIP, 22(9), 2013a.
  85. Naturalness preserved enhancement algorithm for non-uniform illumination images. IEEE TIP, 22(9), 2013b.
  86. Image quality assessment: From error visibility to structural similarity. IEEE TIP, 13(4), 2004.
  87. Deep retinex decomposition for low-light enhancement. In BMVC, 2018.
  88. Uretinex-net: Retinex-based deep unfolding network for low-light image enhancement. In CVPR, 2022.
  89. Learning to restore low-light images via decomposition-and-enhancement. In CVPR, 2020.
  90. Snr-aware low-light image enhancement. In CVPR, 2022.
  91. Nighttime defogging using high-low frequency decomposition and grayscale-color networks. In ECCV, 2020.
  92. Implicit neural representation for cooperative low-light image enhancement. In ICCV, 2023.
  93. Deep joint rain detection and removal from a single image. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1685–1694, 2017.
  94. Band representation-based semi-supervised low-light image enhancement: Bridging the gap between signal fidelity and perceptual quality. IEEE TIP, 30, 2021a.
  95. Sparse gradient regularized deep retinex network for robust low-light image enhancement. IEEE TIP, 30, 2021b.
  96. Unsupervised low-light image enhancement via histogram equalization prior. arXiv:2112.01766, 2021a.
  97. Ingredient-oriented multi-degradation learning for image restoration. CVPR, 2023.
  98. Zero-shot restoration of back-lit images using deep internal learning. ACM MM, 2019a.
  99. High-quality exposure correction of underexposed photos. In ACM MM, 2018a.
  100. Dual illumination estimation for robust exposure correction. Computer Graphics Forum, 38, 2019b.
  101. The unreasonable effectiveness of deep features as a perceptual metric. In CVPR, 2018b.
  102. Kindling the darkness: A practical low-light image enhancer. In ACM MM, 2019c.
  103. Beyond brightening low-light images. IJCV, 129, 2021b.
  104. Adaptive unfolding total variation network for low-light image enhancement. ICCV, 2021.
  105. Empowering low-light image enhancer through customized learnable priors. 2023.
  106. Semantic-guided zero-shot learning for low-light image/video enhancement. In WACV, 2022.
  107. Zero-shot restoration of underexposed images via robust retinex decomposition. ICME, 2020.
  108. Efficient model-driven network for shadow removal. AAAI, 2022.
Citations (1)

Summary

  • The paper introduces RSFNet, a zero-reference network that recursively factors latent specular components for efficient low-light enhancement.
  • The method unrolls optimization steps into interpretable network layers, achieving robust performance across diverse image enhancement tasks.
  • RSFNet demonstrates superior generalization on benchmarks without requiring paired training data, surpassing traditional LLE methods.

Recurrent Specularity Factorization Network for Enhanced Low-Light Images

Introduction

Low-Light Enhancement (LLE) is a critical pre-processing step for various computer vision applications that suffer from images captured under insufficient lighting conditions. Traditional LLE methods rely on manually-designed optimization strategies which do not necessarily capture all nuances of image degradation in low-light conditions. On the other hand, data-driven approaches, while effective, often lack interpretability and may not generalize well across different lighting conditions due to their reliance on extensive labeled data. This paper introduces a novel approach to LLE through Recursive Specularity Factorization (RSF), which decomposes images into multiple latent specular components, enabling efficient low-light image enhancement and supporting a variety of enhancement tasks including dehazing, deraining, and deblurring.

Recursive Specularity Factorization Network (RSFNet)

At the heart of the proposed method lies the RSFNet, a model-driven network that recursively decomposes an input image into latent specular factors. RSFNet operationalizes a new additive image factorization technique, leveraging the concept that images consist of multiple latent specular components separable through sparsity modulation. By unrolling the optimization steps into a sequence of network layers, the RSFNet accurately estimates these factors using a minimal set of parameters, making it notably efficient. The factors, by design, are interpretable and can be easily manipulated or fused for various image enhancement tasks.

Zero-Reference Low Light Enhancement

A significant advantage of the proposed method is its ability to operate in a zero-reference setting. The RSFNet does not rely on paired or unpaired supervision, making it robust across different datasets without the need for extensive training data. This quality is essential for real-world applicability where acquiring labeled data is often impractical. The paper demonstrates that RSFNet outperforms existing state-of-the-art LLE methods on standard benchmarks while ensuring better generalization across multiple datasets.

Implications and Theoretical Significance

The introduction of RSFNet marks a significant advancement in the way images are factorized for enhancement tasks. Unlike previous approaches that rely on fixed-factor models or intensity-based decompositions, RSF provides a more flexible and robust framework for understanding and manipulating the underlying components of low-light images. Theoretically, RSF's ability to generalize without the need for extensive labeled data challenges the prevailing trend in deep learning that often equates data quantity with performance. Practically, the method’s efficiency and versatility hold promise for integration into existing image processing pipelines, enhancing the performance of downstream tasks that are sensitive to lighting conditions.

Future Directions in Generative AI

The RSFNet introduces a promising direction for future research in generative AI, especially in the context of interpretable machine learning models. One potential area for further exploration is the application of RSFNet in tasks beyond LLE, such as image segmentation, object detection in low-light conditions, and even non-visual signal processing tasks where similar factorization principles could apply. Additionally, extending the RSFNet to handle dynamic scenes captured in videos presents an exciting challenge, potentially opening up new avenues for low-light video enhancement and analysis.

Conclusion

The Recursive Specularity Factorization Network offers a novel and effective solution to Low-Light Enhancement, combining the interpretability of model-based methods with the performance advantages of data-driven approaches. By decomposing images into interpretable specular components, RSFNet not only achieves superior enhancement results but also provides a versatile framework for a range of image processing tasks. As generative AI continues to evolve, the principles underpinning RSFNet could inspire new models that balance performance with interpretability and generalizability, pushing the boundaries of what's possible in low-light image processing and beyond.

X Twitter Logo Streamline Icon: https://streamlinehq.com