MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space (2405.16105v1)
Abstract: Recent advances in low light image enhancement have been dominated by Retinex-based learning framework, leveraging convolutional neural networks (CNNs) and Transformers. However, the vanilla Retinex theory primarily addresses global illumination degradation and neglects local issues such as noise and blur in dark conditions. Moreover, CNNs and Transformers struggle to capture global degradation due to their limited receptive fields. While state space models (SSMs) have shown promise in the long-sequence modeling, they face challenges in combining local invariants and global context in visual data. In this paper, we introduce MambaLLIE, an implicit Retinex-aware low light enhancer featuring a global-then-local state space design. We first propose a Local-Enhanced State Space Module (LESSM) that incorporates an augmented local bias within a 2D selective scan mechanism, enhancing the original SSMs by preserving local 2D dependency. Additionally, an Implicit Retinex-aware Selective Kernel module (IRSK) dynamically selects features using spatially-varying operations, adapting to varying inputs through an adaptive kernel selection process. Our Global-then-Local State Space Block (GLSSB) integrates LESSM and IRSK with LayerNorm as its core. This design enables MambaLLIE to achieve comprehensive global long-range modeling and flexible local feature aggregation. Extensive experiments demonstrate that MambaLLIE significantly outperforms state-of-the-art CNN and Transformer-based methods. Project Page: https://mamballie.github.io/anon/
- A dynamic histogram equalization for image contrast enhancement. IEEE Trans. Consumer Electron., 53(2):593–600, 2007.
- Learning a deep single image contrast enhancer from multi-exposure images. IEEE Trans. Image Process., 27(4):2049–2062, 2018.
- Retinexformer: One-stage retinex-based transformer for low-light image enhancement. In ICCV, pages 12504–12513, October 2023.
- Seeing motion in the dark. In ICCV, pages 3184–3193. IEEE, 2019.
- Hany Farid. Blind inverse gamma correction. IEEE Trans. Image Process., 10(10):1428–1433, 2001.
- Hungry hungry hippos: Towards language modeling with state space models. In ICLR. OpenReview.net, 2023.
- Learning a simple low-light image enhancer from paired low-light instances. In CVPR, pages 22252–22261, 2023.
- Mamba: Linear-time sequence modeling with selective state spaces. ArXiv, abs/2312.00752, 2023.
- On the parameterization and initialization of diagonal state space models. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, and A. Oh, editors, NeurIPS, 2022.
- Efficiently modeling long sequences with structured state spaces. In ICLR, 2022.
- Combining recurrent, convolutional, and continuous-time models with linear state space layers. In NeurIPS, pages 572–585, 2021.
- Interpreting super-resolution networks with local attribution maps. In CVPR, pages 9199–9208. Computer Vision Foundation / IEEE, 2021.
- Zero-reference deep curve estimation for low-light image enhancement. In CVPR, pages 1777–1786, 2020.
- Mambair: A simple baseline for image restoration with state-space model. arXiv preprint arXiv:2402.15648, 2024.
- Localmamba: Visual state space model with windowed selective scan. arXiv preprint arXiv:2403.09338, 2024.
- Novel lmi conditions for observer-based stabilization of lipschitzian nonlinear systems and uncertain linear systems in discrete-time. Applied Mathematics and Computation, 206(2):579–588, 2008.
- Enlightengan: Deep light enhancement without paired supervision. IEEE Trans. Image Process., 30:2340–2349, 2021.
- Llformer: An efficient and real-time lidar lane detection method based on transformer. In Wenbing Zhao and Xinguo Yu, editors, PRIS, pages 18–23, 2023.
- Adam: A method for stochastic optimization. In ICLR (Poster), 2015.
- Lightness and retinex theory. Josa, 61(1):1–11, 1971.
- Low-light image and video enhancement using deep learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell., 44(12):9396–9416, 2022.
- Videomamba: State space model for efficient video understanding. arXiv preprint arXiv:2403.06977, 2024.
- Large selective kernel network for remote sensing object detection. In ICCV, pages 16748–16759. IEEE, 2023.
- Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In CVPR, pages 10561–10570. Computer Vision Foundation / IEEE, 2021.
- Vmamba: Visual state space model. arXiv preprint arXiv:2401.10166, 2024.
- Getting to know low-light images with the exclusively dark dataset. Comput. Vis. Image Underst., 178:30–42, 2019.
- LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognit., 61:650–662, 2017.
- Understanding the effective receptive field in deep convolutional neural networks. NeurIPS, 29, 2016.
- MBLLEN: low-light image/video enhancement using cnns. In BMVC, page 220. BMVA Press, 2018.
- Unsupervised low-light video enhancement with spatial-temporal co-attention transformer. IEEE Trans. Image Process., 32:4701–4715, 2023.
- Toward fast, flexible, and robust low-light image enhancement. In CVPR, pages 5627–5636, 2022.
- S4ND: modeling images and videos as multidimensional signals with state spaces. In NeurIPS, 2022.
- Pytorch: An imperative style, high-performance deep learning library. In NeurIPS, pages 8024–8035, 2019.
- Efficientvmamba: Atrous selective scan for light weight visual mamba. arXiv preprint arXiv:2403.09977, 2024.
- S4++: Elevating long sequence modeling with state memory reply, 2024.
- Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.
- Di-retinex: Digital-imaging retinex theory for low-light image enhancement. arXiv preprint arXiv:2404.03327, 2024.
- Attention is all you need. In NeurIPS, pages 5998–6008, 2017.
- Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment. In ICCV, pages 9680–9689, 2021.
- Underexposed photo enhancement using deep illumination estimation. In CVPR, pages 6849–6857, 2019.
- Zero-reference low-light enhancement via physical quadruple priors, 2024.
- Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process., 13(4):600–612, 2004.
- Deep retinex decomposition for low-light enhancement. In BMVC, page 155, 2018.
- Jan C Willems. From time series to linear system—part i. finite dimensional linear time invariant systems. Automatica, 22:561–580, 1986.
- Uretinex-net: Retinex-based deep unfolding network for low-light image enhancement. In CVPR, pages 5891–5900. IEEE, 2022.
- Snr-aware low-light image enhancement. In CVPR, pages 17693–17703. IEEE, 2022.
- Observer-based robust controller design for a linear system with time-varying perturbations. Journal of Mathematical Analysis and applications, 213(2):642–661, 1997.
- Sparse gradient regularized deep retinex network for robust low-light image enhancement. IEEE Trans. Image Process., 30:2072–2086, 2021.
- Diff-retinex: Rethinking low-light image enhancement with A generative diffusion model. In ICCV, pages 12268–12277. IEEE, 2023.
- Restormer: Efficient transformer for high-resolution image restoration. In CVPR, pages 5718–5729. IEEE, 2022.
- Learning enriched features for real image restoration and enhancement. In ECCV.
- Kindling the darkness: A practical low-light image enhancer. In ACM Multimedia, pages 1632–1640. ACM, 2019.
- Vision mamba: Efficient visual representation learning with bidirectional state space model. arXiv preprint arXiv:2401.09417, 2024.
- Efficient long sequence modeling via state space augmented transformer. arXiv preprint arXiv:2212.08136, 2022.
- Jiangwei Weng (3 papers)
- Zhiqiang Yan (43 papers)
- Ying Tai (88 papers)
- Jianjun Qian (24 papers)
- Jian Yang (505 papers)
- Jun Li (778 papers)