BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement
Abstract: Low-light videos often exhibit spatiotemporal incoherent noise, compromising visibility and performance in computer vision applications. One significant challenge in enhancing such content using deep learning is the scarcity of training data. This paper introduces a novel low-light video dataset, consisting of 40 scenes with various motion scenarios under two distinct low-lighting conditions, incorporating genuine noise and temporal artifacts. We provide fully registered ground truth data captured in normal light using a programmable motorized dolly and refine it via an image-based approach for pixel-wise frame alignment across different light levels. We provide benchmarks based on four different technologies: convolutional neural networks, transformers, diffusion models, and state space models (mamba). Our experimental results demonstrate the significance of fully registered video pairs for low-light video enhancement (LLVE) and the comprehensive evaluation shows that the models trained with our dataset outperform those trained with the existing datasets. Our dataset and links to benchmarks are publicly available at https://doi.org/10.21227/mzny-8c77.
- Seeing motion in the dark. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), October 2019.
- Seeing dynamic scene in the dark: High-quality video dataset with mechatronic alignment. In ICCV, 2021.
- Dancing in the dark: A benchmark towards general low-light video enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
- Self-supervised training for blind multi-frame video denoising. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 2724–2734, January 2021.
- Self-supervised low-light image enhancement using discrepant untrained network priors. IEEE Transactions on Circuits and Systems for Video Technology, 32(11):7332–7345, 2022.
- N. Anantrasirichai and David Bull. Contextual colorization and denoising for low-light ultra high resolution sequences. In ICIP proc., pages 1614–1618, 2021.
- A topological loss function for image denoising on a new BVI-lowlight dataset. Signal Processing, 211, 2023.
- Dancing under the stars: video denoising in starlight. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 16220–16230, 2022.
- BDD100K: A diverse driving dataset for heterogeneous multitask learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
- Learning to see moving objects in the dark. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 7323–7332, 2019.
- Supervised raw video denoising with a benchmark dataset on dynamic scenes. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2298–2307, 2020.
- Gerald Schaefer. An uncompressed benchmark image dataset for colour imaging. In 2010 IEEE International Conference on Image Processing, pages 3537–3540, 2010.
- Benchmarking denoising algorithms with real photographs. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2750–2759, 2017.
- A high-quality denoising dataset for smartphone cameras. In CVPR proc., pages 1692–1700, 2018.
- Low-light image and video enhancement using deep learning: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12):9396–9416, 2022.
- U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2015.
- Revisiting temporal alignment for video restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Enhancing low light videos by exploring high sensitivity camera noise. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 4110–4118, 2019.
- Deformable convolutional networks. In ICCV, pages 764–773, Oct 2017.
- Low-light video enhancement with synthetic event guidance. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2):1692–1700, Jun. 2023.
- Low light video enhancement using synthetic data produced with an intermediate domain mapping. In European Conference on Computer Vision, pages 103–119. Springer, 2020.
- Atmospheric turbulence mitigation for sequences with moving objects using recursive image fusion. In 2018 25th IEEE International Conference on Image Processing (ICIP), pages 2895–2899, 2018.
- Dinggang Shen. Image registration by local histogram matching. Pattern Recognition, 40(4):1161–1172, 2007.
- Image registration by template matching using normalized cross-correlation. In 2009 International Conference on Advances in Computing, Control, and Telecommunication Technologies, pages 819–822, 2009.
- Noise flow: Noise modeling with conditional normalizing flows. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 3165–3173, 2019.
- Anonymous. A spatio-temporal aligned sunet model for low-light video enhancement. In Submitting to IEEE International Conference on Image Processing, 2024.
- Denoising diffusion implicit models. ICLR, 2021.
- Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.00752, 2023.
- Vmamba: Visual state space model. arXiv preprint arXiv:2401.10166, 2024.
- Low-light image enhancement with wavelet-based diffusion models. ACM Transactions on Graphics (TOG), 42(6):1–14, 2023.
- EDVR: Video restoration with enhanced deformable convolutional networks. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2019.
- L. Sendur and I.W. Selesnick. Bivariate shrinkage functions for wavelet-based denoising exploiting interscale dependency. IEEE Transactions on Signal Processing, 50(11):2744–2756, 2002.
- Vision mamba: Efficient visual representation learning with bidirectional state space model. arXiv preprint arXiv:2401.09417, 2024.
- Swinir: Image restoration using swin transformer. In 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pages 1833–1844, 2021.
- Alex O. Holcombe. Seeing slow and seeing fast: two limits on perception. Trends in Cognitive Sciences, pages 216–221, 2009.
- High-resolution image synthesis and semantic manipulation with conditional gans. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8798–8807, 2018.
- Real image denoising with feature attention. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 3155–3164, 2019.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.