Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance (2405.09996v1)
Abstract: Real driving-video dehazing poses a significant challenge due to the inherent difficulty in acquiring precisely aligned hazy/clear video pairs for effective model training, especially in dynamic driving scenarios with unpredictable weather conditions. In this paper, we propose a pioneering approach that addresses this challenge through a nonaligned regularization strategy. Our core concept involves identifying clear frames that closely match hazy frames, serving as references to supervise a video dehazing network. Our approach comprises two key components: reference matching and video dehazing. Firstly, we introduce a non-aligned reference frame matching module, leveraging an adaptive sliding window to match high-quality reference frames from clear videos. Video dehazing incorporates flow-guided cosine attention sampler and deformable cosine attention fusion modules to enhance spatial multiframe alignment and fuse their improved information. To validate our approach, we collect a GoProHazy dataset captured effortlessly with GoPro cameras in diverse rural and urban road environments. Extensive experiments demonstrate the superiority of the proposed method over current state-of-the-art methods in the challenging task of real driving-video dehazing. Project page.
- Nh-haze: An image dehazing benchmark with non-homogeneous hazy and haze-free images. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), pages 444–445, 2020.
- Non-local image dehazing. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 1674–1682, 2016.
- Basicvsr++: Improving video super-resolution with enhanced propagation and alignment. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 5972–5981, 2022.
- Robust image and video dehazing with visual artifact suppression via gradient residual minimization. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, pages 576–591. Springer, 2016.
- Robust video content alignment and compensation for rain removal in a cnn framework. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 6286–6295, 2018.
- Unpaired deep image dehazing using contrastive disentanglement learning. In European Conference on Computer Vision (ECCV), pages 632–648. Springer, 2022.
- Psd: Principled synthetic-to-real dehazing guided by physical priors. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 7180–7189, 2021.
- Referenceless prediction of perceptual fog density and perceptual image defogging. IEEE Transactions on Image Processing (TIP), 24(11):3888–3901, 2015.
- Discrete haze level dehazing network. In Proceedings of the 28th ACM International Conference on Multimedia (ACMMM), pages 1828–1836, 2020.
- Deformable convolutional networks. In Proceedings of the IEEE international conference on computer vision (ICCV), pages 764–773, 2017.
- Video demoireing with relation-based temporal consistency. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17622–17631, 2022.
- Hardgan: A haze-aware representation distillation gan for single image dehazing. In European conference on computer vision (ECCV), pages 722–738. Springer, 2020.
- Deep multi-model fusion for single-image dehazing. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 2453–2462, 2019.
- Fd-gan: Generative adversarial networks with fusion-discriminator for single image dehazing. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pages 10729–10736, 2020.
- Non-aligned supervision for real image dehazing. arXiv preprint arXiv:2303.04940, 2023.
- Raanan Fattal. Dehazing using color-lines. ACM transactions on graphics (TOG), 34(1):1–14, 2014.
- Generative adversarial nets. Advances in neural information processing systems (NeurIPS), 27, 2014.
- Image dehazing transformer with transmission-aware 3d position embedding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5812–5820, 2022.
- Fog simulation on real lidar point clouds for 3d object detection in adverse weather. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 15283–15292, 2021.
- Single image haze removal using dark channel prior. IEEE transactions on pattern analysis and machine intelligence (TPAMI), 33(12):2341–2353, 2010.
- Neural compression-based feature learning for video restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5872–5881, 2022.
- Spatio-temporal transformer network for video restoration. In Proceedings of the European conference on computer vision (ECCV), pages 106–122, 2018.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- End-to-end united video dehazing and detection. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2018a.
- Benchmarking single-image dehazing and beyond. IEEE Transactions on Image Processing (TIP), 28(1):492–505, 2018b.
- Zero-shot image dehazing. IEEE Transactions on Image Processing (TIP), 29:8457–8466, 2020.
- You only look yourself: Unsupervised and untrained single image dehazing neural network. International Journal of Computer Vision (ICCV), 129:1754–1767, 2021.
- Domain adaptive object detection for autonomous driving under foggy weather. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 612–622, 2023.
- Single image dehazing via conditional generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 8202–8211, 2018c.
- Heavy rain image restoration: Integrating physics model and conditional adversarial learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 1633–1642, 2019a.
- Lap-net: Level-aware progressive network for image dehazing. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 3276–3285, 2019b.
- Simultaneous video defogging and stereo reconstruction. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 4988–4997, 2015.
- Recurrent video restoration transformer with guided deformable attention. Advances in Neural Information Processing Systems (NeurIPS), 35:378–393, 2022.
- Towards multi-domain single image dehazing via test-time training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5831–5840, 2022a.
- Erase or fill? deep joint recurrent rain removal and reconstruction in videos. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 3233–3242, 2018.
- Griddehazenet: Attention-based multi-scale network for image dehazing. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 7314–7323, 2019a.
- Learning deep priors for image dehazing. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 2492–2500, 2019b.
- Phase-based memory network for video dehazing. In Proceedings of the 30th ACM International Conference on Multimedia (ACMMM), pages 5427–5435, 2022b.
- Earl J McCartney. Optics of the atmosphere: scattering by molecules and particles. New York, 1976.
- The contextual loss for image transformation with non-aligned data. In Proceedings of the European conference on computer vision (ECCV), pages 768–783, 2018.
- Making a “completely blind” image quality analyzer. IEEE Signal processing letters, 20(3):209–212, 2012.
- Image dehazing by joint estimation of transmittance and airlight using bi-directional consistency loss minimized fcn. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (CVPRW), pages 920–928, 2018.
- Vision and the atmosphere. International journal of computer vision (IJCV), 48:233–254, 2002.
- Bidnet: Binocular image dehazing without explicit disparity estimation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 5931–5940, 2020.
- Multi-weather image restoration via domain translation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 21696–21705, 2023.
- Ffa-net: Feature fusion attention network for single image dehazing. In Proceedings of the AAAI conference on artificial intelligence (AAAI), pages 11908–11915, 2020.
- Mb-taylorformer: Multi-branch efficient transformer expanded by taylor formula for image dehazing. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 12802–12813, 2023.
- Enhanced pix2pix dehazing network. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 8160–8168, 2019.
- Optical flow estimation using a spatial pyramid network. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 4161–4170, 2017.
- Deep video dehazing with semantic segmentation. IEEE transactions on image processing (TIP), 28(4):1895–1908, 2018.
- Domain adaptation for image dehazing. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 2808–2817, 2020.
- Towards domain invariant single image dehazing. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pages 9657–9665, 2021.
- Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
- Simultaneous deep stereo matching and dehazing with feature attention. International Journal of Computer Vision (IJCV), 128:799–817, 2020.
- Tdan: Temporally-deformable alignment network for video super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 3360–3369, 2020.
- Transweather: Transformer-based restoration of images degraded by adverse weather conditions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2353–2363, 2022.
- Edvr: Video restoration with enhanced deformable convolutional networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), pages 0–0, 2019.
- Accurate transmission estimation for removing haze and noise from a single image. IEEE transactions on image processing (TIP), 29:2583–2597, 2019.
- Ridcp: Revitalizing real image dehazing via high-quality codebook priors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 22282–22291, 2023.
- Video dehazing via a multi-range temporal alignment network with physical prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 18053–18062, 2023.
- Frame-consistent recurrent video deraining with dual-level flow. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 1661–1670, 2019.
- Towards perceptual image dehazing by physics-based disentanglement and adversarial training. In Proceedings of the AAAI conference on artificial intelligence (AAAI), 2018.
- Self-augmented unpaired image dehazing via density and depth decomposition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pages 2037–2046, 2022.
- Video adverse-weather-component suppression network via weather messenger and adversarial backpropagation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 13200–13210, 2023.
- Perceiving and modeling density for image dehazing. In European Conference on Computer Vision (ECCV), pages 130–145. Springer, 2022.
- Source-free domain adaptation for real-world image dehazing. In Proceedings of the 30th ACM International Conference on Multimedia (ACMMM), pages 6645–6654, 2022a.
- Memory-augmented non-local attention for video super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17834–17843, 2022b.
- Densely connected pyramid dehazing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 3194–3203, 2018.
- Joint transmission map estimation and dehazing using deep networks. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 30(7):1975–1986, 2019.
- Spatio-temporal deformable attention network for video deblurring. In European Conference on Computer Vision (ECCV), pages 581–596. Springer, 2022.
- Learning to restore hazy video: A new real-world dataset and a new method. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9239–9248, 2021.
- Refinednet: A weakly supervised refinement framework for single image dehazing. IEEE Transactions on Image Processing (TIP), 30:3391–3404, 2021.
- Fast segment anything. arXiv preprint arXiv:2306.12156, 2023.
- Spatio-temporal filter adaptive network for video deblurring. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pages 2482–2491, 2019.
- A fast single image haze removal algorithm using color attenuation prior. IEEE transactions on image processing (TIP), 24(11):3522–3533, 2015.