Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance (2405.09996v1)

Published 16 May 2024 in cs.CV

Abstract: Real driving-video dehazing poses a significant challenge due to the inherent difficulty in acquiring precisely aligned hazy/clear video pairs for effective model training, especially in dynamic driving scenarios with unpredictable weather conditions. In this paper, we propose a pioneering approach that addresses this challenge through a nonaligned regularization strategy. Our core concept involves identifying clear frames that closely match hazy frames, serving as references to supervise a video dehazing network. Our approach comprises two key components: reference matching and video dehazing. Firstly, we introduce a non-aligned reference frame matching module, leveraging an adaptive sliding window to match high-quality reference frames from clear videos. Video dehazing incorporates flow-guided cosine attention sampler and deformable cosine attention fusion modules to enhance spatial multiframe alignment and fuse their improved information. To validate our approach, we collect a GoProHazy dataset captured effortlessly with GoPro cameras in diverse rural and urban road environments. Extensive experiments demonstrate the superiority of the proposed method over current state-of-the-art methods in the challenging task of real driving-video dehazing. Project page.

References (75)

Citations (4)

View on Semantic Scholar

Summary

The paper introduces a non-aligned regularization strategy combining NRFM, FCAS, and DCAF modules to improve dehazing in dynamic driving scenarios.
It employs an adaptive sliding window and optical flow-guided cosine attention to align multi-frame features without strict ground-truth pairs.
Experiments on the GoProHazy dataset reveal superior dehazing performance with enhanced FADE and NIQE scores, promising improved safety for autonomous vehicles.

An Examination of Video Dehazing Techniques in Driving Scenarios

The paper "Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance" introduces a sophisticated framework for addressing the video dehazing challenges prevalent in real-world driving environments. This work is particularly significant considering the adverse impacts of haze, such as reduced visibility and contrast, which pose safety risks for autonomous driving systems. The authors tackle the intrinsic difficulty of obtaining accurately aligned hazy/clear video pairs necessary for model training, which is especially challenging in dynamic driving scenarios with unpredictable weather conditions.

Methodology Overview

The proposed dehazing method is centered around a novel non-aligned regularization strategy. This approach consists of two primary components: non-aligned reference frame matching and video dehazing. The authors introduced a Non-aligned Reference Frame Matching (NRFM) module that employs an adaptive sliding window to match high-quality clear reference frames from non-aligned videos. This strategy relaxes the requirement for precisely aligned ground truth during training.

For the video dehazing component, the paper introduces a Flow-guided Cosine Attention Sampler (FCAS) and a Deformable Cosine Attention Fusion (DCAF) module. FCAS improves multi-frame alignment by employing coarse optical flow to guide multi-scale cosine attention sampling. This enables the model to tackle large motion challenges in driving videos by improving spatial alignment. In contrast, DCAF leverages deformable convolution and cosine similarity for better multi-frame feature fusion, thereby enhancing dehazing performance.

Dataset and Results

The authors curated a real-world video dataset named GoProHazy, captured using GoPro cameras in various rural and urban road settings. This dataset was used to validate the proposed methodologies under realistic and diverse conditions. Through extensive experimentation, the proposed method showcased superiority over contemporary state-of-the-art techniques. For instance, the FADE and NIQE scores, which are common metrics for evaluating dehazing quality, demonstrated the method's enhanced performance in real-world video dehazing tasks without requiring strict temporal and spatial alignment of training data.

Implications and Future Directions

The presented approach promises improved visibility in autonomous vehicle systems, particularly under hazy conditions, ultimately enhancing safety. The framework's ability to operate effectively even with non-aligned video input is particularly notable, as it suggests potential applications in other domains where acquiring aligned data is difficult.

The paper's contributions open several avenues for future research. These include exploring the integration of this framework with other adverse weather conditions like rain or snow, extending its applicability across various sensor types beyond RGB cameras, and refining the model to operate in real-time scenarios. Moreover, the techniques developed could be further investigated to improve robustness and adapt to different haze intensities and dynamic driving environments.

Overall, this research makes a significant contribution to the field of computer vision and autonomous systems, addressing complex challenges with innovative solutions and demonstrating practical applicability through its superior results in real-world settings.

PDF Markdown

Tweets

https://twitter.com/CSVisionPapers/status/1791613767815504088

YouTube

Show All Videos