- The paper introduces UWGAN as an unsupervised GAN that simulates underwater conditions using an improved imaging model for effective color restoration.
- It employs a two-step approach, first generating synthetic underwater images and then enhancing them using an end-to-end U-Net architecture.
- The method runs at up to 125 FPS on an NVIDIA 1060 GPU and outperforms existing techniques on the UIQM metric while preserving color fidelity and structural detail.
UWGAN: Underwater GAN for Real-world Underwater Color Restoration and Dehazing
In their paper, the authors introduce a novel unsupervised generative adversarial network (GAN), termed UWGAN, designed to address the central challenges of underwater image processing: color restoration and dehazing. The fundamental problems in underwater imagery stem from wavelength-dependent light attenuation and back-scattering, which cause significant color distortion and haze that compromise visibility. The paper advances the field by proposing a new method for generating realistic underwater images from clear in-air images, using paired depth maps to guide the process.
Methodology and Results
The paper describes a two-step process: first the generation of synthetic underwater images, then their enhancement. To simulate underwater conditions, the authors improve the traditional underwater imaging model to capture the interplay of light attenuation and scattering. This model underpins the training of UWGAN, which takes in-air RGB-D images and underwater image samples from specific survey sites as inputs. The GAN generates realistic underwater images through adversarial learning.
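To make the imaging model concrete, the sketch below implements a simplified version of the standard underwater image-formation equation that work of this kind builds on. The symbols and the attenuation/veiling-light values are illustrative assumptions, not numbers from the paper: for each color channel c, I_c(x) = J_c(x)·t_c(x) + B_c·(1 − t_c(x)), with transmission t_c(x) = exp(−β_c·d(x)), where J is the clear in-air image, d the depth map, B_c the background (veiling) light, and β_c a wavelength-dependent attenuation coefficient.

```python
import numpy as np

def synthesize_underwater(J: np.ndarray, depth: np.ndarray,
                          beta=(0.9, 0.35, 0.25),   # illustrative: red attenuates fastest
                          B=(0.05, 0.35, 0.45)):    # illustrative bluish veiling light
    """J: HxWx3 float RGB in [0,1]; depth: HxW depth map (arbitrary units)."""
    beta = np.asarray(beta, dtype=float)
    B = np.asarray(B, dtype=float)
    # Per-channel transmission from depth and attenuation coefficients.
    t = np.exp(-beta[None, None, :] * depth[..., None])
    # Direct (attenuated) component plus back-scattered veiling light.
    I = J * t + B[None, None, :] * (1.0 - t)
    return np.clip(I, 0.0, 1.0)

# Tiny demo: a mid-gray scene loses the red channel first as depth grows.
J = np.full((2, 2, 3), 0.5)
depth = np.array([[1.0, 2.0], [4.0, 8.0]])
I = synthesize_underwater(J, depth)
```

At large depths the synthesized pixel converges toward the veiling light B, which is exactly the haze effect the enhancement stage must undo.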
Following this, the authors employ a U-Net architecture, an end-to-end autoencoder network, to perform color restoration and dehazing. The U-Net is trained on the synthetic dataset generated by the GAN, allowing it to effectively reconstruct clear underwater scenes. Notably, the model operates at up to 125 frames per second on an NVIDIA 1060 GPU.
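The U-Net data flow can be sketched at the shape level as follows. This is an illustration of the encoder-decoder-with-skip-connections idea, not the paper's trained network: average pooling and nearest-neighbor upsampling stand in for the learned convolution and transposed-convolution layers.

```python
import numpy as np

def down(x):
    # Encoder step: 2x2 average pooling halves spatial resolution.
    h, w, c = x.shape
    return x.reshape(h // 2, 2, w // 2, 2, c).mean(axis=(1, 3))

def up(x):
    # Decoder step: nearest-neighbor 2x upsampling restores resolution.
    return x.repeat(2, axis=0).repeat(2, axis=1)

def unet_forward(x):
    e1 = x                   # encoder level 1: full resolution
    e2 = down(e1)            # encoder level 2: half resolution
    bottleneck = down(e2)    # coarsest features
    # Skip connections: concatenate encoder features with upsampled
    # decoder features at the matching scale.
    d2 = np.concatenate([up(bottleneck), e2], axis=-1)
    d1 = np.concatenate([up(d2), e1], axis=-1)
    return d1

x = np.random.rand(8, 8, 3)
y = unet_forward(x)          # same spatial size as input, more channels
```

The skip connections are what let the network preserve fine structural detail while correcting global color and haze, which is consistent with the structural-similarity results reported in the paper.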
The experimental evaluations include both qualitative and quantitative comparisons against several existing methods such as UCM, histogram equalization (HE), and UGAN. The UWGAN model demonstrates superior performance in restoring color fidelity and improving image clarity while preserving structural similarity. Notably, the method achieves top scores on the UIQM metric, supporting its suitability for real-world application.
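For readers unfamiliar with UIQM, the sketch below shows how such a score is assembled. UIQM is commonly reported as a weighted sum of three components: colorfulness (UICM), sharpness (UISM), and contrast (UIConM); the weights used here are the ones typically cited in the literature (Panetta et al.), and the colorfulness term is a simplified version without the alpha-trimmed statistics of the full measure. The sharpness and contrast components are taken as precomputed inputs for illustration.

```python
import numpy as np

C1, C2, C3 = 0.0282, 0.2953, 3.5753  # commonly cited UIQM component weights

def uicm_simplified(img: np.ndarray) -> float:
    """img: HxWx3 float RGB in [0,1]. Simplified colorfulness measure."""
    R, G, B = img[..., 0], img[..., 1], img[..., 2]
    rg = R - G               # red-green opponent channel
    yb = 0.5 * (R + G) - B   # yellow-blue opponent channel
    mu = np.hypot(rg.mean(), yb.mean())    # chromatic mean magnitude
    sigma = np.hypot(rg.std(), yb.std())   # chromatic spread
    return -0.0268 * mu + 0.1586 * sigma

def uiqm(img: np.ndarray, uism: float, uiconm: float) -> float:
    return C1 * uicm_simplified(img) + C2 * uism + C3 * uiconm

gray = np.full((4, 4, 3), 0.5)             # achromatic image: zero colorfulness
score = uiqm(gray, uism=1.0, uiconm=1.0)
```

The heavy weight on the contrast term helps explain why dehazing methods, which restore lost contrast, tend to score well on UIQM.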
Implications and Future Work
This research makes a significant contribution to the field of underwater image enhancement by presenting a robust method for simulating and mitigating underwater visual distortions. The practical implications are substantial, given the reliance on visual data in applications such as seabed exploration and underwater archaeology. The dual-stage approach—image generation followed by restoration—offers a comprehensive framework adaptable to various underwater scenarios.
In terms of future research directions, the authors acknowledge the potential to refine the proposed model further by exploring alternative network architectures and loss functions. Additionally, increasing the diversity and scope of training datasets could enhance the model's generalizability across different underwater environments.
The study also prompts interest in the broader implications of unsupervised learning techniques in addressing domain-specific distortions, not just limited to underwater imaging. With continued exploration and validation in diverse conditions, the foundational work presented in UWGAN could catalyze advancements in related fields, fostering developments that extend beyond current underwater imaging challenges.