Distort-and-Recover: Color Enhancement using Deep Reinforcement Learning (1804.04450v2)

Published 12 Apr 2018 in cs.CV

Abstract: Learning-based color enhancement approaches typically learn to map from input images to retouched images. Most of existing methods require expensive pairs of input-retouched images or produce results in a non-interpretable way. In this paper, we present a deep reinforcement learning (DRL) based method for color enhancement to explicitly model the step-wise nature of human retouching process. We cast a color enhancement process as a Markov Decision Process where actions are defined as global color adjustment operations. Then we train our agent to learn the optimal global enhancement sequence of the actions. In addition, we present a 'distort-and-recover' training scheme which only requires high-quality reference images for training instead of input and retouched image pairs. Given high-quality reference images, we distort the images' color distribution and form distorted-reference image pairs for training. Through extensive experiments, we show that our method produces decent enhancement results and our DRL approach is more suitable for the 'distort-and-recover' training scheme than previous supervised approaches. Supplementary material and code are available at https://sites.google.com/view/distort-and-recover/

Citations (189)

Summary

  • The paper introduces a novel deep reinforcement learning method for color enhancement using a 'distort-and-recover' scheme trained on reference images, eliminating paired data needs.
  • Experiments show the DRL approach achieves competitive enhancement results and superior robustness against varying input distributions compared to supervised methods.
  • This approach overcomes dataset limitation bias, offers practical application in photo editing software for automated enhancement, and provides a robust foundation for future work.

Distort-and-Recover: Color Enhancement using Deep Reinforcement Learning

The paper "Distort-and-Recover: Color Enhancement using Deep Reinforcement Learning" presents an innovative approach to automatic color enhancement, leveraging deep reinforcement learning (DRL). The authors tackle the limitations of current learning-based methods, which often require costly datasets of input-retouched image pairs and produce results that lack interpretability.

Core Methodology

The proposed approach models the color enhancement process as a Markov Decision Process (MDP), with actions defined as global color adjustments such as contrast, brightness, and white-balance modifications. The process is iterative, mirroring the step-by-step nature of human retouching. A Deep Q-Network (DQN), a value-based DRL algorithm, is employed to learn the sequence of actions that optimally enhances image colors. Crucially, the method introduces a novel "distort-and-recover" training scheme that eliminates the need for paired datasets: it takes only high-quality reference images, randomly distorts their color distributions, and pairs each distorted image with its original reference for training.
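The two ideas above can be sketched in a few lines. The snippet below is an illustrative mock-up, not the authors' implementation: the action set, adjustment magnitudes, and the number of distortion steps are all assumptions chosen for clarity. It shows (a) a discrete action space of global color operations that act as MDP transitions, and (b) how a (distorted, reference) training pair is synthesized from a single high-quality image.

```python
import numpy as np

# A tiny discrete action space of global color operations.
# The specific ops and magnitudes are illustrative assumptions,
# not the paper's exact action set.
ACTIONS = {
    "brightness_up":   lambda img: img + 0.05,
    "brightness_down": lambda img: img - 0.05,
    "contrast_up":     lambda img: 0.5 + 1.1 * (img - 0.5),
    "contrast_down":   lambda img: 0.5 + 0.9 * (img - 0.5),
}

def apply_action(image, name):
    """One MDP transition: apply a global adjustment, then clip to [0, 1]."""
    return np.clip(ACTIONS[name](image.astype(np.float32)), 0.0, 1.0)

def make_training_pair(reference, rng, num_steps=3):
    """'Distort-and-recover' pair synthesis: randomly distort a
    high-quality reference image so the agent can learn to recover it,
    removing the need for human-retouched targets."""
    distorted = reference.astype(np.float32)
    for _ in range(num_steps):
        distorted = apply_action(distorted, rng.choice(list(ACTIONS)))
    return distorted, reference

rng = np.random.default_rng(0)
reference = rng.random((64, 64, 3)).astype(np.float32)  # stand-in for a photo
distorted, target = make_training_pair(reference, rng)
```

During training, the agent observes `distorted`, selects actions step by step, and is rewarded for moving the image closer to `target`; at test time it applies the learned action sequence to unseen inputs.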

Experiments and Results

Extensive experiments demonstrate that the DRL approach achieves competitive color-enhancement results without requiring traditional paired datasets. The authors report that their method yields "decent enhancement results" and avoids several disadvantages of supervised approaches. On the MIT-Adobe FiveK dataset, the DRL model trained with the distort-and-recover scheme remains robust to shifts in the input image distribution, a significant advantage over traditional supervised models.

Theoretical and Practical Implications

This approach opens several avenues for future research and development. It suggests a path forward in overcoming the dataset limitation bias inherent in supervised learning methods by facilitating efficient training with reference images that are easily accessible. Practically, the application of this method could improve user experience in image editing software, enabling personalized and adaptive color enhancement with minimal user intervention. The ability to integrate this DRL-based approach into photo retouching applications like Adobe Photoshop or Lightroom could democratize professional-level editing by automating a task that typically requires significant expertise.

Future Directions

Looking ahead, the exploration of more sophisticated reinforcement learning algorithms or the integration of Generative Adversarial Networks (GANs) could further improve the capability and flexibility of automatic color enhancement systems. Moreover, future developments might involve refining the action space to include more complex or context-aware operations, thereby enhancing the model's adaptability to varied photographic styles or user preferences.

In summary, this paper proposes a distinct methodology that significantly reduces the data requirement burden while effectively modeling human-like retouching processes through DRL. It innovatively addresses the challenge of non-linear and subjective photo retouching, providing a robust foundation for future work in automatic image enhancement and editing tools.