Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning (1804.03312v1)

Published 10 Apr 2018 in cs.CV

Abstract: We investigate a novel approach for image restoration by reinforcement learning. Unlike existing studies that mostly train a single large network for a specialized task, we prepare a toolbox consisting of small-scale convolutional networks of different complexities and specialized in different tasks. Our method, RL-Restore, then learns a policy to select appropriate tools from the toolbox to progressively restore the quality of a corrupted image. We formulate a step-wise reward function proportional to how well the image is restored at each step to learn the action policy. We also devise a joint learning scheme to train the agent and tools for better performance in handling uncertainty. In comparison to conventional human-designed networks, RL-Restore is capable of restoring images corrupted with complex and unknown distortions in a more parameter-efficient manner using the dynamically formed toolchain.

Citations (170)

View on Semantic Scholar

Summary

An Analysis of "Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning"

The paper "Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning" introduces an innovative method for addressing image restoration challenges using reinforcement learning (RL). The authors explore the effectiveness of a selective approach to using a toolkit of small convolutional neural networks (CNNs) rather than deploying a monolithic model to tackle diverse image degradation issues. This approach, termed RL-Restore, pioneers the application of RL in the domain of image processing, specifically for mixed and complex types of distortion, including Gaussian blur, noise, and JPEG compression artifacts.

Unlike conventional solutions where complex tasks are resolved by a single large-capacity CNN, this research advocates for a modular method. The proposal involves using a suite of specialized, smaller CNNs, each tailored to tackle different degradation levels and types. This paradigmatic shift effectively navigates the trade-off between computational expense and restoration capability, showing that sophisticated restoration can be achieved without inherently large computational costs.

The core of RL-Restore lies in its ability to adaptively form a chain of actions (a toolchain) to restore degraded images progressively. The RL agent dynamically selects tools from the toolbox by evaluating each corrupt image and deciding the best sequence of actions to enhance image quality. The agent is trained to maximize cumulative rewards that are proportional to the degree of image restoration achieved in each step. This framework is rendered efficient by implementing a stopping function that allows the agent to terminate the restoration process once optimal conditions are met, thereby preventing over-processing.

Quantitative evaluations presented are striking. On the DIV2K dataset, RL-Restore demonstrates comparable performance to established models like DnCNN and VDSR in mild to moderately corrupted test sets, providing evidence for its generalization capability. More importantly, RL-Restore surpasses these models on severely corrupted datasets, reinforcing its adaptability to mixings of real-world distortions. This suggests that RL-Restore’s dynamic tool selection confers upon it a responsiveness to unknown or more complex distortions, which static, single-CNN models appear less equipped to handle.

The implications of this research extend beyond efficiencies in parameter use. RL-Restore enhances transparency in the restoration process by revealing intermediate steps in distortion correction. The toolchain's flexibility fosters innovation in addressing real-world restoration challenges beyond current restoration capability paradigms. Future research might focus on integrating further sophisticated distortion-specific tools and exploring alternative reward functions to refine the restoration process further. Additionally, applications of this technology could branch into other fields of computer vision where RL could be pivotal in adaptive decision-making.

In conclusion, RL-Restore exemplifies a fresh and adaptable approach to image restoration, advancing the field by challenging the status quo of monolithic model deployment. This paper enriches the toolkit available for addressing the complexities of real-world imaging distortions and sets a compelling precedent for RL applications in low-level vision tasks.

Related Papers

YouTube

Show All Videos