- The paper proposes a novel method for shadow removal by integrating physical illumination models with deep learning techniques through a dual-network approach to decompose and relight images.
- Empirical evaluations show a roughly 40% reduction in RMSE on the ISTD dataset, from 13.3 to 7.9, relative to prior state-of-the-art methods.
- The proposed framework offers a robust method for pre-processing in computer vision applications and introduces a data augmentation technique for generating synthetic images with varying shadow effects.
An Evaluation of "Shadow Removal via Shadow Image Decomposition"
The paper "Shadow Removal via Shadow Image Decomposition" introduces a novel method for removing shadows from images by integrating physical models of shadow formation with deep learning techniques. The authors propose a dual-network approach to estimate shadow-related parameters, which enables an image decomposition framework to effectively relight shadowed regions.
Core Contributions and Methodology
The paper’s fundamental proposition is a linear illumination model that characterizes how shadows affect image intensities. This model expresses a shadowed pixel's intensity as a linear transformation of its shadow-free counterpart, involving a scaling factor and an additive constant, both specific to each color channel. Such an approach is inspired by traditional physical models but is distinct in its application of deep learning for parameter estimation.
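To make the linear model concrete, the sketch below applies it in reverse to relight a shadowed patch. It assumes a per-channel form I_shadow = w · I_free + b, following the description above; the parameter values are illustrative, not taken from the paper.

```python
import numpy as np

def relight(shadow_img, w, b):
    """Invert the per-channel linear shadow model I_shadow = w * I_free + b.

    shadow_img: HxWx3 float array with values in [0, 1]
    w, b: length-3 arrays, one scale and one offset per color channel
    (illustrative values below, not parameters from the paper).
    """
    w = np.asarray(w, dtype=float)
    b = np.asarray(b, dtype=float)
    relit = (shadow_img - b) / w  # solve for the shadow-free intensity
    return np.clip(relit, 0.0, 1.0)

# Toy example: a uniformly darkened 4x4 patch.
shadow_patch = np.full((4, 4, 3), 0.3)
relit = relight(shadow_patch, w=[0.5, 0.55, 0.6], b=[0.02, 0.02, 0.02])
```

Because w scales each channel differently, relighting can restore color balance as well as brightness, which a single global gain could not.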
To operationalize this model, the authors introduce two deep neural networks: SP-Net, which predicts shadow parameters (scaling and shifting factors), and M-Net, responsible for calculating shadow mattes. The integration of these networks within the image decomposition framework allows for estimating shadow effects with improved accuracy and reduced artifacts in the final shadow-free images.
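One plausible reading of how the two network outputs combine is a soft blend: the relit image (from SP-Net's parameters) replaces the original only where the shadow matte (from M-Net) indicates shadow. The convention below — matte values near 1 inside the shadow — is an assumption for illustration; the paper's actual matte convention may differ.

```python
import numpy as np

def compose_shadow_free(shadow_img, relit_img, matte):
    """Blend the relit image into shadowed regions using a soft matte.

    shadow_img, relit_img: HxWx3 float arrays
    matte: HxW array in [0, 1]; assumed 1 inside the shadow, 0 outside
    (a hypothetical convention, not necessarily M-Net's output format).
    """
    alpha = matte[..., None]  # broadcast the matte over color channels
    return alpha * relit_img + (1.0 - alpha) * shadow_img
```

A soft matte is what lets the method avoid hard seams at shadow boundaries: penumbra pixels receive a partial mix of the original and relit intensities.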
Significant Numerical Results
Empirical evaluations highlight the method's efficacy. The authors report a roughly 40% reduction in RMSE, from 13.3 to 7.9, on the ISTD dataset compared to state-of-the-art methods. They further note that RMSE drops to 7.4 when training on an augmented dataset generated via their method.
Data Augmentation via Shadow Parameters
Another noteworthy contribution is the introduction of a data augmentation technique where shadow parameters are modified to generate synthetic images with varying shadow effects. This synthetic data can be instrumental in further refining the performance of the proposed model and is a promising direction for increasing robustness through diverse training samples.
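The augmentation idea can be sketched as follows: sample new per-channel parameters (w, b), darken a shadow-free image inside a given shadow mask with the same linear model, and emit the result as a synthetic training pair. The parameter ranges and the hard binary mask here are illustrative assumptions; the paper works with estimated parameters and soft mattes.

```python
import numpy as np

rng = np.random.default_rng(0)

def synthesize_shadow(free_img, mask, w_range=(0.4, 0.7), b_range=(0.0, 0.05)):
    """Darken the masked region with randomly sampled per-channel
    parameters, mimicking the idea of varying (w, b) to generate
    synthetic shadow images. Ranges are illustrative, not the paper's.

    free_img: HxWx3 shadow-free image in [0, 1]
    mask: HxW binary shadow mask (1 = shadow region)
    """
    w = rng.uniform(*w_range, size=3)
    b = rng.uniform(*b_range, size=3)
    shadowed = w * free_img + b               # linear shadow model
    m = mask[..., None]                       # hard mask for simplicity;
    return m * shadowed + (1 - m) * free_img  # the paper uses soft mattes

# Toy example: shadow the top half of a flat gray image.
img = np.full((4, 4, 3), 0.8)
mask = np.zeros((4, 4))
mask[:2] = 1.0
aug = synthesize_shadow(img, mask)
```

Each call yields a differently darkened version of the same scene, which is how varying the parameters diversifies the training set without new captures.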
Implications and Future Research
Practically, the proposed framework offers a robust pre-processing method for computer vision applications where shadow artifacts can detrimentally affect object recognition, tracking, and segmentation. From a theoretical standpoint, the model demonstrates the potential of integrating explicit physical modeling with data-driven approaches, suggesting future exploration of more complex illumination models to further enhance realism.
Looking ahead, the prospects for this research include advancing shadow image augmentation techniques and exploring unsupervised learning settings that might eliminate the necessity for precise shadow detection. Additionally, there is room for optimizing network architectures to improve shadow removal in real-time applications or on resource-constrained devices.
In conclusion, this paper adds substantial value to the field of shadow removal in computer vision, with clear pathways for practical application and theoretical expansion. While it leverages established ideas of illumination modeling, its innovative integration with deep learning sets it apart in the ongoing evolution of image processing technologies.