
Compound Multi-branch Feature Fusion for Real Image Restoration

Published 2 Jun 2022 in eess.IV, cs.CV, and cs.LG | (2206.02748v1)

Abstract: Image restoration is a challenging, ill-posed, and long-standing problem. However, most learning-based restoration methods target a single degradation type and therefore lack generalization. In this paper, we propose a multi-branch restoration model inspired by the Human Visual System (i.e., Retinal Ganglion Cells) that can perform multiple restoration tasks within a single general framework. Experiments show that the proposed multi-branch architecture, called CMFNet, achieves competitive results on four datasets covering image dehazing, deraindrop, and deblurring, which are common applications for autonomous cars. The source code and pretrained models for the three restoration tasks are available at https://github.com/FanChiMao/CMFNet.

Citations (7)

Summary

  • The paper introduces CMFNet, a novel multi-branch architecture inspired by retinal cells to integrate complementary features for restoring degraded images.
  • It leverages channel, spatial, and pixel attention mechanisms with a Mixed Skip Connection to effectively fuse information across deblurring, dehazing, and deraindrop tasks.
  • Evaluations across diverse datasets demonstrate that CMFNet consistently achieves competitive SSIM scores, notably scoring 0.533 on the D-Haze dehazing task.

Insightful Overview of "Compound Multi-branch Feature Fusion for Real Image Restoration"

The paper "Compound Multi-branch Feature Fusion for Real Image Restoration" introduces a novel methodology for improving image restoration tasks, addressing multiple types of degradation within a single framework. Developed by Chi-Mao Fan, Tsung-Jung Liu, and Kuan-Hsien Liu, this study presents CMFNet, a compound multi-branch architecture drawing inspiration from the Human Visual System (HVS), particularly Retinal Ganglion Cells (RGCs).

Conceptual Framework and Contributions

Image restoration is a complex computational task, often constrained to a single type of degradation when addressed with traditional models. The authors recognize the limitations of existing learning-based methods, particularly their lack of generalization to multiple degradation scenarios such as deblurring, dehazing, and deraining—tasks that are especially pertinent to enhancing camera sensor outputs in autonomous vehicles.

CMFNet employs a multi-branch architecture, with each branch modeled on a different type of RGC—namely, Parvocellular, Koniocellular, and Magnocellular cells—and designed to independently process complementary components of visual information. The network uses channel, spatial, and pixel attention blocks to simulate the distinct properties of these RGCs. Each branch outputs feature maps and a reconstructed image, which are integrated via a novel Mixed Skip Connection (MSC), further enhancing the network's flexibility. Moreover, the paper introduces a loss function combining PSNR and SSIM terms, which aligns well with human visual perception.
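To make the branch-and-fusion idea concrete, the following is a minimal NumPy sketch of the three gating styles (channel, spatial, and pixel attention) and a weighted mixed-skip fusion of the branch outputs. This is a toy illustration under stated assumptions, not the paper's implementation: CMFNet's actual attention blocks are learned convolutional modules inside a U-Net-style backbone, whereas here the gates are simple sigmoid statistics and the mixing weights are fixed numbers standing in for learned coefficients. All function names are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat):
    # Gate each channel by a sigmoid of its global spatial average
    # (squeeze-and-excite style; the real block learns this mapping).
    w = sigmoid(feat.mean(axis=(1, 2), keepdims=True))  # shape (C, 1, 1)
    return feat * w

def spatial_attention(feat):
    # Gate each spatial location by a sigmoid of the channel-wise mean.
    w = sigmoid(feat.mean(axis=0, keepdims=True))        # shape (1, H, W)
    return feat * w

def pixel_attention(feat):
    # Per-element gate: every (channel, pixel) entry gets its own weight.
    return feat * sigmoid(feat)

def mixed_skip_connection(branch_outputs, weights):
    # Weighted sum of the branch outputs; in CMFNet the mixing
    # coefficients are learnable, here they are fixed for illustration.
    weights = np.asarray(weights, dtype=np.float64)
    weights = weights / weights.sum()  # normalise mixing coefficients
    return sum(w * b for w, b in zip(weights, branch_outputs))

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 4, 4))  # toy (C, H, W) feature map
branches = [channel_attention(feat), spatial_attention(feat), pixel_attention(feat)]
fused = mixed_skip_connection(branches, [0.2, 0.3, 0.5])
print(fused.shape)  # (8, 4, 4)
```

The point of the sketch is only the structure: three branches apply attention of different granularity to the same features, and the fusion step blends them with normalized weights rather than picking a single branch.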

Methodological Rigor and Validation

The evaluation of CMFNet spans multiple datasets representing different restoration challenges—dense haze, raindrops, and motion blur—to assess the model's robustness and versatility. The results indicate that CMFNet achieves competitive, often superior, performance in terms of SSIM across these tasks, suggesting that the network produces images more aligned with human perceptual expectations. Notably, on the D-Haze dataset for image dehazing, CMFNet attained an SSIM score of 0.533, highlighting its capacity to handle real-world haze scenarios effectively.

Additionally, the proposed method demonstrated strong performance in the deraindrop task, achieving top-tier SSIM scores. Although CMFNet did not achieve the highest score in the deblurring task, its evaluation metrics remained competitive, illustrating the model's wide applicability to high-complexity real-world problems.

Implications and Future Directions

This research advances the domain of image restoration by presenting a unified framework capable of tackling diverse degradation types, a significant step forward in achieving real-time image optimization for autonomous systems. The results suggest potential enhancements in the preprocessing stages of high-level vision tasks, offering pathways to improve object detection and segmentation outcomes.

Future developments could explore optimizations of the model to balance accuracy and computational efficiency, possibly extending its applications to other restoration tasks like low-light enhancement and defocus blur correction. Furthermore, exploring scalable adaptations of CMFNet for more resource-limited environments, such as edge devices, could significantly broaden the impact and utility of this methodology.

In summary, this paper makes a valuable contribution to the field of computer vision, offering insights into cross-task generalization in image restoration and paving the way for further research into multi-branch networks inspired by biological visual processes.
