Complementary Random Masking for RGB-Thermal Semantic Segmentation (2303.17386v2)

Published 30 Mar 2023 in cs.CV, cs.AI, and cs.RO

Abstract: RGB-thermal semantic segmentation is one potential solution to achieve reliable semantic scene understanding in adverse weather and lighting conditions. However, the previous studies mostly focus on designing a multi-modal fusion module without consideration of the nature of multi-modality inputs. Therefore, the networks easily become over-reliant on a single modality, making it difficult to learn complementary and meaningful representations for each modality. This paper proposes 1) a complementary random masking strategy of RGB-T images and 2) self-distillation loss between clean and masked input modalities. The proposed masking strategy prevents over-reliance on a single modality. It also improves the accuracy and robustness of the neural network by forcing the network to segment and classify objects even when one modality is partially available. Also, the proposed self-distillation loss encourages the network to extract complementary and meaningful representations from a single modality or complementary masked modalities. Based on the proposed method, we achieve state-of-the-art performance over three RGB-T semantic segmentation benchmarks. Our source code is available at https://github.com/UkcheolShin/CRM_RGBTSeg.

References (49)

Authors (4)

Ukcheol Shin (16 papers)
Kyunghyun Lee (8 papers)
In So Kweon (156 papers)
Jean Oh (77 papers)

Citations (12)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - UkcheolShin/CRM_RGBTSeg: Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation." (57 stars)

Complementary Random Masking for RGB-Thermal Semantic Segmentation (2303.17386v2)

Summary

Related Papers

GitHub