Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation (2307.08536v1)

Published 17 Jul 2023 in cs.CV

Abstract: RGB-T semantic segmentation has been widely adopted to handle hard scenes with poor lighting conditions by fusing different modality features of RGB and thermal images. Existing methods try to find an optimal fusion feature for segmentation, resulting in sensitivity to modality noise, class-imbalance, and modality bias. To overcome the problems, this paper proposes a novel Variational Probabilistic Fusion Network (VPFNet), which regards fusion features as random variables and obtains robust segmentation by averaging segmentation results under multiple samples of fusion features. The random samples generation of fusion features in VPFNet is realized by a novel Variational Feature Fusion Module (VFFM) designed based on variation attention. To further avoid class-imbalance and modality bias, we employ the weighted cross-entropy loss and introduce prior information of illumination and category to control the proposed VFFM. Experimental results on MFNet and PST900 datasets demonstrate that the proposed VPFNet can achieve state-of-the-art segmentation performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Baihong Lin (3 papers)
  2. Zengrong Lin (2 papers)
  3. Yulan Guo (89 papers)
  4. Yulan Zhang (5 papers)
  5. Jianxiao Zou (4 papers)
  6. Shicai Fan (7 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.