Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Diffusion Features to Bridge Domain Gap for Semantic Segmentation (2406.00777v2)

Published 2 Jun 2024 in cs.CV and cs.AI

Abstract: Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this, our study delves into the utilization of the implicit knowledge embedded within diffusion models to address challenges in cross-domain semantic segmentation. This paper investigates the approach that leverages the sampling and fusion techniques to harness the features of diffusion models efficiently. We propose DIffusion Feature Fusion (DIFF) as a backbone use for extracting and integrating effective semantic representations through the diffusion process. By leveraging the strength of text-to-image generation capability, we introduce a new training framework designed to implicitly learn posterior knowledge from it. Through rigorous evaluation in the contexts of domain generalization semantic segmentation, we establish that our methodology surpasses preceding approaches in mitigating discrepancies across distinct domains and attains the state-of-the-art (SOTA) benchmark.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yuxiang Ji (8 papers)
  2. Boyong He (6 papers)
  3. Chenyuan Qu (3 papers)
  4. Zhuoyue Tan (6 papers)
  5. Chuan Qin (43 papers)
  6. Liaoni Wu (6 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.