Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation (2404.01102v2)

Published 1 Apr 2024 in eess.IV, cs.CV, and cs.LG

Abstract: Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality segmentation. However, many existing cross-modality image translation methods rely on supervised learning. In this work, we address the challenge of zero-shot learning-based image translation (the extreme scenario in which the target modality is unseen in the training phase). To leverage generative learning for zero-shot cross-modality image segmentation, we propose a novel unsupervised image translation method. The framework learns to translate the unseen source image to the target modality for image segmentation by leveraging the inherent statistical consistency between different modalities for diffusion guidance. Our framework captures identical cross-modality features in the statistical domain, offering diffusion guidance without relying on direct mappings between the source and target domains. This allows our method to adapt to changing source domains without retraining, making it highly practical when sufficient labeled source domain data is not available. The proposed framework is validated on zero-shot cross-modality image segmentation tasks through empirical comparisons with influential generative models, including adversarial-based and diffusion-based models.
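
To make "statistical consistency between modalities" concrete, here is a minimal sketch of one common statistical similarity measure: histogram-based mutual information. The abstract does not say which statistic the authors use, so the choice of mutual information, the function name `mutual_information`, and the synthetic images are assumptions made purely for illustration. The point is that such a measure stays high when two images share structure even if their intensities are related by an unknown, non-linear remapping, which is why it needs no direct source-to-target mapping.

```python
import numpy as np

def mutual_information(a, b, bins=32):
    """Plug-in estimate of mutual information (in nats) between two same-shaped images."""
    # Joint histogram of paired pixel intensities, normalised to a joint distribution.
    joint, _, _ = np.histogram2d(a.ravel(), b.ravel(), bins=bins)
    pxy = joint / joint.sum()
    # Marginals, kept 2-D so that px @ py is the outer product p(x) p(y).
    px = pxy.sum(axis=1, keepdims=True)
    py = pxy.sum(axis=0, keepdims=True)
    nz = pxy > 0  # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))

rng = np.random.default_rng(0)
source = rng.random((64, 64))
# Hypothetical "other modality": same structure, inverted and non-linearly remapped.
remapped = 1.0 - source ** 2
# Image with no shared structure.
unrelated = rng.random((64, 64))

mi_related = mutual_information(source, remapped)
mi_unrelated = mutual_information(source, unrelated)
print(mi_related > mi_unrelated)
```

In a guided diffusion sampler, a score of this kind could in principle be evaluated between the input image and the intermediate sample to steer generation; how the paper actually constructs its guidance is not specified in the abstract.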

