Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generative Adversarial Network in Medical Imaging: A Review (1809.07294v4)

Published 19 Sep 2018 in cs.CV and cs.LG

Abstract: Generative adversarial networks have gained a lot of attention in the computer vision community due to their capability of data generation without explicitly modelling the probability density function. The adversarial loss brought by the discriminator provides a clever way of incorporating unlabeled samples into training and imposing higher order consistency. This has proven to be useful in many cases, such as domain adaptation, data augmentation, and image-to-image translation. These properties have attracted researchers in the medical imaging community, and we have seen rapid adoption in many traditional and novel applications, such as image reconstruction, segmentation, detection, classification, and cross-modality synthesis. Based on our observations, this trend will continue and we therefore conducted a review of recent advances in medical imaging using the adversarial training scheme with the hope of benefiting researchers interested in this technique.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Xin Yi (37 papers)
  2. Ekta Walia (3 papers)
  3. Paul Babyn (11 papers)
Citations (1,288)

Summary

Generative Adversarial Network in Medical Imaging: A Review

The paper "Generative Adversarial Network in Medical Imaging: A Review" by Xin Yi, Ekta Walia, and Paul Babyn presents a comprehensive survey of the application of Generative Adversarial Networks (GANs) in the field of medical imaging. It meticulously categorizes the current literature based on primary medical imaging tasks and outlines both theoretical constructs and practical implementations. This review is pivotal for researchers aiming to understand the breadth of GAN applications in this specialized field.

Overview

GANs, since their inception, have shown remarkable potential in generating data without needing probabilistic models. This skill, primarily enabled through an adversarial loss structure brought forth by the discriminator, is leveraged to incorporate unlabeled samples and enforce higher-order consistency. This unique capability has spurred significant interest in diverse applications such as domain adaptation, data augmentation, and image-to-image translation within the medical imaging community.

Applications of GANs in Medical Imaging

Image Reconstruction

One of the prime applications of GANs in medical imaging is in image reconstruction. When clinical constraints limit image quality—such as in low-dose CT scans—GANs have proven effective in denoising and artifact removal. The majority of methods borrow from the Pix2Pix framework, utilizing paired data for tasks including CT, MR, and PET reconstruction. The paper details various optimization techniques, including incorporating perceptual loss and structure loss, to enhance this process.

Image Synthesis

With the dearth of labeled medical imaging data, GANs provide a robust mechanism for data augmentation. This proves crucial in overcoming challenges linked to patient privacy and rare disease representation. The review outlines the utilization of methods like DCGAN, WGAN, and PGGAN for synthetic image generation, particularly for generating lung nodules or liver lesions which are subsequently used to augment training datasets for classification tasks. The paper also explores the application of conditional GANs (cGANs) for tasks demanding cross-modality synthesis, transforming, for instance, MR images into CT-like images or vice versa to aid in comprehensive diagnoses without multiple acquisitions.

Segmentation

Segmentation tasks benefit from the regulation imposed by adversarial training, ensuring the spatial consistency of segmentation maps. The paper reviewed GANs' integration into U-net architectures, emphasizing the discriminator's role in offering shape regularization and enforcing higher-order consistency in segmentations. The strong numerical results emphasize GANs' enhanced accuracy over traditional methods, particularly in handling complex medical images such as liver CTs or brain MRIs.

Classification

The classification of medical images using GANs has primarily benefited from semi-supervised learning. The discriminator's ability to serve dual roles—as both a classifier and a discriminator—improves performance with limited labeled data. By leveraging this, studies have found improved sensitivity and specificity, particularly in differentiating between classes in medical datasets. GANs have addressed domain shift issues effectively in classification tasks over diversified datasets.

Abnormality Detection

The discriminator's capability to detect deviations from learned normal image distributions has been pivotal in detecting abnormalities. This review highlights the application of GANs in anomaly detection, particularly for retinal OCT images and brain MRIs. The unsupervised detection framework allows for identifying uncommon pathology without needing extensive labeled abnormal data.

Image Registration

GANs have been explored in facilitating multimodal image registration tasks. By generating transformation parameters or directly producing transformed images, GANs simplify registration workflows. The paper highlights the role of cGANs in tackling complex registration challenges between modalities like MR and TRUS, bolstering the fidelity and accuracy of these frameworks.

Implications and Future Directions

The practical implications of the reviewed research are manifold, ranging from enhanced diagnostic workflows to enriched training datasets for deep learning models. Theoretically, GANs provide a profound understanding of underlying data distributions and offer flexible frameworks capable of multi-faceted deployments. Future advancements might see GANs improving MR image quality, rendering high-fidelity synthetic datasets, and reducing diagnostic turnaround times.

One of the critical challenges moving forward lies in establishing robust evaluation metrics for generated medical images, ensuring they reflect clinical usability and diagnostic accuracy. Furthermore, addressing biases introduced during GAN training remains an open problem. Next-generation GAN models might need to incorporate localized attribute control to safeguard image fidelity, especially when altering pathology representations.

In conclusion, GANs exhibit extraordinary potential in the medical imaging landscape, driving both innovative research and practical applications. While challenges persist, the ongoing evolution of GAN frameworks underscores a promising trajectory for enhanced diagnostic and therapeutic capabilities in medical imaging.