Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

How Good Are Synthetic Medical Images? An Empirical Study with Lung Ultrasound (2310.03608v1)

Published 5 Oct 2023 in eess.IV and cs.CV

Abstract: Acquiring large quantities of data and annotations is known to be effective for developing high-performing deep learning models, but is difficult and expensive to do in the healthcare context. Adding synthetic training data using generative models offers a low-cost method to deal effectively with the data scarcity challenge, and can also address data imbalance and patient privacy issues. In this study, we propose a comprehensive framework that fits seamlessly into model development workflows for medical image analysis. We demonstrate, with datasets of varying size, (i) the benefits of generative models as a data augmentation method; (ii) how adversarial methods can protect patient privacy via data substitution; (iii) novel performance metrics for these use cases by testing models on real holdout data. We show that training with both synthetic and real data outperforms training with real data alone, and that models trained solely with synthetic data approach their real-only counterparts. Code is available at https://github.com/Global-Health-Labs/US-DCGAN.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Menghan Yu (5 papers)
  2. Sourabh Kulhare (4 papers)
  3. Courosh Mehanian (6 papers)
  4. Zohreh Laverriere (1 paper)
  5. Ishan Shah (1 paper)
  6. Charles B Delahunt (3 papers)
  7. Daniel E Shea (1 paper)
  8. Matthew P Horning (1 paper)
Citations (2)
Github Logo Streamline Icon: https://streamlinehq.com