Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The Beauty or the Beast: Which Aspect of Synthetic Medical Images Deserves Our Focus? (2305.09789v2)

Published 3 May 2023 in eess.IV and cs.CV

Abstract: Training medical AI algorithms requires large volumes of accurately labeled datasets, which are difficult to obtain in the real world. Synthetic images generated from deep generative models can help alleviate the data scarcity problem, but their effectiveness relies on their fidelity to real-world images. Typically, researchers select synthesis models based on image quality measurements, prioritizing synthetic images that appear realistic. However, our empirical analysis shows that high-fidelity and visually appealing synthetic images are not necessarily superior. In fact, we present a case where low-fidelity synthetic images outperformed their high-fidelity counterparts in downstream tasks. Our findings highlight the importance of comprehensive analysis before incorporating synthetic data into real-world applications. We hope our results will raise awareness among the research community of the value of low-fidelity synthetic images in medical AI algorithm training.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Xiaodan Xing (35 papers)
  2. Yang Nan (40 papers)
  3. Federico Felder (3 papers)
  4. Simon Walsh (16 papers)
  5. Guang Yang (422 papers)
Citations (5)