The Beauty or the Beast: Which Aspect of Synthetic Medical Images Deserves Our Focus? (2305.09789v2)

Published 3 May 2023 in eess.IV and cs.CV

Abstract: Training medical AI algorithms requires large volumes of accurately labeled data, which are difficult to obtain in the real world. Synthetic images generated from deep generative models can help alleviate the data scarcity problem, but their effectiveness relies on their fidelity to real-world images. Typically, researchers select synthesis models based on image quality measurements, prioritizing synthetic images that appear realistic. However, our empirical analysis shows that high-fidelity and visually appealing synthetic images are not necessarily superior. In fact, we present a case where low-fidelity synthetic images outperformed their high-fidelity counterparts in downstream tasks. Our findings highlight the importance of comprehensive analysis before incorporating synthetic data into real-world applications. We hope our results will raise awareness among the research community of the value of low-fidelity synthetic images in medical AI algorithm training.

Authors (5)

Xiaodan Xing (35 papers)
Yang Nan (40 papers)
Federico Felder (3 papers)
Simon Walsh (16 papers)
Guang Yang (422 papers)
