
Improving dermatology classifiers across populations using images generated by large diffusion models (2211.13352v1)

Published 23 Nov 2022 in eess.IV, cs.CV, and cs.LG

Abstract: Dermatological classification algorithms developed without sufficiently diverse training data may generalize poorly across populations. While intentional data collection and annotation offer the best means for improving representation, new computational approaches for generating training data may also aid in mitigating the effects of sampling bias. In this paper, we show that DALL·E 2, a large-scale text-to-image diffusion model, can produce photorealistic images of skin disease across skin types. Using the Fitzpatrick 17k dataset as a benchmark, we demonstrate that augmenting training data with DALL·E 2-generated synthetic images improves classification of skin disease overall and especially for underrepresented groups.

Authors (6)
  1. Luke W. Sagers (2 papers)
  2. James A. Diao (3 papers)
  3. Matthew Groh (20 papers)
  4. Pranav Rajpurkar (69 papers)
  5. Adewole S. Adamson (2 papers)
  6. Arjun K. Manrai (5 papers)
Citations (25)