Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN (2401.05159v1)
Abstract: This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data generation plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training. In this context, we aim to incorporate enhanced data transformation techniques by extending the recent success of few-shot learning and a small amount of data representation in text-to-image latent diffusion models. The optimally tuned model is further used for rendering high-quality skin lesion synthetic data with diverse and realistic characteristics, providing a valuable supplement and diversity to the existing training data. We investigate the impact of incorporating newly generated synthetic data into the training pipeline of state-of-art machine learning models, assessing its effectiveness in enhancing model performance and generalization to unseen real-world data. Our experimental results demonstrate the efficacy of the synthetic data generated through stable diffusion models helps in improving the robustness and adaptability of end-to-end CNN and vision transformer models on two different real-world skin lesion datasets.
- American Academy of Dermatology. Skin cancer statistics, 2023. https://www.aad.org/media/stats-skin-cancer, Last accessed on 27-Nov-2023.
- Cancer statistics, 2023. CA: A Cancer Journal for Clinicians, 73(1):17–48, 2023.
- Burden of skin cancer in colombia. International Journal of Dermatology, 61, 02 2022.
- Skin cancer disease detection using transfer learning technique. Applied Sciences, 12(11):5714, 2022.
- Non-invasive diagnostic techniques in pigmentary skin disorders and skin cancer. Journal of cosmetic dermatology, 21(2):444–450, 2022.
- Malignant skin neoplasms. Medical Clinics, 93(6):1241–1264, 2009.
- Extracting training data from diffusion models. In 32nd USENIX Security Symposium (USENIX Security 23), pages 5253–5270, 2023.
- Photorealistic text-to-image diffusion models with deep language understanding. Proc. Int. Conf. Neural Inf. Process. Syst., 35:36479–36494, 2022.
- Generalizing from a few examples: A survey on few-shot learning. ACM computing surveys (csur), 53(3):1–34, 2020.
- High-resolution image synthesis with latent diffusion models, 2021.
- Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22500–22510, 2023.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017.
- Artificial intelligence and machine learning algorithms for early detection of skin cancer in community and primary care settings: a systematic review. The Lancet Digital Health, 4(6):e466–e476, 2022.
- Automatic lesion detection system (alds) for skin cancer classification using svm and neural classifiers. In 2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE), pages 301–308. IEEE, 2016.
- State-of-the-art machine learning techniques for melanoma skin cancer detection and classification: a comprehensive review. Intelligent Medicine, 3(03):180–190, 2023.
- Exploring the strengths of pre-trained cnn models with machine learning techniques for skin cancer diagnosis. In 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon), pages 1–6. IEEE, 2022.
- International Skin Imaging Collaboration: Melanoma Project, 2023. https://isic-archive.com, Last accessed on 27-Nov-2023.
- Ph 2-a dermoscopic image database for research and benchmarking. In 2013 35th annual international conference of the IEEE engineering in medicine and biology society (EMBC), pages 5437–5440. IEEE, 2013.
- The ham10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Scientific data, 5(1):1–9, 2018.
- A Color and Texture Based Hierarchical K-NN Approach to the Classification of Non-melanoma Skin Lesions, volume 6, pages 63–86. 01 2013.
- Dermatology Information System, 2023. https://www.dermis.net/dermisroot/en/home/index.htm, Last accessed on 27-Nov-2023.
- Gans for medical image synthesis: An empirical study. Journal of Imaging, 9(3):69, 2023.
- Adaptive augmentation of medical data using independently conditional variational auto-encoders. IEEE transactions on medical imaging, 38(12):2807–2820, 2019.
- Diffusion models in medical imaging: A comprehensive survey. Medical Image Analysis, page 102846, 2023.
- Medical image synthesis for data augmentation and anonymization using generative adversarial networks. In Simulation and Synthesis in Medical Imaging: Third International Workshop, SASHIMI 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 16, 2018, Proceedings 3, pages 1–11. Springer, 2018.
- Medical image synthesis with deep convolutional adversarial networks. IEEE Transactions on Biomedical Engineering, 65(12):2720–2730, 2018.
- Diffusion-based data augmentation for skin disease classification: Impact across original medical datasets to fully synthetic images. arXiv preprint arXiv:2301.04802, 2023.
- Resvit: Residual vision transformers for multimodal medical image synthesis. IEEE Transactions on Medical Imaging, 41(10):2598–2614, 2022.
- Sketch guided and progressive growing gan for realistic and editable ultrasound image synthesis. Medical Image Analysis, 79:102461, 2022.
- Neural architecture search with a lightweight transformer for text-to-image synthesis. IEEE Transactions on Network Science and Engineering, 9(3):1567–1576, 2022.
- Medical diffusion on a budget: textual inversion for medical image generation. arXiv preprint arXiv:2303.13430, 2023.
- Finetuning of glide stable diffusion model for ai-based text-conditional image synthesis of dermoscopic images. Frontiers in Medicine, 10, 2023.
- Denoising diffusion probabilistic models. Proc. Int. Conf. Neural Inf. Process. Syst., 33:6840–6851, 2020.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022.
- Dullrazor®: A software approach to hair removal from images. Computers in biology and medicine, 27(6):533–543, 1997.
- LoRA: Low-rank adaptation of large language models. In Proc. Int. Conf. Learn. Representations, 2022.
- Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101, 2017.
- Samplers in Stable Diffusion, 2023. https://www.felixsanz.dev/articles/complete-guide-to-samplers-in-stable-diffusion#:~:text=DDPM%20(paper)%20(Denoising%20Diffusion,to%20achieve%20a%20decent%20result, Last accessed on 27-Nov-2023.
- A comprehensive survey on transfer learning. Proc. IEEE, 109(1):43–76, 2021.
- Philipp Tschandl. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions, 2018.
- Muhammad Ali Farooq (19 papers)
- Wang Yao (166 papers)
- Michael Schukat (9 papers)
- Mark A Little (1 paper)
- Peter Corcoran (54 papers)