Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Data Alignment for Zero-Shot Concept Generation in Dermatology AI (2404.13043v2)

Published 19 Apr 2024 in cs.CV, cs.CL, and cs.LG

Abstract: AI in dermatology is evolving at a rapid pace but the major limitation to training trustworthy classifiers is the scarcity of data with ground-truth concept level labels, which are meta-labels semantically meaningful to humans. Foundation models like CLIP providing zero-shot capabilities can help alleviate this challenge by leveraging vast amounts of image-caption pairs available on the internet. CLIP can be fine-tuned using domain specific image-caption pairs to improve classification performance. However, CLIP's pre-training data is not well-aligned with the medical jargon that clinicians use to perform diagnoses. The development of LLMs in recent years has led to the possibility of leveraging the expressive nature of these models to generate rich text. Our goal is to use these models to generate caption text that aligns well with both the clinical lexicon and with the natural human language used in CLIP's pre-training data. Starting with captions used for images in PubMed articles, we extend them by passing the raw captions through an LLM fine-tuned on the field's several textbooks. We find that using captions generated by an expressive fine-tuned LLM like GPT-3.5 improves downstream zero-shot concept classification performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. Differential diagnosis in dermatology. CRC Press, 2021.
  2. A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023, 2023.
  3. Dermatology e-book. Elsevier Health Sciences, 2012.
  4. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020.
  5. Skin lesion analysis toward melanoma detection: A challenge at the 2017 international symposium on biomedical imaging (isbi), hosted by the international skin imaging collaboration (isic). In 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018), pp.  168–172. IEEE, 2018.
  6. Instructblip: Towards general-purpose vision-language models with instruction tuning, 2023.
  7. Skincon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2022. URL https://openreview.net/forum?id=gud0qopqJc4.
  8. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pp.  248–255. Ieee, 2009.
  9. Clinical abcde rule for early melanoma detection. European Journal of Dermatology, 31(6):771–778, 2021.
  10. Link the head to the" beak": Zero shot learning from noisy text description at part precision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.  5640–5649, 2017.
  11. John SC English. General Dermatology. Atlas Medical Publishing Limited, 2007.
  12. Evaluating deep neural networks trained on clinical images in dermatology with the fitzpatrick 17k dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  1820–1828, 2021.
  13. Deep learning algorithms for detection of diabetic retinopathy in retinal fundus photographs: A systematic review and meta-analysis. Computer Methods and Programs in Biomedicine, 191:105320, 2020.
  14. Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Scientific data, 6(1):1–8, 2019.
  15. Fostering transparent medical image ai via an image-text foundation model grounded in medical literature. medRxiv, 2023.
  16. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  17. Concept bottleneck models. In International conference on machine learning, pp.  5338–5348. PMLR, 2020.
  18. Hallucinations in neural machine translation. 2018.
  19. Artificial intelligence in dermatology: past, present, and future, 2019.
  20. Medical image classification with convolutional neural network. In 2014 13th international conference on control automation robotics & vision (ICARCV), pp.  844–848. IEEE, 2014.
  21. Visual instruction tuning, 2023.
  22. Handbook of dermoscopy. CRC Press, 2006.
  23. Training language models to follow instructions with human feedback, 2022.
  24. Zest: Zero-shot learning from text descriptions using textual similarity and visual summarization. arXiv preprint arXiv:2010.03276, 2020.
  25. What does a platypus look like? generating customized prompts for zero-shot image classification. arXiv preprint arXiv:2209.03320, 2022.
  26. Language models are unsupervised multitask learners. OpenAI blog, 1(8):9, 2019.
  27. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning, pp.  8748–8763. PMLR, 2021.
  28. Top 50 Dermatology Case Studies for Primary Care. Springer, 2017.
  29. Expert-level detection of pathologies from unannotated chest x-ray images via self-supervised learning. Nature Biomedical Engineering, pp.  1–8, 2022.
  30. Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771, 2019.
  31. Deep convolutional neural network based medical image classification for disease diagnosis. Journal of Big Data, 6(1):1–18, 2019.
  32. Instruction tuning for large language models: A survey, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Soham Gadgil (9 papers)
  2. Mahtab Bigverdi (5 papers)

Summary

We haven't generated a summary for this paper yet.