Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies (2312.04344v2)

Published 7 Dec 2023 in cs.CL, cs.AI, cs.CV, and cs.LG

Abstract: OpenAI's latest large vision-LLM (LVLM), GPT-4V(ision), has piqued considerable interest for its potential in medical applications. Despite its promise, recent studies and internal reviews highlight its underperformance in specialized medical tasks. This paper explores the boundary of GPT-4V's capabilities in medicine, particularly in processing complex imaging data from endoscopies, CT scans, and MRIs etc. Leveraging open-source datasets, we assessed its foundational competencies, identifying substantial areas for enhancement. Our research emphasizes prompt engineering, an often-underutilized strategy for improving AI responsiveness. Through iterative testing, we refined the model's prompts, significantly improving its interpretative accuracy and relevance in medical imaging. From our comprehensive evaluations, we distilled 10 effective prompt engineering techniques, each fortifying GPT-4V's medical acumen. These methodical enhancements facilitate more reliable, precise, and clinically valuable insights from GPT-4V, advancing its operability in critical healthcare environments. Our findings are pivotal for those employing AI in medicine, providing clear, actionable guidance on harnessing GPT-4V's full diagnostic potential.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Pengcheng Chen (22 papers)
  2. Ziyan Huang (18 papers)
  3. Zhongying Deng (25 papers)
  4. Tianbin Li (20 papers)
  5. Yanzhou Su (26 papers)
  6. Haoyu Wang (309 papers)
  7. Jin Ye (38 papers)
  8. Yu Qiao (563 papers)
  9. Junjun He (77 papers)
Citations (1)