GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI (2411.14522v1)

Published 21 Nov 2024 in cs.CV

Abstract: Despite significant advancements in general artificial intelligence, such as GPT-4, the effectiveness of such models in the medical domain (general medical AI, GMAI) remains constrained by the absence of specialized medical knowledge. To address this challenge, we present GMAI-VL-5.5M, a comprehensive multimodal medical dataset created by converting hundreds of specialized medical datasets into meticulously constructed image-text pairs. This dataset features comprehensive task coverage, diverse modalities, and high-quality image-text data. Building upon this multimodal dataset, we propose GMAI-VL, a general medical vision-language model with a progressive three-stage training strategy. This approach significantly enhances the model's ability by integrating visual and textual information, thereby improving its ability to process multimodal data and support accurate diagnosis and clinical decision-making. Experimental evaluations demonstrate that GMAI-VL achieves state-of-the-art results across a wide range of multimodal medical tasks, such as visual question answering and medical image diagnosis. Our contributions include the development of the GMAI-VL-5.5M dataset, the introduction of the GMAI-VL model, and the establishment of new benchmarks in multiple medical domains. Code and dataset will be released at https://github.com/uni-medical/GMAI-VL.

Authors (18)
  1. Tianbin Li (20 papers)
  2. Yanzhou Su (26 papers)
  3. Wei Li (1121 papers)
  4. Bin Fu (74 papers)
  5. Zhe Chen (237 papers)
  6. Ziyan Huang (18 papers)
  7. Guoan Wang (13 papers)
  8. Chenglong Ma (18 papers)
  9. Ying Chen (333 papers)
  10. Ming Hu (110 papers)
  11. Yanjun Li (56 papers)
  12. Pengcheng Chen (22 papers)
  13. Xiaowei Hu (54 papers)
  14. Zhongying Deng (25 papers)
  15. Yuanfeng Ji (20 papers)
  16. Jin Ye (38 papers)
  17. Yu Qiao (563 papers)
  18. Junjun He (77 papers)
Citations (1)