Attainability of Med-AGI via Multimodal Foundation Models
Determine whether scaling large multimodal foundation models, including vision–language models applied to surgical image analysis, can lead to Medical Artificial General Intelligence (Med-AGI) capable of functioning in operative surgical settings.
References
Despite progress on visual tasks such as surgery, whether these models would lead to Med-AGI is an open question.
— A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI
(2603.27341 - Skobelev et al., 28 Mar 2026) in Section 1, Introduction