Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects (2311.02332v5)

Published 4 Nov 2023 in cs.LG and cs.CV

Abstract: Machine learning (ML) applications in medical AI systems have shifted from traditional and statistical methods to increasing application of deep learning models. This survey navigates the current landscape of multimodal ML, focusing on its profound impact on medical image analysis and clinical decision support systems. Emphasizing challenges and innovations in addressing multimodal representation, fusion, translation, alignment, and co-learning, the paper explores the transformative potential of multimodal models for clinical predictions. It also highlights the need for principled assessments and practical implementation of such models, bringing attention to the dynamics between decision support systems and healthcare providers and personnel. Despite advancements, challenges such as data biases and the scarcity of "big data" in many biomedical domains persist. We conclude with a discussion on principled innovation and collaborative efforts to further the mission of seamless integration of multimodal ML models into biomedical practice.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Elisa Warner (1 paper)
  2. Joonsang Lee (2 papers)
  3. William Hsu (25 papers)
  4. Tanveer Syeda-Mahmood (34 papers)
  5. Charles Kahn (3 papers)
  6. Olivier Gevaert (22 papers)
  7. Arvind Rao (15 papers)
Citations (4)