Deep Multi-modal Fusion of Image and Non-image Data in Disease Diagnosis and Prognosis: A Review (2203.15588v3)

Published 25 Mar 2022 in cs.LG, cs.AI, and cs.CV

Abstract: The rapid development of diagnostic technologies in healthcare is placing higher demands on physicians to handle and integrate the heterogeneous yet complementary data produced during routine practice. For instance, personalized diagnosis and treatment planning for a single cancer patient relies on various images (e.g., radiological, pathological, and camera images) and non-image data (e.g., clinical and genomic data). However, such decision-making procedures can be subjective and qualitative, with large inter-subject variability. With recent advances in multi-modal deep learning, an increasing number of efforts have been devoted to a key question: how do we extract and aggregate multi-modal information to ultimately provide more objective, quantitative computer-aided clinical decision making? This paper reviews recent studies that address this question. Briefly, the review covers (1) an overview of current multi-modal learning workflows, (2) a summary of multi-modal fusion methods, (3) a discussion of performance, (4) applications in disease diagnosis and prognosis, and (5) challenges and future directions.

Authors (9)
  1. Can Cui (95 papers)
  2. Haichun Yang (46 papers)
  3. Yaohong Wang (15 papers)
  4. Shilin Zhao (20 papers)
  5. Zuhayr Asad (16 papers)
  6. Lori A. Coburn (10 papers)
  7. Keith T. Wilson (9 papers)
  8. Bennett A. Landman (123 papers)
  9. Yuankai Huo (160 papers)
Citations (63)