Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models (2407.12309v1)

Published 17 Jul 2024 in cs.CL

Abstract: Electronic health records (EHRs) are multimodal by nature, consisting of structured tabular features like lab tests and unstructured clinical notes. In real-life clinical practice, doctors use complementary multimodal EHR data sources to get a clearer picture of patients' health and support clinical decision-making. However, most EHR predictive models do not reflect these procedures, as they either focus on a single modality or overlook the inter-modality interactions/redundancy. In this work, we propose MEDFuse, a Multimodal EHR Data Fusion framework that incorporates masked lab-test modeling and LLMs to effectively integrate structured and unstructured medical data. MEDFuse leverages multimodal embeddings extracted from two sources: LLMs fine-tuned on free clinical text and masked tabular transformers trained on structured lab test results. We design a disentangled transformer module, optimized by a mutual information loss to 1) decouple modality-specific and modality-shared information and 2) extract useful joint representation from the noise and redundancy present in clinical notes. Through comprehensive validation on the public MIMIC-III dataset and the in-house FEMH dataset, MEDFuse demonstrates great potential in advancing clinical predictions, achieving over 90% F1 score in the 10-disease multi-label classification task.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Thao Minh Nguyen Phan (1 paper)
  2. Cong-Tinh Dao (4 papers)
  3. Chenwei Wu (23 papers)
  4. Jian-Zhe Wang (2 papers)
  5. Shun Liu (9 papers)
  6. Jun-En Ding (14 papers)
  7. David Restrepo (11 papers)
  8. Feng Liu (1212 papers)
  9. Fang-Ming Hung (5 papers)
  10. Wen-Chih Peng (47 papers)
Citations (1)
X Twitter Logo Streamline Icon: https://streamlinehq.com