Multimodal Fusion of EHR in Structures and Semantics: Integrating Clinical Records and Notes with Hypergraph and LLM (2403.08818v1)
Abstract: Electronic Health Records (EHRs) have become increasingly popular to support clinical decision-making and healthcare in recent decades. EHRs usually contain heterogeneous information, such as structural data in tabular form and unstructured data in textual notes. Different types of information in EHRs can complement each other and provide a more complete picture of the health status of a patient. While there has been a lot of research on representation learning of structured EHR data, the fusion of different types of EHR data (multimodal fusion) is not well studied. This is mostly because of the complex medical coding systems used and the noise and redundancy present in the written notes. In this work, we propose a new framework called MINGLE, which integrates both structures and semantics in EHR effectively. Our framework uses a two-level infusion strategy to combine medical concept semantics and clinical note semantics into hypergraph neural networks, which learn the complex interactions between different types of data to generate visit representations for downstream prediction. Experiment results on two EHR datasets, the public MIMIC-III and private CRADLE, show that MINGLE can effectively improve predictive performance by 11.83% relatively, enhancing semantic integration as well as multimodal fusion for structural and textual EHR data.
- Hypergraph convolution and hypergraph attention. Pattern Recognition 110 (2021), 107637.
- Hypergraph Contrastive Learning for Electronic Health Records. In SDM. SIAM, 127–135.
- Learning the graphical structure of electronic health records with graph convolutional transformer. In AAAI.
- Prevalence of cardiovascular disease in type 2 diabetes: a systematic literature review of scientific evidence from across the world in 2007–2017. Cardiovascular diabetology 17 (2018), 1–19.
- Hypergraph neural networks. In AAAI, Vol. 33. 3558–3565.
- Multitask learning and benchmarking with clinical time series data. Scientific data 6, 1 (2019), 1–18.
- MIMIC-III, a freely accessible critical care database. Scientific data 3, 1 (2016), 1–9.
- Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction of clinical events using multimodal longitudinal electronic health records. IEEE journal of biomedical and health informatics 27 (2022), 1106–1117.
- Multimodal data matters: language model pre-training over structured and unstructured electronic health records. IEEE Journal of Biomedical and Health Informatics 27 (2022), 504–514.
- Juan G Diaz Ochoa and Faizan E Mustafa. 2022. Graph neural network modelling as a potentially effective method for predicting and analyzing procedures based on patients’ diagnoses. Artificial Intelligence in Medicine (2022), 102359.
- Deepwalk: Online learning of social representations. In KDD.
- Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. BHI 22 (2017), 1589–1604.
- Graph Attention Networks. In ICLR.
- Hypergraph Transformers for EHR-based Clinical Predictions. AMIA (2023).
- Hypergcn: A new method for training graph convolutional networks on hypergraphs. NeurIPS (2019).
- Hejie Cui (33 papers)
- Xinyu Fang (20 papers)
- Ran Xu (89 papers)
- Xuan Kan (18 papers)
- Joyce C. Ho (32 papers)
- Carl Yang (130 papers)