SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals (2405.17766v1)
Abstract: Sleep is a complex physiological process evaluated through various modalities recording electrical brain, cardiac, and respiratory activities. We curate a large polysomnography dataset from over 14,000 participants comprising over 100,000 hours of multi-modal sleep recordings. Leveraging this extensive dataset, we developed SleepFM, the first multi-modal foundation model for sleep analysis. We show that a novel leave-one-out approach for contrastive learning significantly improves downstream task performance compared to representations from standard pairwise contrastive learning. A logistic regression model trained on SleepFM's learned embeddings outperforms an end-to-end trained convolutional neural network (CNN) on sleep stage classification (macro AUROC 0.88 vs 0.72 and macro AUPRC 0.72 vs 0.48) and sleep disordered breathing detection (AUROC 0.85 vs 0.69 and AUPRC 0.77 vs 0.61). Notably, the learned embeddings achieve 48% top-1 average accuracy in retrieving the corresponding recording clips of other modalities from 90,000 candidates. This work demonstrates the value of holistic multi-modal sleep modeling to fully capture the richness of sleep recordings. SleepFM is open source and available at https://github.com/rthapa84/sleepfm-codebase.
- Learning to exploit temporal structure for biomedical vision-language processing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15016–15027, 2023.
- The AASM manual for the scoring of sleep and associated events. Rules, Terminology and Technical Specifications, Darien, Illinois, American Academy of Sleep Medicine, 176:2012, 2012.
- Automatic signal abnormality detection using time-frequency features and machine learning: A newborn EEG seizure case study. Knowledge-Based Systems, 106:38–50, 2016.
- Making the most of text semantics to improve biomedical vision–language processing. In European conference on computer vision, pp. 1–21. Springer, 2022.
- Age estimation from sleep studies using deep learning predicts life expectancy. NPJ digital medicine, 5(1):103, 2022.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pp. 1597–1607. PMLR, 2020.
- You snooze, you win: the physionet/computing in cardiology challenge 2018. In 2018 Computing in Cardiology Conference (CinC), volume 45, pp. 1–4. IEEE, 2018.
- 3kg: Contrastive learning of 12-lead electrocardiograms using physiologically-inspired augmentations. In Machine Learning for Health, pp. 156–167. PMLR, 2021.
- Convolutional neural networks on multiple respiratory channels to detect hypopnea and obstructive apnea events. In 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE, 2018.
- Automated identification of sleep states from EEG signals by means of ensemble empirical mode decomposition and random under sampling boosting. Computer methods and programs in biomedicine, 140:201–210, 2017.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9729–9738, 2020.
- Gloria: A multimodal global-local representation learning framework for label-efficient medical image recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3942–3951, 2021.
- Supervised contrastive learning. Advances in neural information processing systems, 33:18661–18673, 2020.
- Clocs: Contrastive learning of cardiac signals across space, time, and patients. In International Conference on Machine Learning, pp. 5606–5615. PMLR, 2021.
- Principles and practice of sleep medicine fifth edition, 2010.
- ECG representation learning with multi-modal EHR data. Transactions on Machine Learning Research, 2023.
- Living to dream—reply. JAMA neurology, 78(4):495–496, 2021.
- A deep learning method approach for sleep stage classification with eeg spectrogram. International Journal of Environmental Research and Public Health, 19(10):6322, 2022.
- Visual language pretrained multiple instance zero-shot transfer for histopathology images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 19764–19775, 2023.
- Cascaded lstm recurrent neural network for automated sleep stage classification using single-channel eeg signals. Computers in biology and medicine, 106:71–81, 2019.
- Multi-objective hyperparameter optimization of convolutional neural network for obstructive sleep apnea detection. IEEE Access, 8:129586–129599, 2020.
- Sleepeegnet: Automated sleep stage scoring with sequence to sequence deep learning approach. PloS one, 14(5):e0216456, 2019.
- Automated scoring of respiratory events in sleep with a single effort belt and deep neural networks. IEEE transactions on biomedical engineering, 69(6):2094–2104, 2021.
- CSLP-AE: A contrastive split-latent permutation autoencoder framework for zero-shot electroencephalography signal conversion. arXiv preprint arXiv:2311.07788, 2023.
- Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
- Electrocardiographic deep learning for predicting post-procedural mortality. arXiv preprint arXiv:2205.03242, 2022.
- U-sleep: Resilient high-frequency sleep staging. NPJ digital medicine, 4 (1), 72, 2021.
- Seqsleepnet: end-to-end hierarchical recurrent neural network for sequence-to-sequence automatic sleep staging. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 27(3):400–410, 2019.
- Xsleepnet: Multi-view sequential model for automatic sleep staging. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(9):5903–5915, 2021.
- Learning transferable visual models from natural language supervision. In International conference on machine learning, pp. 8748–8763. PMLR, 2021.
- Contrastive pre-training for multimodal medical time series. In NeurIPS 2022 Workshop on Learning from Time Series for Health, 2022.
- Detection of sleep apnea using machine learning algorithms based on ECG signals: A comprehensive systematic review. Expert Systems with Applications, 187:115950, 2022.
- Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520, 2018.
- Intra-and inter-epoch temporal context network (iitnet) using sub-epoch features for automatic sleep scoring on raw single-channel eeg. Biomedical signal processing and control, 61:102037, 2020.
- A convolutional neural network for sleep stage scoring from raw single-channel EEG. Biomedical Signal Processing and Control, 42:107–114, 2018.
- Neural network analysis of sleep stages enables efficient diagnosis of narcolepsy. Nature communications, 9(1):5229, 2018.
- Deepsleepnet: A model for automatic sleep stage scoring based on raw single-channel EEG. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 25(11):1998–2008, 2017.
- Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pp. 6105–6114. PMLR, 2019.
- Automated sleep apnea detection from cardio-pulmonary signal using bivariate fast and adaptive emd coupled with cross time–frequency analysis. Computers in Biology and Medicine, 120:103769, 2020.
- Automatic sleep stage scoring with single-channel EEG using convolutional neural networks. arxiv 2016. arXiv preprint arXiv:1610.01683.
- Automatic sleep stage scoring using time-frequency analysis and stacked sparse autoencoders. Annals of biomedical engineering, 44:1587–1597, 2016.
- Automatic detection of sleep-disordered breathing events using recurrent neural networks from an electrocardiogram signal. Neural computing and applications, 32:4733–4742, 2020.
- Worley, S. L. The extraordinary importance of sleep: the detrimental effects of inadequate sleep on health and public safety drive an explosion of sleep research. Pharmacy and Therapeutics, 43(12):758, 2018.
- Respiratory event detection during sleep using electrocardiogram and respiratory related signals: Using polysomnogram and patch-type wearable device data. IEEE Journal of Biomedical and Health Informatics, 26(2):550–560, 2021.
- A deep learning model for automated sleep stages classification using PSG signals. International journal of environmental research and public health, 16(4):599, 2019.
- A sleep apnea-hypopnea syndrome automatic detection and subtype classification method based on LSTM-CNN. Biomedical Signal Processing and Control, 71:103240, 2022.
- MSED: A multi-modal sleep event detection model for clinical sleep analysis. IEEE Transactions on Biomedical Engineering, 2023.
- Contrastive learning of medical visual representations from paired images and text. In Machine Learning for Healthcare Conference, pp. 2–25. PMLR, 2022.
- Classification of sleep apnea based on EEG sub-band signal characteristics. Scientific Reports, 11(1):5824, 2021.