Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram (2402.09450v3)
Abstract: Electrocardiograms (ECG) are widely employed as a diagnostic tool for monitoring electrical signals originating from a heart. Recent machine learning research efforts have focused on the application of screening various diseases using ECG signals. However, adapting to the application of screening disease is challenging in that labeled ECG data are limited. Achieving general representation through self-supervised learning (SSL) is a well-known approach to overcome the scarcity of labeled data; however, a naive application of SSL to ECG data, without considering the spatial-temporal relationships inherent in ECG signals, may yield suboptimal results. In this paper, we introduce ST-MEM (Spatio-Temporal Masked Electrocardiogram Modeling), designed to learn spatio-temporal features by reconstructing masked 12-lead ECG data. ST-MEM outperforms other SSL baseline methods in various experimental settings for arrhythmia classification tasks. Moreover, we demonstrate that ST-MEM is adaptable to various lead combinations. Through quantitative and qualitative analysis, we show a spatio-temporal relationship within ECG data. Our code is available at https://github.com/bakqui/ST-MEM.
- A public domain dataset for human activity recognition using smartphones. In Esann, volume 3, pp. 3, 2013.
- Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020.
- Emerging properties in self-supervised vision transformers. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 9650–9660, 2021.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pp. 1597–1607. PMLR, 2020.
- Af classification from a short single lead ecg recording: The physionet/computing in cardiology challenge 2017. In 2017 Computing in Cardiology (CinC), pp. 1–4. IEEE, 2017.
- Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Accuracy and knowledge in 12-lead ecg placement among nursing students and nurses: A web-based italian study. Acta Bio Medica: Atenei Parmensis, 91(Suppl 12), 2020.
- 3kg: Contrastive learning of 12-lead electrocardiograms using physiologically-inspired augmentations. In Machine Learning for Health, pp. 156–167. PMLR, 2021.
- Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems, 33:21271–21284, 2020.
- Automated detection of acute myocardial infarction using asynchronous electrocardiogram signals—preview of implementing artificial intelligence with multichannel electrocardiographs obtained from smartwatches: retrospective study. Journal of Medical Internet Research, 23(9):e31129, 2021.
- Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nature medicine, 25(1):65–69, 2019.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9729–9738, 2020.
- Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 16000–16009, 2022.
- A transformer-based deep neural network for arrhythmia detection using continuous ecg signals. Computers in Biology and Medicine, 144:105325, 2022.
- Spatiotemporal self-supervised representation learning from multi-lead ecg signals. Biomedical Signal Processing and Control, 84:104772, 2023.
- Self-supervised visual feature learning with deep neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence, 43(11):4037–4058, 2020.
- Ecg arrhythmia classification using a 2-d convolutional neural network. arXiv preprint arXiv:1804.06812, 2018.
- Clocs: Contrastive learning of cardiac signals across space, time, and patients. In International Conference on Machine Learning, pp. 5606–5615. PMLR, 2021.
- Practical intelligent diagnostic algorithm for wearable 12-lead ecg via self-supervised learning on large-scale dataset. Nature Communications, 14(1):3741, 2023.
- Intra-inter subject self-supervised learning for multivariate cardiac signals. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pp. 4532–4540, 2022.
- Towards better time series contrastive learning: A dynamic bad pair mining approach. arXiv preprint arXiv:2302.03357, 2023.
- scl-st: Supervised contrastive learning with semantic transformations for multiple lead ecg arrhythmia classification. IEEE journal of biomedical and health informatics, 2023.
- Using the apple watch to record multiple-lead electrocardiograms in detecting myocardial infarction: Where are we now? Texas Heart Institute Journal, 49(4):e227845, 2022.
- Toward improving ecg biometric identification using cascaded convolutional neural networks. Neurocomputing, 391:83–95, 2020.
- An open access database for evaluating the algorithms of electrocardiogram rhythm and morphology abnormality detection. Journal of Medical Imaging and Health Informatics, 8(7):1368–1373, 2018.
- A novel application of deep learning for single-lead ecg classification. Computers in biology and medicine, 99:53–62, 2018.
- Self-supervised representation learning from 12-lead ecg data. Computers in biology and medicine, 141:105114, 2022.
- Data augmentation for electrocardiogram classification with deep neural network. arXiv preprint arXiv:2009.04398, 2020.
- Lead-agnostic self-supervised learning for local and global representations of electrocardiogram. In Conference on Health, Inference, and Learning, pp. 338–353. PMLR, 2022.
- Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
- What do self-supervised vision transformers learn? In The Eleventh International Conference on Learning Representations, 2022.
- Mvkt-ecg: Efficient single-lead ecg classification on multi-label arrhythmia by multi-view knowledge transferring. arXiv preprint arXiv:2301.12178, 2023.
- Carl Rasmussen. The infinite gaussian mixture model. Advances in neural information processing systems, 12, 1999.
- Will two do? varying dimensions in electrocardiography: the physionet/computing in cardiology challenge 2021. In 2021 Computing in Cardiology (CinC), volume 48, pp. 1–4. IEEE, 2021.
- Code-15%: A large scale annotated dataset of 12-lead ecgs. Zenodo, Jun, 9, 2021.
- Automatic diagnosis of the 12-lead ecg using a deep neural network. Nature communications, 11(1):1760, 2020.
- Self-supervised ecg representation learning for emotion recognition. IEEE Transactions on Affective Computing, 13(3):1541–1554, 2020.
- Masked autoencoder-based self-supervised learning for electrocardiograms to detect left ventricular systolic dysfunction. In NeurIPS 2022 Workshop on Learning from Time Series for Health, 2022.
- Artificial intelligence-enhanced electrocardiography in cardiovascular disease management. Nature Reviews Cardiology, 18(7):465–478, 2021.
- Analysis of augmentations for contrastive ecg representation learning. In 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–10. IEEE, 2022.
- Deep learning for ecg analysis: Benchmarks and insights from ptb-xl. IEEE Journal of Biomedical and Health Informatics, 25(5):1519–1528, 2020.
- Deep learning for automatic detection of periodic limb movement disorder based on electrocardiogram signals. Diagnostics, 12(9):2149, 2022.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(11), 2008.
- Ptb-xl, a large publicly available electrocardiography dataset. Scientific data, 7(1):154, 2020.
- Contrastive heartbeats: Contrastive learning for self-supervised ecg representation and phenotyping. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1126–1130. IEEE, 2022.
- Simmim: A simple framework for masked image modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9653–9663, 2022.
- Maefe: Masked autoencoders family of electrocardiogram for self-supervised pretraining and transfer learning. IEEE Transactions on Instrumentation and Measurement, 72:1–15, 2022.
- Optimal multi-stage arrhythmia classification approach. Scientific reports, 10(1):2898, 2020a.
- A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Scientific data, 7(1):48, 2020b.
- Yeongyeon Na (1 paper)
- Minje Park (9 papers)
- Yunwon Tae (5 papers)
- Sunghoon Joo (1 paper)