Scaling convolutional neural networks achieves expert-level seizure detection in neonatal EEG (2405.09911v1)
Abstract: Background: Neonatal seizures are a neurological emergency that require urgent treatment. They are hard to diagnose clinically and can go undetected if EEG monitoring is unavailable. EEG interpretation requires specialised expertise which is not widely available. Algorithms to detect EEG seizures can address this limitation but have yet to reach widespread clinical adoption. Methods: Retrospective EEG data from 332 neonates was used to develop and validate a seizure-detection model. The model was trained and tested with a development dataset ($n=202$) that was annotated with over 12k seizure events on a per-channel basis. This dataset was used to develop a convolutional neural network (CNN) using a modern architecture and training methods. The final model was then validated on two independent multi-reviewer datasets ($n=51$ and $n=79$). Results: Increasing dataset and model size improved model performance: Matthews correlation coefficient (MCC) and Pearson's correlation ($r$) increased by up to 50% with data scaling and up to 15% with model scaling. Over 50k hours of annotated single-channel EEG was used for training a model with 21 million parameters. State-of-the-art was achieved on an open-access dataset (MCC=0.764, $r=0.824$, and AUC=0.982). The CNN attains expert-level performance on both held-out validation sets, with no significant difference in inter-rater agreement among the experts and among experts and algorithm ($\Delta \kappa < -0.095$, $p>0.05$). Conclusion: With orders of magnitude increases in data and model scale we have produced a new state-of-the-art model for neonatal seizure detection. Expert-level equivalence on completely unseen data, a first in this field, provides a strong indication that the model is ready for further clinical validation.
- The current etiologic profile and neurodevelopmental outcome of seizures in term newborn infants. Pediatrics, 117(4):1270–1280, April 2006. doi: 10.1542/peds.2005-1178.
- Soul JS. Acute symptomatic seizures in term neonates: Etiologies and treatments. Semin Fetal Neonatal Med, 23(3):183–190, June 2018. doi: 10.1016/j.siny.2018.02.002.
- Seizures in the neonate: A review of etiologies and outcomes. Seizure, 85:48–56, February 2021. doi: 10.1016/j.seizure.2020.12.023.
- Characterisation of neonatal seizures and their treatment using continuous EEG monitoring: a multicentre experience. Arch Dis Child Fetal Neonatal Ed, 104(5):F493–F501, November 2018. doi: 10.1136/archdischild-2018-315624.
- Outcome following neonatal seizures. Semin Fetal Neonatal Med, 18(4):224–232, August 2013. doi: 10.1016/j.siny.2013.01.002.
- Treating EEG seizures in hypoxic ischemic encephalopathy: A randomized controlled trial. Pediatrics, 136(5):e1302–e1309, November 2015. doi: 10.1542/peds.2014-3777.
- Defining the gap between electrographic seizure burden, clinical expression and staff recognition of neonatal seizures. Arch Dis Child Fetal Neonatal Ed, 93(3):F187–F191, 2008. doi: 10.1136/adc.2005.086314.
- Neonatal seizure management: Is the timing of treatment critical? J Pediatr, 243:61–68.e2, April 2022. doi: 10.1016/j.jpeds.2021.09.058.
- A machine-learning algorithm for neonatal seizure recognition: a multicentre, randomised, controlled trial. Lancet Child Adolesc Health, 4(10):740–749, 2020. doi: 10.1016/S2352-4642(20)30239-X.
- EEG-based neonatal seizure detection with support vector machines. Clin Neurophysiol, 122(3):464–473, 2011. doi: 10.1016/j.clinph.2010.06.034.
- Neonatal seizure detection using deep convolutional neural networks. Int J Neural Syst, 29(04):1850011, May 2019. doi: 10.1142/s0129065718500119.
- Attention-based network for weak labels in neonatal seizure detection. In Proc Machine Learning Healthcare Conf, volume 126, pages 479–507. PMLR, 07–08 Aug 2020. URL https://proceedings.mlr.press/v126/isaev20a.html.
- Neonatal seizure detection from raw multi-channel EEG using a fully convolutional architecture. Neural Netw, 123:12–25, March 2020. doi: 10.1016/j.neunet.2019.11.023.
- Towards deeper neural networks for neonatal seizure detection. In Int Conf Eng Med Bio Soc (EMBC). IEEE, 2021. doi: 10.1109/EMBC46164.2021.9629485.
- Transfer learning to detect neonatal seizure from electroencephalography signals. Neural Comput Appl, pages 12087–12101, March 2021. doi: 10.1007/s00521-021-05878-y.
- Convolutional neural networks ensemble model for neonatal seizure detection. J Neurosci Methods, 358:109197, July 2021. doi: 10.1016/j.jneumeth.2021.109197.
- A deep learning framework for epileptic seizure detection based on neonatal EEG signals. Sci Rep, 12(1), July 2022. doi: 10.1038/s41598-022-15830-2.
- A graph convolutional neural network for the automated detection of seizures in the neonatal EEG. Comput Methods Programs Biomed, 222:106950, July 2022. doi: 10.1016/j.cmpb.2022.106950.
- Bridging the source-target mismatch with pseudo labeling for neonatal seizure detection. In Eur Signal Proc Conf (EUSIPCO). IEEE, 2023. doi: 10.23919/EUSIPCO58844.2023.10290015.
- A class-imbalance aware and explainable spatio-temporal graph attention network for neonatal seizure detection. Int J Neural Syst, 33(09), July 2023. doi: 10.1142/s0129065723500466.
- A dataset of neonatal EEG recordings with seizure annotations. Sci Data, 6(1):1–8, 2019a. doi: 10.1038/sdata.2019.39.
- Sleep–wake cycle of the healthy term newborn infant in the immediate postnatal period. Clin Neurophysiol, 127(4):2095–2101, 2016. doi: 10.1016/j.clinph.2015.12.015.
- Interobserver agreement for neonatal seizure detection using multichannel EEG. Ann Clin Transl Neurol, 2(11):1002–1011, 2015. doi: 10.1002/acn3.249.
- A ConvNet for the 2020s. In Proc IEEE/CVF Int Conf Computer Vision (CVPR), pages 11966–11976, 2022. doi: 10.1109/CVPR52688.2022.01167.
- AUC: a misleading measure of the performance of predictive distribution models. Glob Ecol Biogeogr, 17(2):145–151, September 2007. doi: 10.1111/j.1466-8238.2007.00358.x.
- The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLOS One, 10(3):e0118432, 2015. doi: 10.1371/journal.pone.0118432.
- The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification. BioData Min, 16(1), February 2023. doi: 10.1186/s13040-023-00322-4.
- The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics, 21(1), January 2020. doi: 10.1186/s12864-019-6413-7.
- Hybrid neonatal EEG seizure detection algorithms achieve the benchmark of visual interpretation of the human expert. In Int Conf Eng Med Bio Soc (EMBC), pages 5991–5994, 2019b. doi: 10.1109/EMBC.2019.8857367.
- Time-varying EEG correlations improve automated neonatal seizure detection. Int J Neural Syst, 29(04):1850030, 2019. doi: 10.1142/S0129065718500302.
- Validating an SVM-based neonatal seizure detection algorithm for generalizability, non-inferiority and clinical efficacy. Comput Biol Med, 145:105399, 2022. doi: 10.1016/j.compbiomed.2022.105399.
- Deep double descent: where bigger models and more data hurt. J Stat Mech: Theory Exp, 2021(12):124003, December 2021. doi: 10.1088/1742-5468/ac3a74.
- Scaling laws for neural language models. arXiv preprint arXiv:2001.08361, 2020. doi: 10.48550/arXiv.2001.08361.
- Training compute-optimal large language models. arXiv preprint arXiv:2203.15556, 2022. doi: 10.48550/arXiv.2203.15556.
- EEG in the healthy term newborn within 12 hours of birth. Clin Neurophysiol, 120(6):1046–53, June 2009. doi: 10.1016/j.clinph.2009.03.015.
- Identifying tracé alternant activity in neonatal EEG using an inter-burst detection approach. In Int Conf Eng Med Bio Soc (EMBC), pages 5984–5987. IEEE, 2020. doi: 10.1109/EMBC44109.2020.9176147.
- Multichannel EEG abnormalities during the first 6 hours in infants with mild hypoxic–ischaemic encephalopathy. Pediatr Res, 90(1):117–124, July 2021. doi: 10.1038/s41390-021-01412-x.
- Super-convergence: Very fast training of neural networks using large learning rates. In AI Machine Learning Multidomain Operations Appl, volume 11006, pages 369–386. SPIE, 2019. doi: 10.1117/12.2520589.
- Cutmix: Regularization strategy to train strong classifiers with localizable features. In Proc IEEE/CVF Int Conf Computer Vision (CVPR), pages 6023–6032, 2019. doi: 10.1109/ICCV.2019.00612.
- Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552, 2017. doi: 10.48550/arXiv.1708.04552.
- mixup: Beyond empirical risk minimization. In Int Conf Learning Representations (ICLR), 2018. doi: 10.48550/arXiv.1710.09412.