Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

OCT-SelfNet: A Self-Supervised Framework with Multi-Modal Datasets for Generalized and Robust Retinal Disease Detection (2401.12344v1)

Published 22 Jan 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Despite the revolutionary impact of AI and the development of locally trained algorithms, achieving widespread generalized learning from multi-modal data in medical AI remains a significant challenge. This gap hinders the practical deployment of scalable medical AI solutions. Addressing this challenge, our research contributes a self-supervised robust machine learning framework, OCT-SelfNet, for detecting eye diseases using optical coherence tomography (OCT) images. In this work, various data sets from various institutions are combined enabling a more comprehensive range of representation. Our method addresses the issue using a two-phase training approach that combines self-supervised pretraining and supervised fine-tuning with a mask autoencoder based on the SwinV2 backbone by providing a solution for real-world clinical deployment. Extensive experiments on three datasets with different encoder backbones, low data settings, unseen data settings, and the effect of augmentation show that our method outperforms the baseline model, Resnet-50 by consistently attaining AUC-ROC performance surpassing 77% across all tests, whereas the baseline model exceeds 54%. Moreover, in terms of the AUC-PR metric, our proposed method exceeded 42%, showcasing a substantial increase of at least 10% in performance compared to the baseline, which exceeded only 33%. This contributes to our understanding of our approach's potential and emphasizes its usefulness in clinical settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (57)
  1. Av-net: deep learning for fully automated artery-vein classification in optical coherence tomography angiography. Biomed. Opt. Express, 11(9):5249–5257, Sep 2020.
  2. Quantitative oct angiography features for objective classification and staging of diabetic retinopathy. Retina (Philadelphia, Pa.), 2020.
  3. Olive disease classification based on vision transformer and cnn models. Computational Intelligence and Neuroscience, 2022, 2022.
  4. Classification of sd-oct images using a deep learning approach. In 2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), pages 489–492, 2017.
  5. Vision-transformer-based transfer learning for mammogram classification. Diagnostics, 13(2):178, 2023.
  6. Beit: Bert pre-training of image transformers. arXiv preprint arXiv:2106.08254, 2021.
  7. Comparing different deep learning architectures for classification of chest radiographs. Scientific reports, 10(1):13590, 2020.
  8. A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
  9. Deep learning models for screening of high myopia using optical coherence tomography. Scientific reports, 11(1):21663, 2021.
  10. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  11. An image is worth 16x16 words: Transformers for image recognition at scale. CoRR, abs/2010.11929, 2020.
  12. Self-supervised patient-specific features learning for oct image classification. Medical & Biological Engineering & Computing, 60(10):2851–2863, 2022.
  13. Analysis of the relationship between drusen size and drusen area in eyes with age-related macular degeneration. Ophthalmic Surgery, Lasers and Imaging Retina, 42(5):369–375, 2011.
  14. Transfer learning for domain adaptation in mri: Application in brain lesion segmentation. In Medical Image Computing and Computer Assisted Intervention- MICCAI 2017: 20th International Conference, Quebec City, QC, Canada, September 11-13, 2017, Proceedings, Part III 20, pages 516–524. Springer, 2017.
  15. Federated learning for diagnosis of age-related macular degeneration. Frontiers in Medicine, 10, 2023.
  16. Classifying neovascular age-related macular degeneration with a deep convolutional neural network based on optical coherence tomography images. Scientific Reports, 12(1):2232, 2022.
  17. Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16000–16009, 2022.
  18. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9729–9738, 2020.
  19. Deep residual learning for image recognition. CoRR, abs/1512.03385, 2015.
  20. Khawar Islam. Recent advances in vision transformer: A survey and outlook of recent work. arXiv preprint arXiv:2203.01536, 2022.
  21. Self-supervised visual feature learning with deep neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence, 43(11):4037–4058, 2020.
  22. Identifying medical diagnoses and treatable diseases by image-based deep learning. cell, 172(5):1122–1131, 2018.
  23. Detection of nonexudative macular neovascularization on structural oct images using vision transformers. Ophthalmology Science, 2(4):100197, 2022.
  24. Transfer learning for medical image classification: a literature review. BMC medical imaging, 22(1):69, 2022.
  25. Pre-trained deep learning models for brain mri image classification. Frontiers in Human Neuroscience, 17:1150120, 2023.
  26. Imagenet classification with deep convolutional neural networks. In F. Pereira, C.J. Burges, L. Bottou, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 25. Curran Associates, Inc., 2012.
  27. Transfer Learning for Automated OCTA Detection of Diabetic Retinopathy. Translational Vision Science & Technology, 9(2):35–35, 07 2020.
  28. Oct-based deep-learning models for the identification of retinal key signs. Scientific Reports, 13(1):14628, 2023.
  29. Deep learning is effective for the classification of oct images of normal versus age-related macular degeneration. ophthalmol retina. 2017; 1 (4): 322–7.
  30. Deep learning is effective for classifying normal versus age-related macular degeneration oct images. Ophthalmology Retina, 1(4):322–327, 2017.
  31. Automated deep learning-based amd detection and staging in real-world oct datasets (pinnacle study report 5). Scientific Reports, 13(1):19545, 2023.
  32. Octa-500: A retinal dataset for optical coherence tomography angiography study. arXiv e-prints, pages arXiv–2012, 2020.
  33. Swin transformer v2: Scaling up capacity and resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12009–12019, 2022.
  34. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pages 10012–10022, 2021.
  35. Decoupled weight decay regularization, 2019.
  36. Deep learning-based automated classification of multi-categorical abnormalities from optical coherence tomography images. Translational vision science & technology, 7(6):41–41, 2018.
  37. Ievit: An enhanced vision transformer architecture for chest x-ray image classification. Computer Methods and Programs in Biomedicine, 226:107141, 2022.
  38. A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking. arXiv preprint arXiv:2309.02031, 2023.
  39. Self-supervised iterative refinement learning for macular oct volumetric data classification. Computers in biology and medicine, 111:103327, 2019.
  40. Prediction of individual disease conversion in early amd using artificial intelligence. Investigative ophthalmology & visual science, 59(8):3199–3208, 2018.
  41. Long-term follow-up of vascular endothelial growth factor inhibitor therapy for neovascular age-related macular degeneration. Current opinion in ophthalmology, 24(3):190–196, 2013.
  42. A comparative study of multiple neural network for detection of covid-19 on chest x-ray. EURASIP journal on advances in signal processing, 2021:1–16, 2021.
  43. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
  44. Multi-scale convolutional neural network for automated amd classification using retinal oct images. Computers in Biology and Medicine, 144:105368, 2022.
  45. Deep transfer learning approaches in performance analysis of brain tumor classification using mri images. Journal of Healthcare Engineering, 2022, 2022.
  46. Fully automated detection of diabetic macular edema and dry age-related macular degeneration from optical coherence tomography images. Biomedical optics express, 5(10):3568–3577, 2014.
  47. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1–9, 2015.
  48. Classification of optical coherence tomography images using a capsule network. BMC ophthalmology, 20(1):1–9, 2020.
  49. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  50. Genetic and environmental factors strongly influence risk, severity and progression of age-related macular degeneration. Signal transduction and targeted therapy, 1(1):1–6, 2016.
  51. Semi-supervised vision transformer with adaptive token sampling for breast cancer classification. Frontiers in Pharmacology, 13:929755, 2022.
  52. World Health Organization. Blindness and visual impairment, 8 2023.
  53. Vision transformer-based recognition of diabetic retinopathy grade. Medical Physics, 48(12):7850–7863, 2021.
  54. Simmim: A simple framework for masked image modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9653–9663, 2022.
  55. Resnet and its application to medical image processing: Research progress and challenges. Computer Methods and Programs in Biomedicine, page 107660, 2023.
  56. Detection and analysis of covid-19 in medical images using deep learning techniques. Scientific Reports, 11(1):19638, 2021.
  57. Spectral domain optical coherence tomography for quantitative evaluation of drusen and associated structural changes in non-neovascular age-related macular degeneration. British Journal of Ophthalmology, 93(2):176–181, 2009.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets