Self-supervised learning for skin cancer diagnosis with limited training data (2401.00692v3)
Abstract: Early cancer detection is crucial for prognosis, but many cancer types lack large labelled datasets required for developing deep learning models. This paper investigates self-supervised learning (SSL) as an alternative to the standard supervised pre-training on ImageNet for scenarios with limited training data using a deep learning model (ResNet-50). We first demonstrate that SSL pre-training on ImageNet (via the Barlow Twins SSL algorithm) outperforms supervised pre-training (SL) using a skin lesion dataset with limited training samples. We then consider further SSL pre-training (of the two ImageNet pre-trained models) on task-specific datasets, where our implementation is motivated by supervised transfer learning. This approach significantly enhances initially SL pre-trained models, closing the performance gap with initially SSL pre-trained ones. Surprisingly, further pre-training on just the limited fine-tuning data achieves this performance equivalence. Linear probe experiments reveal that improvement stems from enhanced feature extraction. Hence, we find that minimal further SSL pre-training on task-specific data can be as effective as large-scale SSL pre-training on ImageNet for medical image classification tasks with limited labelled data. We validate these results on an oral cancer histopathology dataset, suggesting broader applicability across medical imaging domains facing labelled data scarcity.
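For readers unfamiliar with the Barlow Twins objective named in the abstract, the following is a minimal NumPy sketch of its redundancy-reduction loss. It is an illustration under stated assumptions, not the authors' implementation: the function name `barlow_twins_loss`, the weight `lambd`, and the stabilizer `eps` are chosen here for the example and are not taken from this paper. The inputs are assumed to be embeddings of two augmented views of the same batch of images (for instance, projected ResNet-50 features).

```python
import numpy as np


def barlow_twins_loss(z_a, z_b, lambd=5e-3, eps=1e-12):
    """Redundancy-reduction loss for two (batch, dim) embedding matrices.

    z_a and z_b are assumed to come from two augmented views of the same
    images. lambd is an illustrative trade-off weight (an assumption).
    """
    z_a = np.asarray(z_a, dtype=float)
    z_b = np.asarray(z_b, dtype=float)
    n = z_a.shape[0]

    # Standardize every embedding dimension across the batch.
    z_a = (z_a - z_a.mean(axis=0)) / (z_a.std(axis=0) + eps)
    z_b = (z_b - z_b.mean(axis=0)) / (z_b.std(axis=0) + eps)

    # Cross-correlation matrix between the two views, shape (dim, dim).
    c = z_a.T @ z_b / n

    # Invariance term: diagonal entries are pushed towards 1.
    diag = np.diag(c)
    on_diag = np.sum((diag - 1.0) ** 2)

    # Redundancy-reduction term: off-diagonal entries are pushed towards 0.
    off_diag = np.sum(c ** 2) - np.sum(diag ** 2)

    return on_diag + lambd * off_diag
```

In a further pre-training setup like the one described above, a loss of this form would be minimized on unlabelled task-specific images before the supervised fine-tuning or linear probe stage. The loss is near zero when the two views produce identical, mutually decorrelated features, and it grows when the views disagree.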
Authors: Hamish Haggerty, Rohitash Chandra