Adapting Self-Supervised Learning for Computational Pathology (2405.01688v1)
Abstract: Self-supervised learning (SSL) has emerged as a key technique for training networks that can generalize well to diverse tasks without task-specific supervision. This property makes SSL desirable for computational pathology, the study of digitized images of tissues, as there are many target applications and often limited labeled training samples. However, SSL algorithms and models have been primarily developed in the field of natural images and whether their performance can be improved by adaptation to particular domains remains an open question. In this work, we present an investigation of modifications to SSL for pathology data, specifically focusing on the DINOv2 algorithm. We propose alternative augmentations, regularization functions, and position encodings motivated by the characteristics of pathology images. We evaluate the impact of these changes on several benchmarks to demonstrate the value of tailored approaches.
- A comprehensive multi-domain dataset for mitotic figure detection. Scientific Data, 10(1):484, 2023.
- Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging. Nature Biomedical Engineering, 7(6):756–779, 2023.
- The effects of regularization and data augmentation are class dependent, 2022.
- Vicreg: Variance-invariance-covariance regularization for self-supervised learning, 2022.
- Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama, 318(22):2199–2210, 2017.
- Towards democratizing joint-embedding self-supervised learning, 2023.
- Discrete energy on rectifiable sets. Springer Monographs in Mathematics, 2019.
- Emerging properties in self-supervised vision transformers, 2021.
- Towards a general-purpose foundation model for computational pathology. Nature Medicine, pages 1–13, 2024.
- A simple framework for contrastive learning of visual representations, 2020.
- Exploring simple siamese representation learning, 2020.
- Self supervised contrastive learning for digital histopathology. Machine Learning with Applications, 7:100198, 2022.
- Machine learning in computational histopathology: Challenges and opportunities. Genes, Chromosomes and Cancer, 2023.
- Deep learning in digital pathology image analysis: a survey. Frontiers of medicine, 14:470–487, 2020.
- RudolfV: A foundation model by pathologists for pathologists, 2024.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Scaling self-supervised learning for histopathology with masked image modeling. medRxiv, pages 2023–07, 2023.
- Bootstrap your own latent: A new approach to self-supervised learning, 2020.
- Synthetic domain-targeted augmentation (S-DOTA) improves model generalization in digital pathology. arXiv preprint arXiv:2305.02401, 2023.
- Augmentation component analysis: Modeling similarity via the augmentation overlaps. arXiv preprint arXiv:2206.00471, 2022.
- Masked autoencoders are scalable vision learners, 2021.
- Towards the generalization of contrastive self-supervised learning. arXiv preprint arXiv:2111.00743, 2021.
- Benchmarking self-supervised learning on diverse pathology datasets. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3344–3354, 2023.
- 100,000 histological images of human colorectal cancer and healthy tissue. Zenodo, 2018.
- Dinov2: Learning robust visual features without supervision, 2024.
- Scale-mae: A scale-aware masked autoencoder for multiscale geospatial representation learning, 2023.
- Spreading vectors for similarity search, 2019.
- Artificial intelligence for digital and computational pathology. Nature Reviews Bioengineering, 1(12):930–949, 2023.
- Deep neural network models for computational histopathology: A survey. Medical Image Analysis, 67:101813, 2021.
- Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology. Medical image analysis, 58:101544, 2019.
- Rotation equivariant cnns for digital pathology. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16-20, 2018, Proceedings, Part II 11, pages 210–218. Springer, 2018.
- Virchow: A million-slide digital pathology foundation model, 2024.
- Understanding contrastive representation learning through alignment and uniformity on the hypersphere, 2022.
- Transformer-based unsupervised contrastive learning for histopathological image classification. Medical image analysis, 81:102559, 2022.
- A petri dish for histopathology image analysis. In Artificial Intelligence in Medicine: 19th International Conference on Artificial Intelligence in Medicine, AIME 2021, Virtual Event, June 15–18, 2021, Proceedings, pages 11–24. Springer, 2021.
- Decoupled contrastive learning, 2022.
- Barlow twins: Self-supervised learning via redundancy reduction, 2021.
- Eric Zimmermann (11 papers)
- Neil Tenenholtz (14 papers)
- James Hall (8 papers)
- George Shaikovski (5 papers)
- Michal Zelechowski (4 papers)
- Adam Casson (5 papers)
- Fausto Milletari (15 papers)
- Julian Viret (5 papers)
- Eugene Vorontsov (19 papers)
- Siqi Liu (94 papers)
- Kristen Severson (7 papers)