MyriadAL: Active Few Shot Learning for Histopathology (2310.16161v2)
Abstract: Active Learning (AL) and Few Shot Learning (FSL) are two label-efficient methods that have recently achieved excellent results. However, most prior work in both paradigms fails to exploit the wealth of available unlabelled data. In this study, we address this issue in the scenario where the annotation budget is very limited, yet a large amount of unlabelled data for the target task is available. We frame this work in the context of histopathology, where labelling is prohibitively expensive. To this end, we introduce an active few shot learning framework, Myriad Active Learning (MAL), which places a contrastive-learning encoder, pseudo-label generation, and a novel query sample selection step in the loop. Specifically, we first process the unlabelled data in a self-supervised manner; the resulting data representations and clustering knowledge form the basis for starting the AL loop. With feedback from the oracle in each AL cycle, the pseudo-labels of the unlabelled data are refined by optimizing a shallow task-specific net on top of the encoder. These updated pseudo-labels in turn inform and improve the active learning query selection process. Furthermore, we introduce a novel recipe that combines existing uncertainty measures and uses the entire uncertainty list to reduce sample redundancy in AL. Extensive experiments on two public histopathology datasets show that MAL achieves higher test accuracy, macro F1-score, and label efficiency than prior works, and can reach test accuracy comparable to a fully supervised algorithm while labelling only 5% of the dataset.
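The abstract does not spell out how the uncertainty measures are combined or how the full uncertainty list is used, so the following is only a minimal sketch under stated assumptions. It takes entropy and margin as the two measures, mixes them with a hypothetical `weight`, and spreads queries over the ranked pool by splitting it into `budget` contiguous chunks and querying the most uncertain sample in each. All function names here are illustrative and are not taken from the paper.

```python
import numpy as np

def entropy_uncertainty(probs):
    """Shannon entropy of each row, scaled to [0, 1] by log(num_classes)."""
    clipped = np.clip(probs, 1e-12, 1.0)  # avoid log(0)
    ent = -np.sum(probs * np.log(clipped), axis=1)
    return ent / np.log(probs.shape[1])

def margin_uncertainty(probs):
    """1 - (top1 - top2): a small gap between the two best classes scores high."""
    top2 = np.sort(probs, axis=1)[:, -2:]
    return 1.0 - (top2[:, 1] - top2[:, 0])

def combined_uncertainty(probs, weight=0.5):
    # Assumption: a simple convex combination; the paper's recipe may differ.
    return (weight * entropy_uncertainty(probs)
            + (1.0 - weight) * margin_uncertainty(probs))

def select_queries(probs, budget):
    """Pick `budget` pool indices spread across the whole ranked uncertainty list."""
    scores = combined_uncertainty(probs)
    ranked = np.argsort(-scores, kind="stable")  # most uncertain first
    budget = min(budget, len(ranked))
    # Hypothetical strategy: one query per contiguous chunk of the ranking,
    # rather than the top-`budget` samples, which tend to be near-duplicates.
    chunks = np.array_split(ranked, budget)
    return np.array([chunk[0] for chunk in chunks])

if __name__ == "__main__":
    # Softmax outputs of a task head on four unlabelled samples (made-up values).
    pool = np.array([[0.5, 0.5], [0.9, 0.1], [0.6, 0.4], [1.0, 0.0]])
    print(select_queries(pool, budget=2))
```

In an AL cycle, the selected indices would be sent to the oracle, the task head retrained on the enlarged labelled set, and the pool probabilities recomputed before the next call to `select_queries`.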
- Nico Schiavone
- Jingyi Wang
- Shuangzhi Li
- Roger Zemp
- Xingyu Li