Few Shot Semantic Segmentation: a review of methodologies, benchmarks, and open challenges (2304.05832v2)
Abstract: Semantic segmentation, vital for applications ranging from autonomous driving to robotics, faces significant challenges in domains where collecting large annotated datasets is difficult or prohibitively expensive. In such domains, for example medicine and agriculture, the scarcity of training images hampers progress. Few-Shot Semantic Segmentation, a novel task in computer vision, addresses this problem by aiming to design models capable of segmenting new semantic classes from only a few examples. This paper presents a comprehensive survey of Few-Shot Semantic Segmentation, tracing its evolution and exploring various model designs, from the more popular conditional and prototypical networks to the more niche latent space optimization methods, and also presents the new opportunities offered by recent foundation models. Through a chronological narrative, we dissect influential trends and methodologies, providing insights into their strengths and limitations. A timeline offers a visual roadmap, marking key milestones in the field's progression. Complemented by quantitative analyses on benchmark datasets and qualitative showcases of seminal works, this survey aims to give readers a deep understanding of the topic. By elucidating current challenges, state-of-the-art models, and prospects, we aid researchers and practitioners in navigating the intricacies of Few-Shot Semantic Segmentation and provide ground for future development.