MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation (2303.05105v2)
Abstract: Few-shot instance segmentation extends the few-shot learning paradigm to the instance segmentation task, which tries to segment instance objects from a query image with a few annotated examples of novel categories. Conventional approaches have attempted to address the task via prototype learning, known as point estimation. However, this mechanism depends on prototypes (\eg mean of $K-$shot) for prediction, leading to performance instability. To overcome the disadvantage of the point estimation mechanism, we propose a novel approach, dubbed MaskDiff, which models the underlying conditional distribution of a binary mask, which is conditioned on an object region and $K-$shot information. Inspired by augmentation approaches that perturb data with Gaussian noise for populating low data density regions, we model the mask distribution with a diffusion probabilistic model. We also propose to utilize classifier-free guided mask sampling to integrate category information into the binary mask generation process. Without bells and whistles, our proposed method consistently outperforms state-of-the-art methods on both base and novel classes of the COCO dataset while simultaneously being more stable than existing methods. The source code is available at: https://github.com/minhquanlecs/MaskDiff.
- Diffusion Models Beat GANs on Image Synthesis. In NeurIPS, volume 34, 8780–8794.
- Incremental Few-Shot Instance Segmentation. In CVPR, 1185–1194.
- Mask R-CNN. In ICCV (ICCV).
- Denoising Diffusion Probabilistic Models. In NeurIPS, volume 33, 6840–6851.
- Classifier-Free Diffusion Guidance. In NeurIPS Workshops.
- Class-incremental few-shot object detection. arXiv preprint arXiv:2105.07637.
- Fully convolutional networks for semantic segmentation. In CVPR, 3431–3440.
- FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter. In CVPR, 11099–11108.
- iFS-RCNN: An Incremental Few-Shot Instance Segmenter. In CVPR, 7010–7019.
- Incremental few-shot object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 13846–13855.
- Generative Modeling by Estimating Gradients of the Data Distribution. In NeurIPS, volume 32.
- Dynamic Transformer for Few-Shot Instance Segmentation. In ACM MM, 2969–2977.
- Frustratingly Simple Few-Shot Object Detection. In ICML, volume 119, 9919–9928.
- Few-shot object detection and viewpoint estimation for objects in the wild. In ECCV, 192–210.
- Meta r-cnn: Towards general solver for instance-level low-shot learning. In ICCV, 9577–9586.
- Minh-Quan Le (11 papers)
- Tam V. Nguyen (38 papers)
- Trung-Nghia Le (42 papers)
- Thanh-Toan Do (92 papers)
- Minh N. Do (38 papers)
- Minh-Triet Tran (70 papers)