Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation (2303.05105v2)

Published 9 Mar 2023 in cs.CV

Abstract: Few-shot instance segmentation extends the few-shot learning paradigm to the instance segmentation task, which tries to segment instance objects from a query image with a few annotated examples of novel categories. Conventional approaches have attempted to address the task via prototype learning, known as point estimation. However, this mechanism depends on prototypes (\eg mean of $K-$shot) for prediction, leading to performance instability. To overcome the disadvantage of the point estimation mechanism, we propose a novel approach, dubbed MaskDiff, which models the underlying conditional distribution of a binary mask, which is conditioned on an object region and $K-$shot information. Inspired by augmentation approaches that perturb data with Gaussian noise for populating low data density regions, we model the mask distribution with a diffusion probabilistic model. We also propose to utilize classifier-free guided mask sampling to integrate category information into the binary mask generation process. Without bells and whistles, our proposed method consistently outperforms state-of-the-art methods on both base and novel classes of the COCO dataset while simultaneously being more stable than existing methods. The source code is available at: https://github.com/minhquanlecs/MaskDiff.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (15)
  1. Diffusion Models Beat GANs on Image Synthesis. In NeurIPS, volume 34, 8780–8794.
  2. Incremental Few-Shot Instance Segmentation. In CVPR, 1185–1194.
  3. Mask R-CNN. In ICCV (ICCV).
  4. Denoising Diffusion Probabilistic Models. In NeurIPS, volume 33, 6840–6851.
  5. Classifier-Free Diffusion Guidance. In NeurIPS Workshops.
  6. Class-incremental few-shot object detection. arXiv preprint arXiv:2105.07637.
  7. Fully convolutional networks for semantic segmentation. In CVPR, 3431–3440.
  8. FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter. In CVPR, 11099–11108.
  9. iFS-RCNN: An Incremental Few-Shot Instance Segmenter. In CVPR, 7010–7019.
  10. Incremental few-shot object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 13846–13855.
  11. Generative Modeling by Estimating Gradients of the Data Distribution. In NeurIPS, volume 32.
  12. Dynamic Transformer for Few-Shot Instance Segmentation. In ACM MM, 2969–2977.
  13. Frustratingly Simple Few-Shot Object Detection. In ICML, volume 119, 9919–9928.
  14. Few-shot object detection and viewpoint estimation for objects in the wild. In ECCV, 192–210.
  15. Meta r-cnn: Towards general solver for instance-level low-shot learning. In ICCV, 9577–9586.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Minh-Quan Le (11 papers)
  2. Tam V. Nguyen (38 papers)
  3. Trung-Nghia Le (42 papers)
  4. Thanh-Toan Do (92 papers)
  5. Minh N. Do (38 papers)
  6. Minh-Triet Tran (70 papers)
Citations (8)

Summary

We haven't generated a summary for this paper yet.