Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation (2312.14387v1)
Abstract: Point-based interactive image segmentation can ease the burden of mask annotation in applications such as semantic segmentation and image editing. However, fully extracting the target mask from limited user inputs remains challenging. We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement, to enhance segmentation quality with fewer user inputs. Existing methods commonly treat the previous segmentation result as the initial mask and refine it iteratively. However, these conventional techniques are sensitive to variance in the initial mask. To address this problem, our method incorporates a mask matching algorithm that ensures consistent inferences from different types of initial masks. We also introduce a target-aware zooming algorithm that preserves object information during downsampling, balancing efficiency and accuracy. Experiments on the GrabCut, Berkeley, SBD, and DAVIS datasets demonstrate our method's state-of-the-art performance in interactive image segmentation.
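The abstract does not spell out how the target-aware zooming operates. The sketch below only illustrates the general idea of preserving object information during downsampling: crop to a region that covers the previous mask and the user clicks, then resize that crop instead of the full image. The function names, the `margin` parameter, and the nearest-neighbor resize are hypothetical choices made for this illustration and should not be read as the authors' algorithm.

```python
import numpy as np

def target_aware_crop(mask, clicks, margin=0.2):
    """Box (top, bottom, left, right) around the mask foreground and clicks.

    Hypothetical helper: the box is expanded by `margin` of its size on each
    side and clipped to the image bounds.
    """
    h, w = mask.shape
    ys, xs = np.nonzero(mask)
    clicks = np.asarray(clicks, dtype=int).reshape(-1, 2)  # rows of (y, x)
    ys = np.concatenate([ys, clicks[:, 0]])
    xs = np.concatenate([xs, clicks[:, 1]])
    if ys.size == 0:
        # No target evidence yet: fall back to the full image.
        return 0, h, 0, w
    top, bottom = int(ys.min()), int(ys.max()) + 1
    left, right = int(xs.min()), int(xs.max()) + 1
    pad_y = int(round((bottom - top) * margin))
    pad_x = int(round((right - left) * margin))
    return (max(0, top - pad_y), min(h, bottom + pad_y),
            max(0, left - pad_x), min(w, right + pad_x))

def resize_nearest(image, out_h, out_w):
    """Nearest-neighbor resize, used here only to keep the sketch dependency-free."""
    h, w = image.shape[:2]
    rows = np.arange(out_h) * h // out_h
    cols = np.arange(out_w) * w // out_w
    return image[rows[:, None], cols]

# Usage: zoom into the target before feeding a fixed-size network input.
image = np.random.rand(100, 100, 3)
prev_mask = np.zeros((100, 100), dtype=bool)
prev_mask[40:60, 30:50] = True
top, bottom, left, right = target_aware_crop(prev_mask, [(70, 45)])
zoomed = resize_nearest(image[top:bottom, left:right], 64, 64)
```

Cropping before resizing means the fixed input resolution is spent on the target region rather than on background, which is one plausible way to trade off efficiency against accuracy.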
- Chaowei Fang
- Ziyin Zhou
- Junye Chen
- Hanjing Su
- Qingyao Wu
- Guanbin Li