Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Box-supervised Instance Segmentation with Level Set Evolution (2207.09055v1)

Published 19 Jul 2022 in cs.CV

Abstract: In contrast to the fully supervised methods using pixel-wise mask labels, box-supervised instance segmentation takes advantage of the simple box annotations, which has recently attracted a lot of research attentions. In this paper, we propose a novel single-shot box-supervised instance segmentation approach, which integrates the classical level set model with deep neural network delicately. Specifically, our proposed method iteratively learns a series of level sets through a continuous Chan-Vese energy-based function in an end-to-end fashion. A simple mask supervised SOLOv2 model is adapted to predict the instance-aware mask map as the level set for each instance. Both the input image and its deep features are employed as the input data to evolve the level set curves, where a box projection function is employed to obtain the initial boundary. By minimizing the fully differentiable energy function, the level set for each instance is iteratively optimized within its corresponding bounding box annotation. The experimental results on four challenging benchmarks demonstrate the leading performance of our proposed approach to robust instance segmentation in various scenarios. The code is available at: https://github.com/LiWentomng/boxlevelset.

Box-supervised Instance Segmentation with Level Set Evolution

The paper presents a novel approach to the burgeoning field of box-supervised instance segmentation by integrating the classical level set model with deep learning. Traditional approaches in instance segmentation rely on fully-supervised methods that require pixel-wise mask annotations, leading to significant labeling costs. In contrast, this method leverages simpler box annotations to facilitate segmentation, which has emerged as an attractive alternative for research exploration.

The authors propose a single-shot framework that synergizes the Chan-Vese energy-based level set model with neural networks. This integration is achieved by iteratively learning a series of level set functions that enable implicit curve evolution within an annotated bounding box. The iterative learning process uses a differentiable energy function as the optimization target, which is uniquely adapted for end-to-end training.

Key to this architecture is the adaptation of a mask prediction model, SOLOv2, which generates instance-aware mask maps treated as level sets for each instance. These mask maps, combined with both the input image and its deep features, serve as input data for the level set evolution process. The authors employ a box projection function to derive an initial boundary, which is then refined through iterative optimization by minimizing the defined energy function within each bounding box. This technique effectively bridged box annotation with pixel-level instance distinctions.

The experimental results on multiple datasets including COCO, Pascal VOC, iSAID, and LiTS, suggest that this approach exhibits state-of-the-art performance among box-supervised methods, particularly outperforming existing techniques like BoxInst and DiscoBox in various configurations. This approach narrows the performance gap with fully supervised segmentation methods, demonstrating its practical viability and robustness in diverse scenarios.

The paper's unique contribution lies in being the first to address box-supervised instance segmentation using a level set-based method, which traditionally required fully annotated training samples or separate auxiliary tasks to generate pseudo-labels. The framework proposed reduces the complexity and resource requirements typically associated with training instance segmentation models while maintaining competitive accuracy levels.

The implication of utilizing level set evolution alongside modern deep learning architectures raises interesting prospects for future research. The method's capacity to effectively operate with less supervision points to broader applications where detailed annotations are financially or logistically impractical. Additionally, exploring alternative formulations of the energy function or various input feature configurations could further improve segmentation quality and efficiency.

In conclusion, this research provides a compelling advancement in the field of instance segmentation through innovative integration of classical methods with machine learning, paving the way for cost-effective applications and complex improvements in machine vision systems. This paper not only highlights a strong numerical performance across various benchmarks but also suggests further investigations into diverse feature interactions and potential cross-domain application scenarios.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Wentong Li (25 papers)
  2. Wenyu Liu (146 papers)
  3. Jianke Zhu (68 papers)
  4. Miaomiao Cui (27 papers)
  5. Xiansheng Hua (26 papers)
  6. Lei Zhang (1689 papers)
Citations (51)