Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MISS: Memory-efficient Instance Segmentation Framework By Visual Inductive Priors Flow Propagation (2403.11576v1)

Published 18 Mar 2024 in cs.CV and cs.MM

Abstract: Instance segmentation, a cornerstone task in computer vision, has wide-ranging applications in diverse industries. The advent of deep learning and artificial intelligence has underscored the criticality of training effective models, particularly in data-scarce scenarios - a concern that resonates in both academic and industrial circles. A significant impediment in this domain is the resource-intensive nature of procuring high-quality, annotated data for instance segmentation, a hurdle that amplifies the challenge of developing robust models under resource constraints. In this context, the strategic integration of a visual prior into the training dataset emerges as a potential solution to enhance congruity with the testing data distribution, consequently reducing the dependency on computational resources and the need for highly complex models. However, effectively embedding a visual prior into the learning process remains a complex endeavor. Addressing this challenge, we introduce the MISS (Memory-efficient Instance Segmentation System) framework. MISS leverages visual inductive prior flow propagation, integrating intrinsic prior knowledge from the Synergy-basketball dataset at various stages: data preprocessing, augmentation, training, and inference. Our empirical evaluations underscore the efficacy of MISS, demonstrating commendable performance in scenarios characterized by limited data availability and memory constraints.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (14)
  1. “Instance segmentation challenge track technical report, vipriors workshop at iccv 2021: Task-specific copy-paste data augmentation method for instance segmentation,” arXiv preprint arXiv:2110.00470, 2021.
  2. “Task-specific data augmentation and inference processing for vipriors instance segmentation challenge,” arXiv preprint arXiv:2211.11282, 2022.
  3. “Cascade r-cnn: Delving into high quality object detection,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018.
  4. “Per-pixel classification is not all you need for semantic segmentation,” in Advances in Neural Information Processing Systems, 2021.
  5. “Instances as queries,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, October 2021, pp. 6910–6919.
  6. “Cbnet: A composite backbone network architecture for object detection,” Proceedings of the AAAI Conference on Artificial Intelligence, 2022.
  7. “Simple copy-paste is a strong data augmentation method for instance segmentation,” Proceedings of the IEEE conference on computer vision and pattern recognition, 2021.
  8. “Vipriors 2: Visual inductive priors for data-efficient deep learning challenges,” arXiv preprint arXiv:2201.08625, 2021.
  9. “Vipriors 3: Visual inductive priors for data-efficient deep learning challenges,” arXiv preprint arXiv:2305.19688, 2022.
  10. John Canny, “A computational approach to edge detection,” IEEE Transactions on pattern analysis and machine intelligence, , no. 6, pp. 679–698, 1986.
  11. “Use of the hough transformation to detect lines and curves in pictures,” Communications of the ACM, vol. 15, no. 1, pp. 11–15, 1972.
  12. “MMDetection: Open mmlab detection toolbox and benchmark,” arXiv preprint arXiv:1906.07155, 2019.
  13. “Swa object detection,” arXiv preprint arXiv:2012.12645, 2020.
  14. “Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time,” International Conference on Machine Learning, 2022.

Summary

We haven't generated a summary for this paper yet.