Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model (2306.16269v2)

Published 28 Jun 2023 in cs.CV

Abstract: Leveraging the extensive training data from SA-1B, the Segment Anything Model (SAM) demonstrates remarkable generalization and zero-shot capabilities. However, as a category-agnostic instance segmentation method, SAM heavily relies on prior manual guidance, including points, boxes, and coarse-grained masks. Furthermore, its performance in remote sensing image segmentation tasks remains largely unexplored and unproven. In this paper, we aim to develop an automated instance segmentation approach for remote sensing images, based on the foundational SAM model and incorporating semantic category information. Drawing inspiration from prompt learning, we propose a method to learn the generation of appropriate prompts for SAM. This enables SAM to produce semantically discernible segmentation results for remote sensing images, a concept we have termed RSPrompter. We also propose several ongoing derivatives for instance segmentation tasks, drawing on recent advancements within the SAM community, and compare their performance with RSPrompter. Extensive experimental results, derived from the WHU building, NWPU VHR-10, and SSDD datasets, validate the effectiveness of our proposed method. The code for our method is publicly available at kychen.me/RSPrompter.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Keyan Chen (34 papers)
  2. Chenyang Liu (26 papers)
  3. Hao Chen (1006 papers)
  4. Haotian Zhang (107 papers)
  5. Wenyuan Li (47 papers)
  6. Zhengxia Zou (52 papers)
  7. Zhenwei Shi (77 papers)
Citations (131)

Summary

We haven't generated a summary for this paper yet.