
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model (2305.02034v4)

Published 3 May 2023 in cs.CV

Abstract: The success of the Segment Anything Model (SAM) demonstrates the significance of data-centric machine learning. However, due to the difficulties and high costs associated with annotating Remote Sensing (RS) images, a large amount of valuable RS data remains unlabeled, particularly at the pixel level. In this study, we leverage SAM and existing RS object detection datasets to develop an efficient pipeline for generating a large-scale RS segmentation dataset, dubbed SAMRS. In total, SAMRS contains 105,090 images and 1,668,241 instances, surpassing existing high-resolution RS segmentation datasets in size by several orders of magnitude. It provides object category, location, and instance information that can be used for semantic segmentation, instance segmentation, and object detection, either individually or in combination. We also provide a comprehensive analysis of SAMRS from various aspects. Moreover, preliminary experiments highlight the importance of conducting segmentation pre-training with SAMRS to address task discrepancies and alleviate the limitations posed by limited training data during fine-tuning. The code and dataset will be available at https://github.com/ViTAE-Transformer/SAMRS.

Authors (7)
  1. Di Wang
  2. Jing Zhang
  3. Bo Du
  4. Minqiang Xu
  5. Lin Liu
  6. Dacheng Tao
  7. Liangpei Zhang
Citations (80)

Summary

  • The paper introduces a scalable method using SAM to efficiently annotate remote sensing images, significantly reducing manual labeling costs.
  • The SAMRS dataset comprises 105,090 images and 1,668,241 instances, offering annotations usable for semantic segmentation, instance segmentation, and object detection tasks.
  • Preliminary experiments indicate that pre-training with SAMRS improves RS segmentation performance, especially in data-sparse fine-tuning scenarios.

An Overview of SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model

The paper "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model" addresses a significant challenge in the field of remote sensing (RS)—the creation and scaling of segmentation datasets. The primary obstacle in developing these datasets is the labor-intensive and expensive process of annotating RS images at the pixel level. This paper leverages the distinct advantages of the Segment Anything Model (SAM) in conjunction with existing RS object detection datasets to propose an efficient methodology for generating large-scale RS segmentation datasets.

Data-centric Approach with SAM

The success of SAM highlights the value of data-centric strategies in machine learning. SAM exhibits strong zero-shot segmentation capability, handling RS images well despite how much they differ from the natural images it was trained on. This observation drives the core idea of the paper: prompting SAM with the annotations of existing RS object detection datasets to generate pixel-level labels, thereby constructing a robust segmentation dataset, named SAMRS, at low cost.
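
The box-prompted annotation idea can be sketched as follows. This is a minimal illustration, not the authors' implementation: `sam_predict_mask` is a hypothetical stub standing in for a real SAM box-prompted prediction (e.g., via the segment-anything library), so that the surrounding pipeline logic can be shown end to end.

```python
import numpy as np

def sam_predict_mask(image, box):
    """Hypothetical stand-in for a SAM box-prompted prediction.

    A real pipeline would run SAM on the image with the box as a prompt;
    here we simply fill the box region so the surrounding logic is runnable.
    """
    mask = np.zeros(image.shape[:2], dtype=bool)
    x0, y0, x1, y1 = box
    mask[y0:y1, x0:x1] = True
    return mask

def annotate_with_boxes(image, boxes, labels):
    """Convert detection boxes into segmentation labels via per-box prompts.

    Each box yields one instance mask; masks are also rasterized into a
    semantic map, where later instances overwrite earlier ones on overlap
    and class 0 is reserved for background.
    """
    semantic = np.zeros(image.shape[:2], dtype=np.int32)
    instance_masks = []
    for box, label in zip(boxes, labels):
        mask = sam_predict_mask(image, box)
        semantic[mask] = label
        instance_masks.append(mask)
    return semantic, instance_masks
```

In the actual paper, the prompts come from existing detection annotations, so no new manual labeling is required to obtain the masks.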

The SAMRS Dataset

SAMRS comprises 105,090 images and 1,668,241 instances, surpassing existing high-resolution RS segmentation datasets in size by several orders of magnitude. The dataset retains critical object-level details (category, location, and instance information), facilitating its use across semantic segmentation, instance segmentation, and object detection tasks.
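
Because each object carries a mask plus a category, a single set of instance annotations can serve all three tasks. The sketch below, a simplification under assumed list-of-masks inputs rather than the dataset's actual storage format, derives a semantic map and axis-aligned detection boxes from per-instance masks.

```python
import numpy as np

def instances_to_task_labels(inst_masks, categories, shape):
    """Derive semantic-segmentation and detection labels from instance masks.

    inst_masks: list of boolean HxW arrays, one per object instance.
    categories: integer class ids aligned with inst_masks (0 = background).
    Returns a semantic map and (x0, y0, x1, y1) boxes recovered from
    each mask's extent.
    """
    semantic = np.zeros(shape, dtype=np.int32)
    boxes = []
    for mask, cat in zip(inst_masks, categories):
        semantic[mask] = cat
        ys, xs = np.nonzero(mask)
        boxes.append((int(xs.min()), int(ys.min()),
                      int(xs.max()) + 1, int(ys.max()) + 1))
    return semantic, boxes
```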

The authors analyzed several prompt types to find the configurations best suited to applying SAM to RS annotation. The resulting SAMRS dataset offers more diverse category representation and finer annotations than existing benchmarks. The ability to multiply the dataset's volume without increasing manual annotation overhead, made possible by SAM, is a significant achievement.

Preliminary Experiments and Implications

Preliminary experiments in this paper demonstrate the utility of pre-training segmentation networks on SAMRS, particularly for addressing the task discrepancy that arises when weights are transferred from classification-pretrained models. This is important given the noted performance gaps when classification-adapted models are applied directly to RS segmentation tasks without task-specific pre-training.

The paper's analysis demonstrates that segmentation pre-training with SAMRS can mitigate the challenges posed by task discrepancies, unlocking enhanced performance in RS segmentation applications. Notably, the findings show the largest improvements when fine-tuning data is limited, indicating that pre-training is especially effective where annotated training data is sparse.
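
The transfer step in such a pre-train-then-fine-tune workflow can be sketched as a state-dict merge: parameters shared with the pretrained segmentation model are copied over, while task-specific layers keep their fresh initialization. The function name and dict-of-arrays representation are illustrative, not the paper's code.

```python
import numpy as np

def transfer_pretrained_weights(pretrained_state, target_state):
    """Initialize a fine-tuning model from segmentation-pretrained weights.

    Keys present in both state dicts with matching shapes (e.g., a shared
    encoder/decoder) are taken from the pretrained checkpoint; parameters
    that exist only in the target model, such as a new task head, keep
    their fresh initialization.
    """
    merged = {}
    for key, value in target_state.items():
        source = pretrained_state.get(key)
        if source is not None and source.shape == value.shape:
            merged[key] = source  # reuse pretrained parameter
        else:
            merged[key] = value   # keep fresh initialization
    return merged
```

The same pattern applies directly to framework state dicts (e.g., PyTorch `state_dict()` mappings of tensors).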

Future Directions

The paper presents several avenues for further research. Future work could involve evaluating larger models pretrained on SAMRS, which might offer insights into the scalable application of SAM for even broader RS tasks. Additionally, exploring the impact of SAMRS on instance segmentation and object detection will help ascertain its broader applicability and validate the model transferability hypotheses posited in this paper.

Moreover, since the SAMRS dataset demonstrates efficacy in mitigating the training data limitations inherent in RS segmentation, its methodology might apply to broader domains within Earth observation and beyond. Thus, the scalability of the technique to other specialized imaging disciplines could be a significant focal point for future investigations.

Conclusion

In sum, the paper delivers a pragmatic and effective method for scaling RS segmentation datasets via SAM application. It significantly contributes to reducing annotation costs and enhances dataset characteristics, which are pivotal in advancing RS image analysis. By addressing task-specific challenges through intelligent pre-training strategies, the proposed approach sets a new pathway for improving the performance and adaptability of segmentation models in remote sensing and allied fields.