LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation (2110.08733v6)

Published 17 Oct 2021 in cs.CV

Abstract: Deep learning approaches have shown promising results in remote sensing high spatial resolution (HSR) land-cover mapping. However, urban and rural scenes can show completely different geographical landscapes, and the inadequate generalizability of these algorithms hinders city-level or national-level mapping. Most of the existing HSR land-cover datasets mainly promote the research of learning semantic representation, thereby ignoring the model transferability. In this paper, we introduce the Land-cOVEr Domain Adaptive semantic segmentation (LoveDA) dataset to advance semantic and transferable learning. The LoveDA dataset contains 5987 HSR images with 166768 annotated objects from three different cities. Compared to the existing datasets, the LoveDA dataset encompasses two domains (urban and rural), which brings considerable challenges due to the: 1) multi-scale objects; 2) complex background samples; and 3) inconsistent class distributions. The LoveDA dataset is suitable for both land-cover semantic segmentation and unsupervised domain adaptation (UDA) tasks. Accordingly, we benchmarked the LoveDA dataset on eleven semantic segmentation methods and eight UDA methods. Some exploratory studies including multi-scale architectures and strategies, additional background supervision, and pseudo-label analysis were also carried out to address these challenges. The code and data are available at https://github.com/Junjue-Wang/LoveDA.

Authors (5)

Junjue Wang (13 papers)
Zhuo Zheng (18 papers)
Ailong Ma (9 papers)
Xiaoyan Lu (9 papers)
Yanfei Zhong (28 papers)

Citations (268)

View on Semantic Scholar

Summary

The paper introduces the LoveDA dataset with 5,987 HSR images and 166,768 annotations spanning urban and rural domains.
It evaluates semantic segmentation and UDA methods, showing that multi-scale architectures like HRNet significantly improve performance.
The findings highlight the potential for developing robust, domain-adaptive models to enhance land-cover mapping in diverse geographical settings.

Insightful Overview of the LoveDA Dataset for Remote Sensing

The paper "LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation" introduces an innovative dataset designed to advance the field of high spatial resolution (HSR) remote sensing land-cover mapping. The authors present LoveDA as a distinct contribution, focusing on semantic segmentation and unsupervised domain adaptation (UDA).

Remote sensing technology, particularly HSR, plays a crucial role in understanding geographical and ecological environments by determining land-cover types at every image pixel. However, existing datasets in this domain have been limited to promoting semantic representation learning while neglecting model transferability across different geographical landscapes like urban and rural areas. LoveDA addresses this gap by providing a dataset that encompasses multiple scales, complex backgrounds, and varying class distributions, crucial for enhancing generalization capabilities in land-cover models.

Key Contributions

LoveDA Dataset: The LoveDA dataset consists of 5987 high spatial resolution images, entailing 166768 annotated objects sourced from three cities in China. The dataset is unique as it spans two distinct domains — urban and rural. This dual-domain focus presents considerable challenges such as multi-scale objects, complex background noise, and inconsistent class distributions.
Task Versatility: LoveDA is positioned to support both semantic segmentation and UDA tasks, which distinguishes it from existing datasets that prioritize semantic segmentation without considering adaptability across different geographic styles.
Benchmarking and Analysis: The authors evaluate the dataset against eleven semantic segmentation methods and eight UDA methods. They explore how different multi-scale strategies, additional background supervision techniques, and pseudo-label analysis can address the inherent challenges of the dataset.

Numerical Results and Claims

The authors benchmark various state-of-the-art architectures, including UNet, DeepLabV3+, HRNet, and others, revealing that multi-scale architectures like HRNet outperform others due to their sophisticated design, effectively capturing diverse scales of land-cover features. Furthermore, the paper indicates that semantic segmentation models heavily rely on multi-scale training and testing augmentations to enhance performance across LoveDA’s diversified scenes.

In the context of UDA, the paper underscores the difficulty yet necessity of adapting models across domains. Traditional adversarial methods underperform due to the stark intra-class variance in rural versus urban settings. Specifically, self-training methods like CBST and IAST display promising results by leveraging pseudo-labels for better class distribution balance.

Implications and Future Directions

The LoveDA dataset sets the foundation for developing robust domain-adaptive models capable of handling the variability in geographical environments on a large-scale. The primary implication is its potential to fuel research in creating generalized AI models for land-cover mapping, crucial for urban planning, agriculture monitoring, and environment management.

From a theoretical standpoint, the dataset challenges researchers to innovate adaptive learning models that can efficiently navigate the discrepancies in domain-specific data. The contrasting geographical styles and class distributions present an opportunity to develop algorithms that can seamlessly transition models across diverse landscapes.

Future research directions could include integrating more sophisticated attention mechanisms and architectural innovations to further enhance model adaptability. Additionally, the impact of alternative unsupervised learning techniques and domain generalization frameworks might offer breakthroughs in achieving seamless cross-domain model performance.

In conclusion, LoveDA represents a significant step towards addressing model transferability in remote sensing. By focusing on both semantic segmentation and UDA, the dataset not only addresses current gaps in the field but also encourages the development of new methodologies for handling diverse terrestrial environments.

PDF Markdown