Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi's domain adaptability (2409.00489v1)

Published 31 Aug 2024 in cs.CV and cs.AI

Abstract: Research on geospatial foundation models (GFMs) has become a trending topic in geospatial AI because of their potential for high generalizability and domain adaptability, which reduces model training costs for individual researchers. Unlike building LLMs such as ChatGPT, constructing visual foundation models for image analysis, particularly in remote sensing, has encountered significant challenges, such as formulating diverse vision tasks into a general problem framework. This paper evaluates the recently released NASA-IBM GFM Prithvi for its predictive performance on high-level image analysis tasks across multiple benchmark datasets. Prithvi was selected because it is one of the first open-source GFMs trained on time series of high-resolution remote sensing imagery. A series of experiments was designed to assess Prithvi's performance compared with other pre-trained, task-specific AI models in geospatial image analysis. New strategies, including band adaptation, multi-scale feature generation, and fine-tuning techniques, are introduced and integrated into an image analysis pipeline to enhance Prithvi's domain adaptation capability and improve model performance. In-depth analyses reveal Prithvi's strengths and weaknesses, offering insights for both improving Prithvi and developing future visual foundation models for geospatial tasks.
