Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents (2406.07089v1)

Published 11 Jun 2024 in cs.CV

Abstract: An increasing number of models have achieved great performance in remote sensing tasks with the recent development of LLMs and Visual LLMs (VLMs). However, these models are constrained to basic vision and language instruction-tuning tasks, facing challenges in complex remote sensing applications. Additionally, these models lack specialized expertise in professional domains. To address these limitations, we propose a LLM-driven remote sensing intelligent agent named RS-Agent. Firstly, RS-Agent is powered by a LLM that acts as its "Central Controller," enabling it to understand and respond to various problems intelligently. Secondly, our RS-Agent integrates many high-performance remote sensing image processing tools, facilitating multi-tool and multi-turn conversations. Thirdly, our RS-Agent can answer professional questions by leveraging robust knowledge documents. We conducted experiments using several datasets, e.g., RSSDIVCS, RSVQA, and DOTAv1. The experimental results demonstrate that our RS-Agent delivers outstanding performance in many tasks, i.e., scene classification, visual question answering, and object counting tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Wenjia Xu (26 papers)
  2. Zijian Yu (5 papers)
  3. Yixu Wang (38 papers)
  4. Jiuniu Wang (21 papers)
  5. Mugen Peng (82 papers)
Citations (5)
X Twitter Logo Streamline Icon: https://streamlinehq.com