Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery (2102.03099v1)

Published 5 Feb 2021 in cs.CV

Abstract: Semantic segmentation for aerial platforms has been one of the fundamental scene understanding task for the earth observation. Most of the semantic segmentation research focused on scenes captured in nadir view, in which objects have relatively smaller scale variation compared with scenes captured in oblique view. The huge scale variation of objects in oblique images limits the performance of deep neural networks (DNN) that process images in a single scale fashion. In order to tackle the scale variation issue, in this paper, we propose the novel bidirectional multi-scale attention networks, which fuse features from multiple scales bidirectionally for more adaptive and effective feature extraction. The experiments are conducted on the UAVid2020 dataset and have shown the effectiveness of our method. Our model achieved the state-of-the-art (SOTA) result with a mean intersection over union (mIoU) score of 70.80%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Ye Lyu (5 papers)
  2. George Vosselman (23 papers)
  3. Gui-Song Xia (139 papers)
  4. Michael Ying Yang (70 papers)
Citations (10)

Summary

We haven't generated a summary for this paper yet.