Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization (2401.05971v1)

Published 11 Jan 2024 in cs.CV

Abstract: Despite significant progress in global localization of Unmanned Aerial Vehicles (UAVs) in GPS-denied environments, existing methods remain constrained by the availability of datasets. Current datasets often focus on small-scale scenes and lack viewpoint variability, accurate ground truth (GT) pose, and UAV build-in sensor data. To address these limitations, we introduce a large-scale 6-DoF UAV dataset for localization (UAVD4L) and develop a two-stage 6-DoF localization pipeline (UAVLoc), which consists of offline synthetic data generation and online visual localization. Additionally, based on the 6-DoF estimator, we design a hierarchical system for tracking ground target in 3D space. Experimental results on the new dataset demonstrate the effectiveness of the proposed approach. Code and dataset are available at https://github.com/RingoWRW/UAVD4L

Definition Search Book Streamline Icon: https://streamlinehq.com
References (53)
  1. Survey of computer vision algorithms and applications for unmanned aerial vehicles. Expert Sys. Appl., 2018.
  2. Netvlad: Cnn architecture for weakly supervised place recognition. In CVPR, 2016.
  3. Survey on uav navigation in gps denied environments. In SCOPES, 2016.
  4. Are local features all you need for cross-domain visual place recognition? In CVPR, 2023.
  5. A convolutional neural network approach for assisting avalanche search and rescue operations with uav imagery. Remote Sens., 2017.
  6. Uav localization using autoencoded satellite images. IEEE Robot. Autom., 2021.
  7. Review of target geo-location algorithms for aerial remote sensing cameras without control points. Appl. Sci., 2022.
  8. Real-time geo-localization using satellite imagery and topography for unmanned aerial vehicles. In IROS, 2021.
  9. Brm localization: Uav localization in gnss-denied environments based on matching of numerical map and uav images. In IROS, 2020.
  10. Alto: A large-scale dataset for uav visual place recognition and localization. arXiv preprint, 2022.
  11. A review on absolute visual localization for uav. Rob. Autom. Syst., 2021.
  12. Lowe David. Distinctive image features from scale-invariant keypoints. IJCV, 2004.
  13. Superpoint: Self-supervised interest point detection and description. In CVPR, 2018.
  14. Revisiting the p3p problem. 2023.
  15. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 1981.
  16. Self-supervising fine-grained region similarities for large-scale image localization. In ECCV, 2020.
  17. Gps-denied uav localization using pre-existing satellite imagery. In ICRA, 2019.
  18. End-to-end learning of deep visual representations for image retrieval. IJCV, 2017.
  19. 3d mapping efficacy of a drone and terrestrial laser scanner over a temperate beach-dune zone. Geomorphology, 2019.
  20. Vision-based gnss-free localization for uavs in the wild. In ICMERR, 2022.
  21. High-efficiency determination of coastline by combination of tidal level and coastal zone dem from uav tilt photogrammetry. Remote Sens., 2020.
  22. Survey on computer vision for uavs: Current developments and trends. J. Intell. Rob. Syst. Theor. Appl., 2017.
  23. Gnss-denied geolocalization of uavs by visual matching of onboard camera images with orthophotos. In ICRA, 2021.
  24. Lsvl: Large-scale season-invariant visual localization for uavs. arXiv preprint, 2022.
  25. Autonomous underwater survey apparatus and system, 2021. US Patent 11,072,405.
  26. Epnp: An accurate o(n) solution to the pnp problem. IJCV, 2009.
  27. Uav localization in low-altitude gnss-denied environments based on poi and store signage text matching in uav images. Drones, 2023.
  28. A survey on vision-based uav navigation. Geo-Spatial Inf. Sci., 2018.
  29. A multi-stage multi-task neural network for aerial scene interpretation and geolocalization. arXiv preprint, 2018.
  30. Lens: Localization enhanced by nerf synthesis. In Proc. Mach. Learn. Res., 2022.
  31. Assisting uav localization via deep contextual image matching. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., 2021.
  32. Aerial imagery registration using deep learning for uav geolocalization. In CVPRW, 2018.
  33. Drone mapping of damage information in gps-denied disaster sites. Adv. Eng. Inf., 2022.
  34. Meshloc: Mesh-based visual localization. In ECCV, 2022.
  35. Visual localization with google earth images for robust global pose estimation of uavs. In ICRA, 2020.
  36. From coarse to fine: Robust hierarchical localization at large scale. In CVPR, 2019.
  37. Superglue: Learning feature matching with graph neural networks. In CVPR, 2020.
  38. Back to the feature: Learning robust camera localization from pixels to pose. In CVPR, 2021.
  39. Lamar: Benchmarking localization and mapping for augmented reality. In ECCV, 2022.
  40. Uav pose estimation using cross-view geolocalization with satellite imagery. In ICRA, 2019.
  41. Multipurpose uav for search and rescue operations in mountain avalanche events. Geomatics Nat. Hazards Risk, 2017.
  42. Loftr: Detector-free local feature matching with transformers. In CVPR, 2021.
  43. Inloc: Indoor visual localization with dense matching and view synthesis. In CVPR, 2018.
  44. Coarse-to-fine uav image geo-localization using multi-stage lucas-kanade networks. In ICTC, 2021.
  45. Vision-based uavs aerial image localization: A survey. In SIGSPATIAL, 2018.
  46. Uav image geo-localization by point-line-patch feature matching and iclk optimization. In Int. Conf. Geoinformatics, 2022.
  47. Crossloc: Scalable aerial localization assisted by multimodal synthetic data. In CVPR, 2022.
  48. Long-term visual localization with mobile sensors. In CVPR, 2023.
  49. i3dloc: Image-to-range cross-domain localization robust to inconsistent environmental conditions. arXiv preprint, 2021.
  50. isimloc: Visual global localization for previously unseen environments with simulated images. IEEE Trans. Rob., 2023.
  51. Rendernet: Visual relocalization using virtual viewpoints in large-scale indoor environments. arXiv preprint, 2022.
  52. Reference pose generation for long-term visual localization via learned features and view synthesis. IJCV, 2021.
  53. University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In MM, 2020.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Rouwan Wu (2 papers)
  2. Xiaoya Cheng (4 papers)
  3. Juelin Zhu (4 papers)
  4. Xuxiang Liu (1 paper)
  5. Maojun Zhang (25 papers)
  6. Shen Yan (47 papers)
Citations (3)

Summary

  • The paper presents UAVD4L, a dataset offering precise 6-DoF ground truth through synthetic image generation and a two-stage localization pipeline.
  • The methodology integrates offline synthetic data creation with IMU-guided online visual localization to establish robust 2D-3D correspondences.
  • The hierarchical 3D tracking system further refines localization, enhancing accuracy for critical applications like surveillance and search-and-rescue.

Introduction to UAVD4L

The reliance on GNSS systems for Unmanned Aerial Vehicles (UAVs) localization in environments where GPS signals are unavailable presents a challenge for various applications. Traditional datasets available for UAV localization are often limited in scope and accuracy, inhibiting progress in GPS-denied areas. A new dataset, UAVD4L, develops a comprehensive solution to overcome these challenges.

Tackling Limitations in Existing Datasets

UAVD4L overcomes the limitations of small-scale and limited viewpoint datasets by incorporating a textured 3D reference model and synthetic data such as rendered RGB and depth images. This approach allows for greater variability in viewpoints and more accurate 6-DoF ground truth poses without relying on potentially inaccurate built-in sensor data.

The Two-Stage 6-DoF Localization Pipeline

The UAVD4L dataset introduces a two-stage localization pipeline named UAVLoc. The first stage involves offline synthetic data generation to cover various viewpoints and conditions. The second stage conducts an online visual localization process where rotation information from the UAV's own Inertial Measurement Unit (IMU) narrows the search for relevant reference images. Next, feature matches establish 2D-3D correspondences which enable a gravity-guided PnP RANSAC to estimate the UAV's camera pose accurately.

Hierarchical 3D Target Tracking System

Apart from the dataset, a novel hierarchical target tracking system has been designed to capitalize on 6-DoF localization results. The system uses two lenses to pinpoint the exact location of objects on the ground, projecting their position from 2D images to absolute coordinates on a 3D map. This system could significantly enhance tracking accuracy in applications such as surveillance and search-and-rescue operations.

Conclusion and Future Work

UAVD4L's large-scale dataset for UAV 6-DoF localization represents an important step in advancing UAV technologies in GPS-denied environments. It provides a valuable benchmark for research, offering a dataset that includes a diverse range of urban and rural scenes, and accurate ground truth pose estimations. While the UAVD4L has achieved considerable accuracy, future work aims to include more challenging lighting conditions to extend the dataset's applicability. The provision of open-source code and data ensures that researchers and practitioners can continue to build on this promising foundation.

Github Logo Streamline Icon: https://streamlinehq.com

GitHub

X Twitter Logo Streamline Icon: https://streamlinehq.com