
Track Anything Rapter (TAR) (2405.11655v2)

Published 19 May 2024 in cs.CV, cs.AI, and cs.RO

Abstract: Object tracking is a fundamental task in computer vision with broad practical applications, including traffic monitoring, robotics, and autonomous vehicle tracking. In this project, we aim to develop an aerial vehicle system, Track Anything Rapter (TAR), designed to detect, segment, and track objects of interest based on user-provided multimodal queries such as text, images, and clicks. TAR uses pre-trained models such as DINO, CLIP, and SAM to estimate the relative pose of the queried object. The tracking problem is approached as a visual servoing task, enabling the UAV to remain focused on the object through motion planning and control algorithms. We show that integrating these foundational models with a custom high-level control algorithm results in a stable and precise tracking system, deployed on a custom-built PX4 Autopilot-enabled Voxl2 M500 drone. To validate the tracking algorithm's performance, we compare it against Vicon-based ground truth. Additionally, we evaluate the reliability of the foundational models in aiding tracking in scenarios involving occlusions. Finally, we test and validate the model's ability to work with multiple modalities, such as click, bounding box, and image templates.
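
The visual servoing formulation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' controller: it is a generic proportional image-based servo loop, written under the assumption that a detector or segmenter already supplies a bounding box for the queried object. All names (`BoundingBox`, `VelocityCommand`, `servo_step`), the gains, and the target area fraction are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class BoundingBox:
    """Axis-aligned box in pixel coordinates (image y grows downward)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


@dataclass
class VelocityCommand:
    """Hypothetical body-frame command; sign conventions are assumptions."""
    forward: float   # m/s, positive moves toward the target
    vertical: float  # m/s, positive climbs
    yaw_rate: float  # rad/s, positive turns toward the right of the image


def clamp(value: float, limit: float) -> float:
    """Saturate value to the interval [-limit, limit]."""
    return max(-limit, min(limit, value))


def servo_step(
    box: BoundingBox,
    image_width: int,
    image_height: int,
    target_area_fraction: float = 0.05,  # assumed desired apparent size
    k_yaw: float = 1.0,                  # illustrative gains, not tuned values
    k_vertical: float = 0.5,
    k_forward: float = 2.0,
    max_speed: float = 1.0,
    max_yaw_rate: float = 1.0,
) -> VelocityCommand:
    """One proportional step that tries to center the box and hold its size."""
    center_x = 0.5 * (box.x_min + box.x_max)
    center_y = 0.5 * (box.y_min + box.y_max)

    # Normalized image-plane errors in [-1, 1].
    # err_x > 0: target is right of the image center.
    # err_y > 0: target is above the image center.
    err_x = (center_x - image_width / 2) / (image_width / 2)
    err_y = (image_height / 2 - center_y) / (image_height / 2)

    # Apparent size serves as a crude proxy for distance.
    box_area = (box.x_max - box.x_min) * (box.y_max - box.y_min)
    area_fraction = box_area / (image_width * image_height)
    err_area = target_area_fraction - area_fraction  # > 0: target looks too small

    return VelocityCommand(
        forward=clamp(k_forward * err_area, max_speed),
        vertical=clamp(k_vertical * err_y, max_speed),
        yaw_rate=clamp(k_yaw * err_x, max_yaw_rate),
    )
```

In a real system the returned command would be forwarded to the autopilot at a fixed rate and the box would come from the perception stack; both steps are outside the scope of this sketch.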

