Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unified Single-Stage Transformer Network for Efficient RGB-T Tracking (2308.13764v1)

Published 26 Aug 2023 in cs.CV and cs.AI

Abstract: Most existing RGB-T tracking networks extract modality features in a separate manner, which lacks interaction and mutual guidance between modalities. This limits the network's ability to adapt to the diverse dual-modality appearances of targets and the dynamic relationships between the modalities. Additionally, the three-stage fusion tracking paradigm followed by these networks significantly restricts the tracking speed. To overcome these problems, we propose a unified single-stage Transformer RGB-T tracking network, namely USTrack, which unifies the above three stages into a single ViT (Vision Transformer) backbone with a dual embedding layer through self-attention mechanism. With this structure, the network can extract fusion features of the template and search region under the mutual interaction of modalities. Simultaneously, relation modeling is performed between these features, efficiently obtaining the search region fusion features with better target-background discriminability for prediction. Furthermore, we introduce a novel feature selection mechanism based on modality reliability to mitigate the influence of invalid modalities for prediction, further improving the tracking performance. Extensive experiments on three popular RGB-T tracking benchmarks demonstrate that our method achieves new state-of-the-art performance while maintaining the fastest inference speed 84.2FPS. In particular, MPR/MSR on the short-term and long-term subsets of VTUAV dataset increased by 11.1$\%$/11.7$\%$ and 11.3$\%$/9.7$\%$.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Jianqiang Xia (2 papers)
  2. DianXi Shi (4 papers)
  3. Ke Song (9 papers)
  4. Linna Song (1 paper)
  5. Songchang Jin (2 papers)
  6. Li Zhou (216 papers)
  7. Yu Cheng (354 papers)
  8. Lei Jin (73 papers)
  9. Zheng Zhu (200 papers)
  10. Jianan Li (88 papers)
  11. Gang Wang (407 papers)
  12. Junliang Xing (80 papers)
  13. Jian Zhao (218 papers)
  14. Xiaolei Wang (44 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.