Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

UATVR: Uncertainty-Adaptive Text-Video Retrieval (2301.06309v2)

Published 16 Jan 2023 in cs.CV

Abstract: With the explosive growth of web videos and emerging large-scale vision-language pre-training models, e.g., CLIP, retrieving videos of interest with text instructions has attracted increasing attention. A common practice is to transfer text-video pairs to the same embedding space and craft cross-modal interactions with certain entities in specific granularities for semantic correspondence. Unfortunately, the intrinsic uncertainties of optimal entity combinations in appropriate granularities for cross-modal queries are understudied, which is especially critical for modalities with hierarchical semantics, e.g., video, text, etc. In this paper, we propose an Uncertainty-Adaptive Text-Video Retrieval approach, termed UATVR, which models each look-up as a distribution matching procedure. Concretely, we add additional learnable tokens in the encoders to adaptively aggregate multi-grained semantics for flexible high-level reasoning. In the refined embedding space, we represent text-video pairs as probabilistic distributions where prototypes are sampled for matching evaluation. Comprehensive experiments on four benchmarks justify the superiority of our UATVR, which achieves new state-of-the-art results on MSR-VTT (50.8%), VATEX (64.5%), MSVD (49.7%), and DiDeMo (45.8%). The code is available at https://github.com/bofang98/UATVR.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Bo Fang (26 papers)
  2. Wenhao Wu (71 papers)
  3. Chang Liu (863 papers)
  4. Yu Zhou (335 papers)
  5. Yuxin Song (21 papers)
  6. Weiping Wang (123 papers)
  7. Xiangbo Shu (39 papers)
  8. Xiangyang Ji (157 papers)
  9. Jingdong Wang (236 papers)
Citations (27)
Github Logo Streamline Icon: https://streamlinehq.com