GQE: Generalized Query Expansion for Enhanced Text-Video Retrieval (2408.07249v1)

Published 14 Aug 2024 in cs.CV and cs.IR

Abstract: In the rapidly expanding domain of web video content, the task of text-video retrieval has become increasingly critical, bridging the semantic gap between textual queries and video data. This paper introduces a novel data-centric approach, Generalized Query Expansion (GQE), to address the inherent information imbalance between text and video, enhancing the effectiveness of text-video retrieval systems. Unlike traditional model-centric methods that focus on designing intricate cross-modal interaction mechanisms, GQE aims to expand the text queries associated with videos both during training and testing phases. By adaptively segmenting videos into short clips and employing zero-shot captioning, GQE enriches the training dataset with comprehensive scene descriptions, effectively bridging the data imbalance gap. Furthermore, during retrieval, GQE utilizes LLMs (LLM) to generate a diverse set of queries and a query selection module to filter these queries based on relevance and diversity, thus optimizing retrieval performance while reducing computational overhead. Our contributions include a detailed examination of the information imbalance challenge, a novel approach to query expansion in video-text datasets, and the introduction of a query selection strategy that enhances retrieval accuracy without increasing computational costs. GQE achieves state-of-the-art performance on several benchmarks, including MSR-VTT, MSVD, LSMDC, and VATEX, demonstrating the effectiveness of addressing text-video retrieval from a data-centric perspective.

Authors (7)

Zechen Bai (17 papers)
Tianjun Xiao (20 papers)
Tong He (124 papers)
Pichao Wang (65 papers)
Zheng Zhang (488 papers)
Thomas Brox (134 papers)
Mike Zheng Shou (165 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GQE: Generalized Query Expansion for Enhanced Text-Video Retrieval (2408.07249v1)

Summary

Related Papers