2000 character limit reached
Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval (2208.04232v1)
Published 8 Aug 2022 in cs.IR and cs.CL
Abstract: In this paper, we propose a new dense retrieval model which learns diverse document representations with deep query interactions. Our model encodes each document with a set of generated pseudo-queries to get query-informed, multi-view document representations. It not only enjoys high inference efficiency like the vanilla dual-encoder models, but also enables deep query-document interactions in document encoding and provides multi-faceted representations to better match different queries. Experiments on several benchmarks demonstrate the effectiveness of the proposed method, out-performing strong dual encoder baselines.The code is available at \url{https://github.com/jordane95/dual-cross-encoder
- Zehan Li (26 papers)
- Nan Yang (182 papers)
- Liang Wang (512 papers)
- Furu Wei (291 papers)