
Distillation Enhanced Generative Retrieval (2402.10769v1)

Published 16 Feb 2024 in cs.CL, cs.AI, and cs.IR

Abstract: Generative retrieval is a promising new paradigm in text retrieval that generates identifier strings of relevant passages as the retrieval target. Distinct from traditional sparse or dense retrieval methods, this paradigm leverages powerful generative LLMs. In this work, we identify distillation as a viable direction for further enhancing generative retrieval and propose a framework named DGR. DGR employs a sophisticated ranking model, such as a cross-encoder, in a teacher role to supply a passage rank list, which captures the varying degrees of passage relevance rather than binary hard labels; DGR then optimizes the generative retrieval model with a specially designed distilled RankNet loss, taking the passage rank order provided by the teacher model as labels. The framework requires only an additional distillation step on top of current generative retrieval systems and adds no burden at the inference stage. We conduct experiments on four public datasets, and the results indicate that DGR achieves state-of-the-art performance among generative retrieval methods. Additionally, DGR demonstrates strong robustness and generalizability across various teacher models and distillation losses.
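
To make the distillation objective concrete, below is a minimal sketch of a RankNet-style pairwise loss over a teacher-ordered passage list, written in PyTorch. The abstract does not spell out DGR's exact "distilled RankNet" formulation, so this is an illustrative assumption, not the paper's implementation: `student_scores` is assumed to hold the student's score for each passage (e.g., the log-likelihood the generative model assigns to that passage's identifier string), listed in the teacher's rank order.

```python
import torch
import torch.nn.functional as F

def distilled_ranknet_loss(student_scores: torch.Tensor) -> torch.Tensor:
    """Pairwise RankNet-style loss over a teacher-ordered passage list.

    student_scores: shape (k,), the student model's score for each of the
    k passages, sorted by the teacher's ranking (index 0 = most relevant
    according to the teacher). For every pair (i, j) with i < j, the
    teacher prefers passage i, so the standard RankNet logistic loss
    log(1 + exp(s_j - s_i)) penalizes the student whenever it scores the
    lower-ranked passage higher.
    """
    # Pairwise score differences: diff[i, j] = s_j - s_i
    diff = student_scores.unsqueeze(0) - student_scores.unsqueeze(1)
    # Mask selecting pairs (i, j) with i ranked above j by the teacher
    k = student_scores.shape[0]
    upper = torch.triu(
        torch.ones(k, k, device=student_scores.device), diagonal=1
    ).bool()
    # softplus(x) = log(1 + exp(x)), the numerically stable RankNet term;
    # averaging keeps the loss scale independent of the list length k
    return F.softplus(diff[upper]).mean()
```

In training, the student scores might come from summing token log-probabilities of each candidate identifier under the seq2seq retrieval model; the teacher (e.g., a cross-encoder) is used only to order the list, so no teacher forward pass is needed at inference time, consistent with the abstract's claim.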

Authors (6)
  1. Yongqi Li (40 papers)
  2. Zhen Zhang (384 papers)
  3. Wenjie Wang (150 papers)
  4. Liqiang Nie (191 papers)
  5. Wenjie Li (183 papers)
  6. Tat-Seng Chua (359 papers)
Citations (1)