Learning-to-Rank with BERT in TF-Ranking (2004.08476v3)

Published 17 Apr 2020 in cs.IR

Abstract: This paper describes a machine learning algorithm for document (re)ranking in which queries and documents are first encoded using BERT [1], and a learning-to-rank (LTR) model constructed with TF-Ranking (TFR) [2] is then applied on top to further optimize ranking performance. The approach proves effective on the public MS MARCO benchmark [3]. Our first two submissions achieve the best performance for the passage re-ranking task [4] and the second-best performance for the passage full-ranking task as of April 10, 2020 [5]. To leverage recent developments in pre-trained language models, we have since integrated RoBERTa [6] and ELECTRA [7]. Our latest submissions improve on our previous state-of-the-art re-ranking performance by 4.3% [8] and achieve the third-best performance for the full-ranking task [9] as of June 8, 2020. Both results demonstrate the effectiveness of combining ranking losses with BERT representations for document ranking.
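The abstract names the idea (a ranking loss applied on top of BERT-derived scores) without implementation details. The following is a minimal sketch of that idea, assuming a listwise softmax cross-entropy loss as the ranking loss; the abstract does not say which loss is used, and the function names `softmax_ranking_loss` and `rerank` are hypothetical. The scores stand in for the output of a BERT-based query-passage scorer and are made-up values, not results from the paper. The sketch uses only the Python standard library and does not reproduce the TF-Ranking API.

```python
import math
from typing import List


def softmax_ranking_loss(scores: List[float], labels: List[float]) -> float:
    """Listwise softmax cross-entropy for one query's candidate list.

    scores: one relevance score per candidate (stand-ins for the output of a
            BERT-based scorer; hypothetical values in this sketch).
    labels: binary or graded relevance labels, same length as scores.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    label_sum = sum(labels)
    if label_sum == 0:
        return 0.0  # no relevant candidate: the list contributes no loss
    # Numerically stable log-softmax over the whole candidate list.
    max_score = max(scores)
    log_norm = max_score + math.log(sum(math.exp(s - max_score) for s in scores))
    log_probs = [s - log_norm for s in scores]
    # Cross-entropy between the normalized labels and the softmax distribution.
    return -sum((y / label_sum) * lp for y, lp in zip(labels, log_probs))


def rerank(passages: List[str], scores: List[float]) -> List[str]:
    """Order candidate passages by descending score."""
    order = sorted(range(len(passages)), key=lambda i: scores[i], reverse=True)
    return [passages[i] for i in order]


if __name__ == "__main__":
    passages = ["passage A", "passage B", "passage C"]
    labels = [0.0, 1.0, 0.0]   # passage B is the relevant one
    scores = [0.1, 0.9, 0.5]   # assumed scorer outputs
    print(softmax_ranking_loss(scores, labels))
    print(rerank(passages, scores))
```

Because the loss is computed over the whole candidate list for a query, it penalizes the scorer when a relevant passage is scored below irrelevant ones, which is what distinguishes a ranking loss from scoring each query-passage pair independently.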

Authors (4)

Shuguang Han (22 papers)
Xuanhui Wang (36 papers)
Mike Bendersky (5 papers)
Marc Najork (27 papers)

Citations (88)

