
Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing (2404.11791v1)

Published 17 Apr 2024 in cs.IR

Abstract: The powerful generative abilities of LLMs show potential in generating relevance labels for search applications. Previous work has found that directly asking about relevancy, such as "How relevant is document A to query Q?", results in sub-optimal ranking. Instead, the pairwise ranking prompting (PRP) approach produces promising ranking performance by asking about pairwise comparisons, e.g., "Is document A more relevant than document B to query Q?". Thus, while LLMs are effective rankers, this ability is not reflected in the relevance labels they generate. In this work, we propose a post-processing method that consolidates the relevance labels generated by an LLM with its powerful ranking abilities. Our method takes both the LLM-generated relevance labels and its pairwise preferences as input. The labels are then altered to satisfy the pairwise preferences of the LLM while staying as close to the original values as possible. Our experimental results indicate that our approach effectively balances label accuracy and ranking performance, showing that it is possible to combine the ranking and labeling abilities of LLMs through post-processing.
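
To make the idea concrete, the following is a minimal sketch of such a post-processing step, not the paper's exact formulation: the adjusted labels minimize the squared distance to the original LLM labels, subject to one inequality constraint per LLM pairwise preference. The function name, the margin parameter, and the SLSQP solver choice are illustrative assumptions; the paper defines its own objective and solving procedure.

import numpy as np
from scipy.optimize import minimize

def consolidate(labels, preferences, margin=0.0):
    """Adjust LLM relevance labels to respect LLM pairwise preferences.

    labels: per-document relevance scores generated by the LLM.
    preferences: (i, j) pairs meaning the LLM judged document i more
        relevant than document j.
    Returns labels minimizing squared distance to the originals,
    subject to adjusted[i] >= adjusted[j] + margin for every pair.
    """
    labels = np.asarray(labels, dtype=float)
    objective = lambda x: np.sum((x - labels) ** 2)
    constraints = [
        {"type": "ineq", "fun": lambda x, i=i, j=j: x[i] - x[j] - margin}
        for i, j in preferences
    ]
    return minimize(objective, labels, constraints=constraints,
                    method="SLSQP").x

# The original labels rank document 2 above document 1, but the LLM's
# pairwise preference says document 1 is more relevant; the solver
# pulls both labels to 0.45, the nearest point satisfying the order.
print(consolidate([0.9, 0.4, 0.5], [(1, 2)]))  # -> [0.9, 0.45, 0.45]

In this toy run, only the two labels involved in a violated preference move, and they move symmetrically toward each other, which is the closest feasible point under a squared-error objective; labels already consistent with the preferences stay unchanged.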

Authors (7)
  1. Le Yan (28 papers)
  2. Zhen Qin (105 papers)
  3. Honglei Zhuang (31 papers)
  4. Rolf Jagerman (18 papers)
  5. Xuanhui Wang (36 papers)
  6. Michael Bendersky (63 papers)
  7. Harrie Oosterhuis (44 papers)
Citations (3)
