Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generating Diverse Criteria On-the-Fly to Improve Point-wise LLM Rankers (2404.11960v2)

Published 18 Apr 2024 in cs.IR and cs.AI

Abstract: The most recent pointwise LLM rankers have achieved remarkable ranking results. However, these rankers are hindered by two major drawbacks: (1) they fail to follow a standardized comparison guidance during the ranking process, and (2) they struggle with comprehensive considerations when dealing with complicated passages. To address these shortcomings, we propose to build a ranker that generates ranking scores based on a set of criteria from various perspectives. These criteria are intended to direct each perspective in providing a distinct yet synergistic evaluation. Our research, which examines eight datasets from the BEIR benchmark demonstrates that incorporating this multi-perspective criteria ensemble approach markedly enhanced the performance of pointwise LLM rankers.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Fang Guo (12 papers)
  2. Wenyu Li (19 papers)
  3. Honglei Zhuang (31 papers)
  4. Yun Luo (33 papers)
  5. Yafu Li (26 papers)
  6. Le Yan (28 papers)
  7. Yue Zhang (620 papers)
  8. Qi Zhu (160 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.