Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review (2506.11343v1)

Published 12 Jun 2025 in cs.CL

Abstract: The advent of LLMs offers unprecedented opportunities to reimagine peer review beyond the constraints of traditional workflows. Despite these opportunities, prior efforts have largely focused on replicating traditional review workflows with LLMs serving as direct substitutes for human reviewers, while limited attention has been given to exploring new paradigms that fundamentally rethink how LLMs can participate in the academic review process. In this paper, we introduce and explore a novel mechanism that employs LLM agents to perform pairwise comparisons among manuscripts instead of individual scoring. By aggregating outcomes from substantial pairwise evaluations, this approach enables a more accurate and robust measure of relative manuscript quality. Our experiments demonstrate that this comparative approach significantly outperforms traditional rating-based methods in identifying high-impact papers. However, our analysis also reveals emergent biases in the selection process, notably a reduced novelty in research topics and an increased institutional imbalance. These findings highlight both the transformative potential of rethinking peer review with LLMs and critical challenges that future systems must address to ensure equity and diversity.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Yaohui Zhang (6 papers)
  2. Haijing Zhang (8 papers)
  3. Wenlong Ji (12 papers)
  4. Tianyu Hua (9 papers)
  5. Nick Haber (48 papers)
  6. Hancheng Cao (20 papers)
  7. Weixin Liang (33 papers)