Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Online and Offline Evaluation in Search Clarification (2403.09180v1)

Published 14 Mar 2024 in cs.IR

Abstract: The effectiveness of clarification question models in engaging users within search systems is currently constrained, casting doubt on their overall usefulness. To improve the performance of these models, it is crucial to employ assessment approaches that encompass both real-time feedback from users (online evaluation) and the characteristics of clarification questions evaluated through human assessment (offline evaluation). However, the relationship between online and offline evaluations has been debated in information retrieval. This study aims to investigate how this discordance holds in search clarification. We use user engagement as ground truth and employ several offline labels to investigate to what extent the offline ranked lists of clarification resemble the ideal ranked lists based on online user engagement.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Leila Tavakoli (3 papers)
  2. Johanne R. Trippas (15 papers)
  3. Hamed Zamani (88 papers)
  4. Falk Scholer (18 papers)
  5. Mark Sanderson (27 papers)