Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EvalRS: a Rounded Evaluation of Recommender Systems (2207.05772v2)

Published 12 Jul 2022 in cs.IR

Abstract: Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces. However, research focused almost exclusively on the ability of RSs to produce accurate item rankings while giving little attention to the evaluation of RS behavior in real-world scenarios. Such narrow focus has limited the capacity of RSs to have a lasting impact in the real world and makes them vulnerable to undesired behavior, such as reinforcing data biases. We propose EvalRS as a new type of challenge, in order to foster this discussion among practitioners and build in the open new methodologies for testing RSs "in the wild".

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jacopo Tagliabue (34 papers)
  2. Federico Bianchi (47 papers)
  3. Tobias Schnabel (21 papers)
  4. Giuseppe Attanasio (21 papers)
  5. Ciro Greco (19 papers)
  6. Gabriel de Souza P. Moreira (9 papers)
  7. Patrick John Chia (9 papers)
Citations (13)