Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning to Match for Multi-criteria Document Relevance (1409.6512v2)

Published 23 Sep 2014 in cs.IR

Abstract: In light of the tremendous amount of data produced by social media, a large body of research have revisited the relevance estimation of the users' generated content. Most of the studies have stressed the multidimensional nature of relevance and proved the effectiveness of combining the different criteria that it embodies. Traditional relevance estimates combination methods are often based on linear combination schemes. However, despite being effective, those aggregation mechanisms are not effective in real-life applications since they heavily rely on the non-realistic independence property of the relevance dimensions. In this paper, we propose to tackle this issue through the design of a novel fuzzy-based document ranking model. We also propose an automated methodology to capture the importance of relevance dimensions, as well as information about their interaction. This model, based on the Choquet Integral, allows to optimize the aggregated documents relevance scores using any target information retrieval relevance metric. Experiments within the TREC Microblog task and a social personalized information retrieval task highlighted that our model significantly outperforms a wide range of state-of-the-art aggregation operators, as well as a representative learning to rank methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Bilel Moulahi (1 paper)
  2. Lynda Tamine (10 papers)
  3. Sadok Ben Yahia (19 papers)