Reproducing NevIR: Negation in Neural Information Retrieval (2502.13506v4)

Published 19 Feb 2025 in cs.IR

Abstract: Negation is a fundamental aspect of human communication, yet it remains a challenge for LLMs (LMs) in Information Retrieval (IR). Despite the heavy reliance of modern neural IR systems on LMs, little attention has been given to their handling of negation. In this study, we reproduce and extend the findings of NevIR, a benchmark study that revealed most IR models perform at or below the level of random ranking when dealing with negation. We replicate NevIR's original experiments and evaluate newly developed state-of-the-art IR models. Our findings show that a recently emerging category-listwise LLM re-rankers-outperforms other models but still underperforms human performance. Additionally, we leverage ExcluIR, a benchmark dataset designed for exclusionary queries with extensive negation, to assess the generalisability of negation understanding. Our findings suggest that fine-tuning on one dataset does not reliably improve performance on the other, indicating notable differences in their data distributions. Furthermore, we observe that only cross-encoders and listwise LLM re-rankers achieve reasonable performance across both negation tasks.

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Paper Prompts

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Reproducing NevIR: Negation in Neural Information Retrieval (2502.13506v4)

Collections

Summary

Paper Prompts

Follow-up Questions

Authors (5)

Don't miss out on important new AI/ML research

Reproducing NevIR: Negation in Neural Information Retrieval (2502.13506v4)

Collections

Summary

Paper Prompts

Follow-up Questions

Related Papers

Authors (5)

Don't miss out on important new AI/ML research