INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models (2402.14334v1)

Published 22 Feb 2024 in cs.CL

Abstract: Despite the critical need to align search targets with users' intentions, retrievers often only prioritize query information without delving into the users' intended search context. Enhancing the capability of retrievers to understand the intentions and preferences of users, akin to LLM instructions, has the potential to yield more aligned search targets. Prior studies restrict the application of instructions in information retrieval to a task description format, neglecting the broader context of diverse and evolving search scenarios. Furthermore, the prevailing benchmarks utilized for evaluation lack explicit tailoring to assess instruction-following ability, thereby hindering progress in this field. In response to these limitations, we propose a novel benchmark, INSTRUCTIR, specifically designed to evaluate instruction-following ability in information retrieval tasks. Our approach focuses on user-aligned instructions tailored to each query instance, reflecting the diverse characteristics inherent in real-world search scenarios. Through experimental analysis, we observe that retrievers fine-tuned to follow task-style instructions, such as INSTRUCTOR, can underperform compared to their non-instruction-tuned counterparts. This underscores potential overfitting issues inherent in constructing retrievers trained on existing instruction-aware retrieval datasets.
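
The benchmark's core setup is to pair each query with a user-aligned instruction and test whether the retriever's ranking changes accordingly. The sketch below illustrates that idea in miniature with an off-the-shelf dense retriever: encode the query with and without its instruction and compare which document each variant ranks first. This is a minimal sketch, not the authors' evaluation code; the sentence-transformers dependency, the model name, the toy corpus, and the rank helper are illustrative assumptions.

```python
# Minimal sketch (not the INSTRUCTIR evaluation code): check whether a dense
# retriever's ranking changes when a per-query instruction is supplied.
# Assumes the sentence-transformers package; the model name, toy corpus,
# query, and instruction below are illustrative placeholders.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

query = "best laptop for programming"
instruction = "I am a student on a tight budget looking for a refurbished machine."

corpus = [
    "Top-rated refurbished budget laptops for computer science students.",        # aligned with the instruction
    "This year's premium flagship laptops reviewed for professional developers.",  # relevant to the query alone
]

def rank(text: str) -> list[int]:
    """Return corpus indices ordered by cosine similarity to `text`."""
    q_emb = model.encode(text, normalize_embeddings=True)
    c_emb = model.encode(corpus, normalize_embeddings=True)
    sims = util.cos_sim(q_emb, c_emb)[0]
    return sims.argsort(descending=True).tolist()

print("query only:         ", rank(query))
print("instruction + query:", rank(f"{instruction} {query}"))
# A retriever that follows instructions should promote document 0 once the
# instruction is included; one that ignores instructions ranks both the same.
```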

References (29)
  1. Task-aware retrieval with instructions. In Findings of the ACL.
  2. Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and Psychological Measurement.
  3. AlpacaFarm: A simulation framework for methods that learn from human feedback. arXiv.
  4. LMentry: A language model benchmark of elementary language tasks. arXiv.
  5. Unsupervised dense information retrieval with contrastive learning. Trans. Mach. Learn. Res.
  6. FollowBench: A multi-level fine-grained constraints following benchmark for large language models. arXiv.
  7. J. Richard Landis and Gary G. Koch. 1977. The measurement of observer agreement for categorical data. Biometrics.
  8. Holistic evaluation of language models. arXiv.
  9. Query rewriting for retrieval-augmented large language models. arXiv.
  10. Fine-tuning LLaMA for multi-stage text retrieval. arXiv.
  11. MTEB: Massive text embedding benchmark. In EACL.
  12. MS MARCO: A human-generated machine reading comprehension dataset.
  13. Large dual encoders are generalizable retrievers. In EMNLP.
  14. KTRL+F: Knowledge-augmented in-document search. arXiv.
  15. OpenAI. 2023. GPT-4 technical report.
  16. Training language models to follow instructions with human feedback. arXiv.
  17. Justus J. Randolph. 2005. Free-marginal multirater kappa (multirater k[free]): An alternative to Fleiss' fixed-marginal multirater kappa.
  18. The probabilistic relevance framework: BM25 and beyond. Foundations and Trends® in Information Retrieval, 3(4):333–389.
  19. ColBERTv2: Effective and efficient retrieval via lightweight late interaction. In NAACL.
  20. One embedder, any task: Instruction-finetuned text embeddings. arXiv.
  21. BEIR: A heterogeneous benchmark for zero-shot evaluation of information retrieval models. arXiv.
  22. Improving text embeddings with large language models. arXiv.
  23. Self-Instruct: Aligning language models with self-generated instructions. arXiv.
  24. Super-NaturalInstructions: Generalization via declarative instructions on 1600+ NLP tasks. arXiv.
  25. UniIR: Training and benchmarking universal multimodal information retrievers. arXiv.
  26. FLASK: Fine-grained language model evaluation based on alignment skill sets. arXiv.
  27. Instruction tuning for large language models: A survey. arXiv.
  28. RoMQA: A benchmark for robust, multi-evidence, multi-answer question answering. arXiv.
  29. Instruction-following evaluation for large language models. arXiv.
Authors (7)
  1. Hanseok Oh (8 papers)
  2. Hyunji Lee (19 papers)
  3. Seonghyeon Ye (25 papers)
  4. Haebin Shin (6 papers)
  5. Hansol Jang (5 papers)
  6. Changwook Jun (4 papers)
  7. Minjoon Seo (82 papers)
Citations (13)