Revisiting Sparse Retrieval for Few-shot Entity Linking (2310.12444v1)

Published 19 Oct 2023 in cs.CL

Abstract: Entity linking aims to link ambiguous mentions to their corresponding entities in a knowledge base. One of the key challenges comes from insufficient labeled data for specific domains. Although dense retrievers have achieved excellent performance on several benchmarks, their performance decreases significantly when only a limited amount of in-domain labeled data is available. In such a few-shot setting, we revisit the sparse retrieval method and propose an ELECTRA-based keyword extractor to denoise the mention context and construct a better query expression. To train the extractor, we propose a distant supervision method to automatically generate training data based on overlapping tokens between mention contexts and entity descriptions. Experimental results on the ZESHEL dataset demonstrate that the proposed method outperforms state-of-the-art models by a significant margin across all test domains, showing the effectiveness of keyword-enhanced sparse retrieval.
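
The abstract describes two pieces that lend themselves to a small sketch: the distant-supervision step, where tokens shared between a mention's context and its gold entity's description become silver keyword labels, and keyword-enhanced sparse retrieval, where those keywords form the query. The Python below is a minimal illustration under stated assumptions, not the authors' implementation: the tokenizer, the stopword list, and the inlined BM25 scorer (following Robertson and Zaragoza, ref. 14) are simplifications, and the paper trains an ELECTRA token classifier on these silver labels rather than using raw overlap at inference time; a production system would likely run retrieval through Anserini (ref. 19) instead of the toy scorer here.

```python
# Minimal sketch (not the authors' code) of distant-supervision keyword
# labeling plus keyword-enhanced BM25 retrieval. All names here are
# illustrative; tokenization and stopword handling are simplified.
import math
import re
from collections import Counter

STOPWORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "was", "for"}

def tokenize(text: str) -> list[str]:
    # Crude lowercase word tokenizer; the paper operates on subword tokens.
    return re.findall(r"[a-z0-9]+", text.lower())

def silver_keywords(mention_context: str, entity_description: str) -> list[str]:
    """Distant supervision: context tokens that also appear in the gold
    entity's description (minus stopwords) become positive keyword labels."""
    desc = set(tokenize(entity_description)) - STOPWORDS
    return [t for t in tokenize(mention_context) if t in desc]

def bm25_scores(query: list[str], docs: list[list[str]],
                k1: float = 1.2, b: float = 0.75) -> list[float]:
    """Plain BM25 over pre-tokenized documents (one score per document)."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    df = Counter(t for d in docs for t in set(d))  # document frequencies
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query:
            if t not in tf:
                continue
            idf = math.log((n - df[t] + 0.5) / (df[t] + 0.5) + 1.0)
            s += idf * tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores

# Toy usage: derive silver keywords from the gold entity, then use them as
# the sparse query over candidate entity descriptions.
context = "He drew his lightsaber and joined the Jedi Council on Coruscant."
entities = {
    "Jedi Council": "The Jedi High Council governed the Jedi Order from Coruscant.",
    "Galactic Senate": "The Senate was the legislature of the Galactic Republic.",
}
query = silver_keywords(context, entities["Jedi Council"])
docs = [tokenize(d) for d in entities.values()]
print(query, bm25_scores(query, docs))
```

On this toy input the silver keywords come out as jedi, council, and coruscant, and BM25 ranks the Jedi Council description first; in the actual few-shot setting the trained ELECTRA extractor supplies the keywords for mentions whose gold entity is unknown.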

References (19)
  1. Autoregressive entity retrieval. In International Conference on Learning Representations.
  2. ELECTRA: Pre-training text encoders as discriminators rather than generators. In International Conference on Learning Representations.
  3. Faithful to the document or to the world? Mitigating hallucinations via entity-linked knowledge in abstractive summarization. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 1067–1082, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
  4. SPLADE: Sparse lexical and expansion model for first stage ranking. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’21, pages 2288–2292, New York, NY, USA. Association for Computing Machinery.
  5. Heng Ji and Joel Nothman. 2016. Overview of TAC-KBP2016 tri-lingual EDL and its impact on end-to-end KBP. In Proceedings of the 2016 Text Analysis Conference, TAC 2016, Gaithersburg, Maryland, USA, November 14-15, 2016. NIST.
  6. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings.
  7. Phong Le and Ivan Titov. 2019. Distant learning for entity linking with automatic noise detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4081–4090, Florence, Italy. Association for Computational Linguistics.
  8. You don’t know my favorite color: Preventing dialogue representations from revealing speakers’ private personas. arXiv preprint arXiv:2205.10228.
  9. Effective few-shot named entity linking by meta-learning. In 2022 IEEE 38th International Conference on Data Engineering (ICDE), pages 178–191, Los Alamitos, CA, USA. IEEE Computer Society.
  10. Zero-shot entity linking by reading entity descriptions. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3449–3460, Florence, Italy. Association for Computational Linguistics.
  11. MuVER: Improving first-stage entity retrieval with multi-view entity representations. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 2617–2624, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
  12. Curriculum contrastive context denoising for few-shot conversational dense retrieval. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’22, pages 176–186, New York, NY, USA. Association for Computing Machinery.
  13. A thorough examination on zero-shot dense retrieval. CoRR, abs/2204.12755.
  14. Stephen Robertson and Hugo Zaragoza. 2009. The probabilistic relevance framework: BM25 and beyond. Found. Trends Inf. Retr., 3(4):333–389.
  15. A transformational biencoder with in-domain negative sampling for zero-shot entity linking. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1449–1458, Dublin, Ireland. Association for Computational Linguistics.
  16. CAT: A contextualized conceptualization and instantiation framework for commonsense reasoning. arXiv preprint arXiv:2305.04808.
  17. Scalable zero-shot entity linking with dense entity retrieval. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 6397–6407, Online. Association for Computational Linguistics.
  18. Prompting ELECTRA: Few-shot learning with discriminative pre-trained models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 11351–11361, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
  19. Anserini: Enabling the use of Lucene for information retrieval research. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’17, pages 1253–1256, New York, NY, USA. Association for Computing Machinery.
Authors (4)
  1. Yulin Chen (134 papers)
  2. Zhenran Xu (12 papers)
  3. Baotian Hu (67 papers)
  4. Min Zhang (630 papers)
Citations (1)