
Sparsifying Sparse Representations for Passage Retrieval by Top-$k$ Masking (2112.09628v1)

Published 17 Dec 2021 in cs.IR and cs.CL

Abstract: Sparse lexical representation learning has demonstrated much progress in improving passage retrieval effectiveness in recent models such as DeepImpact, uniCOIL, and SPLADE. This paper describes a straightforward yet effective approach for sparsifying lexical representations for passage retrieval, building on SPLADE by introducing a top-$k$ masking scheme to control sparsity and a self-learning method to coax masked representations to mimic unmasked representations. A basic implementation of our model is competitive with more sophisticated approaches and achieves a good balance between effectiveness and efficiency. The simplicity of our methods opens the door for future explorations in lexical representation learning for passage retrieval.
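The paper's exact formulation isn't reproduced on this page, but the core idea in the abstract can be illustrated with a short sketch: keep only the $k$ largest term weights in each lexical representation, and train the masked (sparse) representation to mimic the unmasked one. The function names (`topk_mask`, `self_learning_loss`), the choice of MSE as the distillation objective, and the value `k=256` below are illustrative assumptions, not the paper's method.

```python
import torch
import torch.nn.functional as F

def topk_mask(reps: torch.Tensor, k: int) -> torch.Tensor:
    """Zero out all but the k largest weights in each row.

    reps: (batch, vocab_size) term-weight vectors, e.g. SPLADE-style outputs.
    Returns a tensor of the same shape with at most k nonzeros per row.
    """
    # Find the indices of the k largest weights per representation.
    topk = reps.topk(k, dim=-1)
    # Build a 0/1 mask that keeps only those positions.
    mask = torch.zeros_like(reps)
    mask.scatter_(-1, topk.indices, 1.0)
    return reps * mask

def self_learning_loss(masked: torch.Tensor, unmasked: torch.Tensor) -> torch.Tensor:
    """One plausible 'self-learning' objective (an assumption here):
    push the masked representation toward the full one, detaching the
    unmasked side so it acts as a fixed teacher."""
    return F.mse_loss(masked, unmasked.detach())

# Usage: sparsify a batch of 30,522-dim (BERT vocabulary) term-weight vectors.
reps = torch.relu(torch.randn(2, 30522))   # stand-in for model outputs
sparse_reps = topk_mask(reps, k=256)       # keep the top 256 terms per passage
loss = self_learning_loss(sparse_reps, reps)
```

Because only $k$ terms survive per passage, the inverted index stays small and query latency drops, while the distillation term keeps the sparse vectors close to the full representations the model would otherwise produce.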

Authors (3)
  1. Jheng-Hong Yang (14 papers)
  2. Xueguang Ma (36 papers)
  3. Jimmy Lin (208 papers)
Citations (12)
