
Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision (2012.14862v2)

Published 29 Dec 2020 in cs.IR and cs.CL

Abstract: The effectiveness of Neural Information Retrieval (Neu-IR) often depends on a large scale of in-domain relevance training signals, which are not always available in real-world ranking scenarios. To democratize the benefits of Neu-IR, this paper presents MetaAdaptRank, a domain adaptive learning method that generalizes Neu-IR models from label-rich source domains to few-shot target domains. Drawing on source-domain massive relevance supervision, MetaAdaptRank contrastively synthesizes a large number of weak supervision signals for target domains and meta-learns to reweight these synthetic "weak" data based on their benefits to the target-domain ranking accuracy of Neu-IR models. Experiments on three TREC benchmarks in the web, news, and biomedical domains show that MetaAdaptRank significantly improves the few-shot ranking accuracy of Neu-IR models. Further analyses indicate that MetaAdaptRank thrives from both its contrastive weak data synthesis and meta-reweighted data selection. The code and data of this paper can be obtained from https://github.com/thunlp/MetaAdaptRank.
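The meta-reweighting idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it assumes a linear scorer, a pairwise logistic ranking loss, and a first-order gradient-agreement rule in which a synthetic weak pair is weighted by how well its gradient aligns with the gradient on a few labeled target pairs. The names `pairwise_loss_grad` and `meta_reweight` are hypothetical.

```python
import numpy as np

def pairwise_loss_grad(w, pos, neg):
    """Gradient w.r.t. w of log(1 + exp(-(w.pos - w.neg))) for one (pos, neg) pair."""
    diff = pos - neg
    margin = w @ diff
    return -diff / (1.0 + np.exp(margin))

def meta_reweight(w, weak_pairs, target_pairs):
    """Assumed reweighting rule: weight each weak pair by max(0, g_weak . g_target).

    A positive dot product means a gradient step on the weak pair would also
    reduce the loss on the few-shot target pairs (to first order).
    Returns weights normalized to sum to 1, or all zeros if no pair helps.
    """
    g_target = np.mean(
        [pairwise_loss_grad(w, p, n) for p, n in target_pairs], axis=0
    )
    raw = np.array(
        [max(0.0, pairwise_loss_grad(w, p, n) @ g_target) for p, n in weak_pairs]
    )
    total = raw.sum()
    return raw / total if total > 0 else raw

# Toy data: 2-dimensional feature vectors for (relevant, non-relevant) documents.
w = np.zeros(2)
target_pairs = [(np.array([1.0, 0.0]), np.array([0.0, 0.0]))]
weak_pairs = [
    (np.array([1.0, 0.0]), np.array([0.0, 0.0])),  # agrees with the target signal
    (np.array([0.0, 0.0]), np.array([1.0, 0.0])),  # contradicts the target signal
]
weights = meta_reweight(w, weak_pairs, target_pairs)
```

In this toy setup the weak pair that agrees with the target pair receives all of the weight and the contradicting pair receives zero, which is the kind of data selection behavior the abstract attributes to meta-reweighting.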

Authors (8)
  1. Si Sun (9 papers)
  2. Yingzhuo Qian (1 paper)
  3. Zhenghao Liu (77 papers)
  4. Chenyan Xiong (95 papers)
  5. Kaitao Zhang (4 papers)
  6. Jie Bao (40 papers)
  7. Zhiyuan Liu (433 papers)
  8. Paul Bennett (17 papers)
Citations (18)