Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank (2305.16726v1)

Published 26 May 2023 in cs.CL and cs.AI

Abstract: Unsupervised sentence representation learning is one of the fundamental problems in natural language processing with various downstream applications. Recently, contrastive learning has been widely adopted which derives high-quality sentence representations by pulling similar semantics closer and pushing dissimilar ones away. However, these methods fail to capture the fine-grained ranking information among the sentences, where each sentence is only treated as either positive or negative. In many real-world scenarios, one needs to distinguish and rank the sentences based on their similarities to a query sentence, e.g., very relevant, moderate relevant, less relevant, irrelevant, etc. In this paper, we propose a novel approach, RankCSE, for unsupervised sentence representation learning, which incorporates ranking consistency and ranking distillation with contrastive learning into a unified framework. In particular, we learn semantically discriminative sentence representations by simultaneously ensuring ranking consistency between two representations with different dropout masks, and distilling listwise ranking knowledge from the teacher. An extensive set of experiments are conducted on both semantic textual similarity (STS) and transfer (TR) tasks. Experimental results demonstrate the superior performance of our approach over several state-of-the-art baselines.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Jiduan Liu (4 papers)
  2. Jiahao Liu (72 papers)
  3. Qifan Wang (129 papers)
  4. Jingang Wang (71 papers)
  5. Wei Wu (481 papers)
  6. Yunsen Xian (17 papers)
  7. Dongyan Zhao (144 papers)
  8. Kai Chen (512 papers)
  9. Rui Yan (250 papers)
Citations (28)