Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables (2211.03616v1)

Published 7 Nov 2022 in cs.CL and cs.AI

Abstract: Recently, discrete latent variable models have received a surge of interest in both NLP and Computer Vision (CV), attributed to performance comparable to that of their continuous counterparts in representation learning while being more interpretable in their predictions. In this paper, we develop a topic-informed discrete latent variable model for semantic textual similarity, which learns a shared latent space for sentence-pair representation via vector quantization. Compared with previous models limited to local semantic contexts, our model can explore richer semantic information via topic modeling. We further boost the performance of semantic similarity by injecting the quantized representation into a transformer-based language model with a well-designed semantic-driven attention mechanism. We demonstrate, through extensive experiments across various English language datasets, that our model is able to surpass several strong neural baselines in semantic textual similarity tasks.
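
The abstract does not detail the quantization step, but the "shared latent space ... via vector quantization" it mentions is typically realized with a learned codebook and a straight-through gradient estimator. The following is a minimal sketch under that assumption; the codebook size, embedding dimension, and loss weighting are illustrative choices, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class VectorQuantizer(nn.Module):
    """Sketch of a VQ-VAE-style codebook for quantizing a continuous
    sentence-pair representation into a discrete latent code."""

    def __init__(self, num_codes: int = 512, dim: int = 256, beta: float = 0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta  # commitment-loss weight (assumed hyperparameter)

    def forward(self, z_e: torch.Tensor):
        # z_e: (batch, dim) continuous representation from a sentence-pair encoder.
        # Assign each vector to its nearest codebook entry.
        distances = torch.cdist(z_e, self.codebook.weight)  # (batch, num_codes)
        indices = distances.argmin(dim=-1)                  # (batch,)
        z_q = self.codebook(indices)                        # (batch, dim)

        # Codebook and commitment losses, as in the standard VQ objective.
        loss = F.mse_loss(z_q, z_e.detach()) + self.beta * F.mse_loss(z_e, z_q.detach())

        # Straight-through estimator: gradients bypass the argmin and flow to the encoder.
        z_q = z_e + (z_q - z_e).detach()
        return z_q, indices, loss
```

In a setup like the one the abstract describes, the returned quantized vector `z_q` would then be injected into a transformer-based language model through an attention mechanism; that injection is specific to the paper and is not sketched here.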

Authors (5)
  1. Erxin Yu (6 papers)
  2. Lan Du (46 papers)
  3. Yuan Jin (24 papers)
  4. Zhepei Wei (12 papers)
  5. Yi Chang (150 papers)
Citations (6)
