Learning Topic-Sensitive Word Representations (1705.00441v1)

Published 1 May 2017 in cs.CL

Abstract: Distributed word representations are widely used for modeling words in NLP tasks. Most existing models generate one representation per word and do not consider different meanings of a word. We present two approaches to learning multiple topic-sensitive representations per word using the Hierarchical Dirichlet Process. We observe that by modeling topics and integrating topic distributions for each document, we obtain representations that are able to distinguish between different meanings of a given word. Our models yield statistically significant improvements on the lexical substitution task, indicating that commonly used single word representations, even when combined with contextual information, are insufficient for this task.
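
To make the idea concrete, below is a minimal sketch of one way to obtain topic-sensitive word vectors. This is not the paper's two proposed models: it assumes the gensim library, infers document-level topic mixtures with an HDP, tags each token with its document's dominant topic, and trains skip-gram on the tagged corpus so that each (word, topic) pair receives a separate vector. All identifiers and the toy corpus are illustrative.

```python
# Hedged sketch (not the paper's exact method): approximate topic-sensitive
# word vectors by (1) inferring per-document topic mixtures with gensim's
# HDP implementation, (2) tagging each token with its document's dominant
# topic, and (3) training skip-gram on the tagged corpus, so every
# (word, topic) pair gets its own embedding.
from gensim.corpora import Dictionary
from gensim.models import HdpModel, Word2Vec

# Toy corpus: "bank" appears in a finance and a river context.
docs = [
    ["the", "bank", "approved", "the", "loan", "application"],
    ["the", "river", "bank", "was", "muddy", "after", "rain"],
]

dictionary = Dictionary(docs)
bows = [dictionary.doc2bow(doc) for doc in docs]

# HDP is nonparametric: the number of topics is inferred, not fixed.
hdp = HdpModel(bows, id2word=dictionary)

tagged_docs = []
for doc, bow in zip(docs, bows):
    topics = hdp[bow]  # [(topic_id, probability), ...] for this document
    dominant = max(topics, key=lambda t: t[1])[0] if topics else 0
    # Suffix each token with the dominant topic id, e.g. "bank_t2".
    tagged_docs.append([f"{w}_t{dominant}" for w in doc])

# One skip-gram vector per (word, topic) token type.
model = Word2Vec(tagged_docs, vector_size=50, window=3, min_count=1, sg=1)

# On real data, the two "bank" senses would tend to land in different
# topics and thus receive distinct vectors, e.g. model.wv["bank_t0"]
# vs. model.wv["bank_t2"].
```

Note that the hard dominant-topic assignment above is a simplification: the abstract describes integrating the full topic distribution of each document rather than a single topic label per token.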

Authors (3)
  1. Marzieh Fadaee (40 papers)
  2. Arianna Bisazza (43 papers)
  3. Christof Monz (54 papers)
Citations (11)
