Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories (2110.14091v1)

Published 27 Oct 2021 in cs.CL and cs.AI

Abstract: Word Sense Disambiguation (WSD) aims to automatically identify the exact meaning of one word according to its context. Existing supervised models struggle to make correct predictions on rare word senses due to limited training data and can only select the best definition sentence from one predefined word sense inventory (e.g., WordNet). To address the data sparsity problem and generalize the model to be independent of one predefined inventory, we propose a gloss alignment algorithm that can align definition sentences (glosses) with the same meaning from different sense inventories to collect rich lexical knowledge. We then train a model to identify semantic equivalence between a target word in context and one of its glosses using these aligned inventories, which exhibits strong transfer capability to many WSD tasks. Experiments on benchmark datasets show that the proposed method improves predictions on both frequent and rare word senses, outperforming prior work by 1.2% on the All-Words WSD Task and 4.3% on the Low-Shot WSD Task. Evaluation on WiC Task also indicates that our method can better capture word meanings in context.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Wenlin Yao (38 papers)
  2. Xiaoman Pan (25 papers)
  3. Lifeng Jin (24 papers)
  4. Jianshu Chen (66 papers)
  5. Dian Yu (78 papers)
  6. Dong Yu (329 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.