Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Using Distributed Representations to Disambiguate Biomedical and Clinical Concepts (1608.05605v1)

Published 19 Aug 2016 in cs.CL

Abstract: In this paper, we report a knowledge-based method for Word Sense Disambiguation in the domains of biomedical and clinical text. We combine word representations created on large corpora with a small number of definitions from the UMLS to create concept representations, which we then compare to representations of the context of ambiguous terms. Using no relational information, we obtain comparable performance to previous approaches on the MSH-WSD dataset, which is a well-known dataset in the biomedical domain. Additionally, our method is fast and easy to set up and extend to other domains. Supplementary materials, including source code, can be found at https: //github.com/clips/yarn

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Stéphan Tulkens (5 papers)
  2. Simon Šuster (14 papers)
  3. Walter Daelemans (31 papers)
Citations (27)

Summary

We haven't generated a summary for this paper yet.