
MED-SE: Medical Entity Definition-based Sentence Embedding (2212.04734v1)

Published 9 Dec 2022 in cs.LG, cs.AI, and cs.CL

Abstract: We propose Medical Entity Definition-based Sentence Embedding (MED-SE), a novel unsupervised contrastive learning framework designed for clinical texts, which exploits the definitions of medical entities. To this end, we conduct an extensive analysis of multiple sentence embedding techniques in clinical semantic textual similarity (STS) settings. In the entity-centric setting that we have designed, MED-SE achieves significantly better performance, while the existing unsupervised methods including SimCSE show degraded performance. Our experiments elucidate the inherent discrepancies between the general- and clinical-domain texts, and suggest that entity-centric contrastive approaches may help bridge this gap and lead to a better representation of clinical sentences.
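The abstract does not spell out the training objective, so the following is only a minimal sketch of a generic InfoNCE-style contrastive loss of the kind used by SimCSE-family methods. It assumes, purely for illustration, that each clinical sentence is paired with the embedding of the definition of a medical entity it mentions, and that the other definitions in the batch act as negatives. The names `cosine`, `info_nce`, and `temperature`, and the toy vectors, are hypothetical and do not come from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(sentence_vecs, definition_vecs, temperature=0.05):
    """Mean InfoNCE loss over a batch.

    Assumption: sentence_vecs[i] and definition_vecs[i] form a positive
    pair, and every other definition in the batch is a negative.
    """
    losses = []
    for i, sent in enumerate(sentence_vecs):
        logits = [cosine(sent, d) / temperature for d in definition_vecs]
        # Numerically stable log-sum-exp over all candidates.
        peak = max(logits)
        log_denom = peak + math.log(sum(math.exp(x - peak) for x in logits))
        # Negative log-probability of the positive pair.
        losses.append(log_denom - logits[i])
    return sum(losses) / len(losses)


# Toy 2-d embeddings (hypothetical, not real model outputs).
sentences = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # each definition matches its sentence
swapped = [[0.1, 0.9], [0.9, 0.1]]   # definitions paired with the wrong sentence

loss_aligned = info_nce(sentences, aligned)
loss_swapped = info_nce(sentences, swapped)
```

In this toy setup the loss is lower when each sentence sits close to its own entity definition than when the pairs are swapped, which is the behavior an entity-centric contrastive objective would reward.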

Authors
  1. Hyeonbin Hwang
  2. Haanju Yoo
  3. Yera Choi
