Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding (2111.09098v4)

Published 12 Nov 2021 in cs.CL and cs.LG

Abstract: EHR systems lack a unified code system forrepresenting medical concepts, which acts asa barrier for the deployment of deep learningmodels in large scale to multiple clinics and hos-pitals. To overcome this problem, we introduceDescription-based Embedding,DescEmb, a code-agnostic representation learning framework forEHR. DescEmb takes advantage of the flexibil-ity of neural language understanding models toembed clinical events using their textual descrip-tions rather than directly mapping each event toa dedicated embedding. DescEmb outperformedtraditional code-based embedding in extensiveexperiments, especially in a zero-shot transfertask (one hospital to another), and was able totrain a single unified model for heterogeneousEHR datasets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Kyunghoon Hur (8 papers)
  2. Jiyoung Lee (42 papers)
  3. Jungwoo Oh (11 papers)
  4. Wesley Price (2 papers)
  5. Young-Hak Kim (14 papers)
  6. Edward Choi (90 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.