Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unsupervised Annotation of Phenotypic Abnormalities via Semantic Latent Representations on Electronic Health Records (1911.03862v1)

Published 10 Nov 2019 in cs.CL

Abstract: The extraction of phenotype information which is naturally contained in electronic health records (EHRs) has been found to be useful in various clinical informatics applications such as disease diagnosis. However, due to imprecise descriptions, lack of gold standards and the demand for efficiency, annotating phenotypic abnormalities on millions of EHR narratives is still challenging. In this work, we propose a novel unsupervised deep learning framework to annotate the phenotypic abnormalities from EHRs via semantic latent representations. The proposed framework takes the advantage of Human Phenotype Ontology (HPO), which is a knowledge base of phenotypic abnormalities, to standardize the annotation results. Experiments have been conducted on 52,722 EHRs from MIMIC-III dataset. Quantitative and qualitative analysis have shown the proposed framework achieves state-of-the-art annotation performance and computational efficiency compared with other methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jingqing Zhang (15 papers)
  2. Xiaoyu Zhang (144 papers)
  3. Kai Sun (317 papers)
  4. Xian Yang (33 papers)
  5. Chengliang Dai (11 papers)
  6. Yike Guo (144 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.