Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review (2107.02975v1)

Published 7 Jul 2021 in cs.CL and cs.AI

Abstract: Electronic health records (EHRs), digital collections of patient healthcare events and observations, are ubiquitous in medicine and critical to healthcare delivery, operations, and research. Despite this central role, EHRs are notoriously difficult to process automatically. Well over half of the information stored within EHRs is in the form of unstructured text (e.g. provider notes, operation reports) and remains largely untapped for secondary use. Recently, however, newer neural network and deep learning approaches to NLP have made considerable advances, outperforming traditional statistical and rule-based systems on a variety of tasks. In this survey paper, we summarize current neural NLP methods for EHR applications. We focus on a broad scope of tasks, namely, classification and prediction, word embeddings, extraction, generation, and other topics such as question answering, phenotyping, knowledge graphs, medical dialogue, multilinguality, interpretability, etc.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (13)
  1. Irene Li (47 papers)
  2. Jessica Pan (3 papers)
  3. Jeremy Goldwasser (8 papers)
  4. Neha Verma (18 papers)
  5. Wai Pan Wong (2 papers)
  6. Muhammed Yavuz Nuzumlalı (2 papers)
  7. Benjamin Rosand (4 papers)
  8. Yixin Li (29 papers)
  9. Matthew Zhang (4 papers)
  10. David Chang (4 papers)
  11. R. Andrew Taylor (3 papers)
  12. Harlan M. Krumholz (7 papers)
  13. Dragomir Radev (98 papers)
Citations (128)