
EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts (2204.06604v5)

Published 13 Apr 2022 in cs.CL

Abstract: The Electronic Health Record (EHR) is an essential part of the modern medical system and impacts healthcare delivery, operations, and research. Although EHRs contain structured information, their unstructured text is attracting growing attention and has become an exciting research field. The success of recent neural NLP methods has opened a new direction for processing unstructured clinical notes. In this work, we create a Python library for clinical texts, EHRKit. The library contains two main parts: MIMIC-III-specific functions and task-specific functions. The first part provides a set of interfaces for accessing MIMIC-III NOTEEVENTS data, including basic search, information retrieval, and information extraction. The second part integrates many third-party libraries for up to 12 off-the-shelf NLP tasks, such as named entity recognition, summarization, and machine translation.

Authors (8)
  1. Irene Li (47 papers)
  2. Keen You (7 papers)
  3. Yujie Qiao (4 papers)
  4. Lucas Huang (3 papers)
  5. Chia-Chun Hsieh (3 papers)
  6. Benjamin Rosand (4 papers)
  7. Jeremy Goldwasser (8 papers)
  8. Dragomir Radev (98 papers)
Citations (4)
