Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models (1910.13664v2)

Published 30 Oct 2019 in cs.CL

Abstract: Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems. Phenotyping, the extraction of patient conditions from free clinical text, is a critical task which supports avariety of downstream applications such as decision support and secondary use of medical records. Previous work has resulted in systems which are high performing but require hand engineering, often of rules. Recent work in pretrained contextualized LLMs have enabled advances in representing text for a variety of tasks. We therefore explore several architectures for modeling pheno-typing that rely solely on BERT representations of the clinical note, removing the need for manual engineering. We find these architectures are competitive with or outperform existing state of the art methods on two phenotyping tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Andriy Mulyar (6 papers)
  2. Elliot Schumacher (10 papers)
  3. Masoud Rouhizadeh (5 papers)
  4. Mark Dredze (66 papers)
Citations (37)