Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Concept Annotation System for Clinical Records (1012.1663v1)

Published 8 Dec 2010 in cs.IR

Abstract: Unstructured information comprises a valuable source of data in clinical records. For text mining in clinical records, concept extraction is the first step in finding assertions and relationships. This study presents a system developed for the annotation of medical concepts, including medical problems, tests, and treatments, mentioned in clinical records. The system combines six publicly available named entity recognition system into one framework, and uses a simple voting scheme that allows to tune precision and recall of the system to specific needs. The system provides both a web service interface and a UIMA interface which can be easily used by other systems. The system was tested in the fourth i2b2 challenge and achieved an F-score of 82.1% for the concept exact match task, a score which is among the top-ranking systems. To our knowledge, this is the first publicly available clinical record concept annotation system.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Ning Kang (27 papers)
  2. Rogier Barendse (1 paper)
  3. Zubair Afzal (2 papers)
  4. Bharat Singh (26 papers)
  5. Martijn J. Schuemie (7 papers)
  6. Erik M. van Mulligen (1 paper)
  7. Jan A. Kors (4 papers)
Citations (2)