Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Named Entities in Medical Case Reports: Corpus and Experiments (2003.13032v1)

Published 29 Mar 2020 in cs.CL

Abstract: We present a new corpus comprising annotations of medical entities in case reports, originating from PubMed Central's open access library. In the case reports, we annotate cases, conditions, findings, factors and negation modifiers. Moreover, where applicable, we annotate relations between these entities. As such, this is the first corpus of this kind made available to the scientific community in English. It enables the initial investigation of automatic information extraction from case reports through tasks like Named Entity Recognition, Relation Extraction and (sentence/paragraph) relevance detection. Additionally, we present four strong baseline systems for the detection of medical entities made available through the annotated dataset.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Sarah Schulz (4 papers)
  2. Samuel Rodriguez (5 papers)
  3. Malte Ostendorff (23 papers)
  4. Georg Rehm (32 papers)
  5. Jurica Ĺ eva (1 paper)
Citations (9)

Summary

We haven't generated a summary for this paper yet.