Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PHEE: A Dataset for Pharmacovigilance Event Extraction from Text (2210.12560v1)

Published 22 Oct 2022 in cs.CL

Abstract: The primary goal of drug safety researchers and regulators is to promptly identify adverse drug reactions. Doing so may in turn prevent or reduce the harm to patients and ultimately improve public health. Evaluating and monitoring drug safety (i.e., pharmacovigilance) involves analyzing an ever growing collection of spontaneous reports from health professionals, physicians, and pharmacists, and information voluntarily submitted by patients. In this scenario, facilitating analysis of such reports via automation has the potential to rapidly identify safety signals. Unfortunately, public resources for developing natural LLMs for this task are scant. We present PHEE, a novel dataset for pharmacovigilance comprising over 5000 annotated events from medical case reports and biomedical literature, making it the largest such public dataset to date. We describe the hierarchical event schema designed to provide coarse and fine-grained information about patients' demographics, treatments and (side) effects. Along with the discussion of the dataset, we present a thorough experimental evaluation of current state-of-the-art approaches for biomedical event extraction, point out their limitations, and highlight open challenges to foster future research in this area.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Zhaoyue Sun (6 papers)
  2. Jiazheng Li (37 papers)
  3. Gabriele Pergola (26 papers)
  4. Byron C. Wallace (82 papers)
  5. Bino John (4 papers)
  6. Nigel Greene (2 papers)
  7. Joseph Kim (9 papers)
  8. Yulan He (113 papers)
Citations (31)