Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Text Classification of Cancer Clinical Trial Eligibility Criteria (2309.07812v2)

Published 14 Sep 2023 in cs.CL and cs.LG

Abstract: Automatic identification of clinical trials for which a patient is eligible is complicated by the fact that trial eligibility is stated in natural language. A potential solution to this problem is to employ text classification methods for common types of eligibility criteria. In this study, we focus on seven common exclusion criteria in cancer trials: prior malignancy, human immunodeficiency virus, hepatitis B, hepatitis C, psychiatric illness, drug/substance abuse, and autoimmune illness. Our dataset consists of 764 phase III cancer trials with these exclusions annotated at the trial level. We experiment with common transformer models as well as a new pre-trained clinical trial BERT model. Our results demonstrate the feasibility of automatically classifying common exclusion criteria. Additionally, we demonstrate the value of a pre-trained LLM specifically for clinical trials, which yields the highest average performance across all criteria.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Yumeng Yang (49 papers)
  2. Soumya Jayaraj (1 paper)
  3. Ethan B Ludmir (2 papers)
  4. Kirk Roberts (32 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.