2000 character limit reached
Clinical Trial Information Extraction with BERT (2110.10027v1)
Published 11 Sep 2021 in q-bio.QM, cs.CL, and cs.LG
Abstract: Natural language processing (NLP) of clinical trial documents can be useful in new trial design. Here we identify entity types relevant to clinical trial design and propose a framework called CT-BERT for information extraction from clinical trial text. We trained named entity recognition (NER) models to extract eligibility criteria entities by fine-tuning a set of pre-trained BERT models. We then compared the performance of CT-BERT with recent baseline methods including attention-based BiLSTM and Criteria2Query. The results demonstrate the superiority of CT-BERT in clinical trial NLP.
- Xiong Liu (26 papers)
- Greg L. Hersch (2 papers)
- Iya Khalil (6 papers)
- Murthy Devarakonda (10 papers)