Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language (2405.01682v2)
Abstract: Automatic conversion of free-text radiology reports into structured data using NLP techniques is crucial for analyzing diseases on a large scale. While generative LLMs are effective for such tasks in widely spoken languages like English, they typically underperform in less common languages and can pose risks to patient privacy. Fine-tuning local NLP models is hindered by the skewed nature of real-world medical datasets, where rare findings create significant data imbalance. We introduce SMP-BERT, a novel prompt-learning method that leverages the structured nature of reports to overcome these challenges. In our studies involving a substantial collection of Crohn's disease radiology reports in Hebrew (over 8,000 patients and 10,000 reports), SMP-BERT greatly surpassed traditional fine-tuning methods, notably in detecting infrequent conditions (AUC: 0.99 vs 0.94, F1: 0.84 vs 0.34). SMP-BERT makes more accurate AI diagnostics available for low-resource languages.
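The abstract describes SMP-BERT as a prompt-learning method exploiting the sectioned structure of radiology reports. A minimal sketch of the general idea, assuming an NSP-style section-matching formulation (the function name and prompt template below are illustrative, not the paper's exact prompts): each report section is paired with one hypothesis sentence per candidate finding, and a pretrained BERT's next-sentence-prediction head can then score each pair, avoiding a fine-tuned classifier head that would struggle with rare findings.

```python
# Hedged sketch: building NSP-style prompt pairs for structured report extraction.
# Template wording and function names are illustrative assumptions, not the
# paper's actual prompts.

def build_prompt_pairs(report_section: str, findings: list[str]) -> list[tuple[str, str]]:
    """Pair a report section with one hypothesis sentence per candidate finding.

    A pretrained BERT's next-sentence-prediction (NSP) head can score each
    (section, hypothesis) pair; a high "is-next" probability is read as the
    finding being present in that section. Because the NSP head is reused as-is,
    no task-specific classifier needs to be trained on scarce positive examples.
    """
    template = "Finding: {finding} is present."  # illustrative template
    return [(report_section, template.format(finding=f)) for f in findings]

pairs = build_prompt_pairs(
    "Terminal ileum shows mural thickening and enhancement.",
    ["wall thickening", "stricture", "fistula"],
)
# One (section, hypothesis) pair per candidate finding:
for premise, hypothesis in pairs:
    print(premise, "||", hypothesis)
```

In practice, each pair would be tokenized jointly (`[CLS] section [SEP] hypothesis [SEP]`) and scored by the NSP head of a Hebrew pretrained model.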
- Liam Hazan
- Gili Focht
- Naama Gavrielov
- Roi Reichart
- Talar Hagopian
- Mary-Louise C. Greer
- Ruth Cytter Kuint
- Dan Turner
- Moti Freiman