Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification? (2503.20794v2)

Published 21 Mar 2025 in cs.CL, cs.CR, cs.IR, and cs.LG

Abstract: We evaluate the performance of four leading solutions for de-identification of unstructured medical text - Azure Health Data Services, AWS Comprehend Medical, OpenAI GPT-4o, and John Snow Labs - on a ground truth dataset of 48 clinical documents annotated by medical experts. The analysis, conducted at both entity-level and token-level, suggests that John Snow Labs' Medical LLMs solution achieves the highest accuracy, with a 96% F1-score in protected health information (PHI) detection, outperforming Azure (91%), AWS (83%), and GPT-4o (79%). John Snow Labs is not only the only solution which achieves regulatory-grade accuracy (surpassing that of human experts) but is also the most cost-effective solution: It is over 80% cheaper compared to Azure and GPT-4o, and is the only solution not priced by token. Its fixed-cost local deployment model avoids the escalating per-request fees of cloud-based services, making it a scalable and economical choice.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Veysel Kocaman (12 papers)
  2. Muhammed Santas (1 paper)
  3. Yigit Gul (2 papers)
  4. Mehmet Butgul (2 papers)
  5. David Talby (9 papers)

Summary

We haven't generated a summary for this paper yet.