Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Human-centric Metric for Accelerating Pathology Reports Annotation (1911.01226v2)

Published 31 Oct 2019 in cs.CL, cs.CY, cs.LG, and stat.ML

Abstract: Pathology reports contain useful information such as the main involved organ, diagnosis, etc. These information can be identified from the free text reports and used for large-scale statistical analysis or serve as annotation for other modalities such as pathology slides images. However, manual classification for a huge number of reports on multiple tasks is labor-intensive. In this paper, we have developed an automatic text classifier based on BERT and we propose a human-centric metric to evaluate the model. According to the model confidence, we identify low-confidence cases that require further expert annotation and high-confidence cases that are automatically classified. We report the percentage of low-confidence cases and the performance of automatically classified cases. On the high-confidence cases, the model achieves classification accuracy comparable to pathologists. This leads a potential of reducing 80% to 98% of the manual annotation workload.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Ruibin Ma (2 papers)
  2. Po-Hsuan Cameron Chen (10 papers)
  3. Gang Li (579 papers)
  4. Wei-Hung Weng (35 papers)
  5. Angela Lin (3 papers)
  6. Krishna Gadepalli (7 papers)
  7. Yuannan Cai (3 papers)
Citations (3)