Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Recommendations on test datasets for evaluating AI solutions in pathology (2204.14226v1)

Published 21 Apr 2022 in eess.IV, cs.AI, cs.CV, cs.LG, and physics.med-ph

Abstract: AI solutions that automatically extract information from digital histology images have shown great promise for improving pathological diagnosis. Prior to routine use, it is important to evaluate their predictive performance and obtain regulatory approval. This assessment requires appropriate test datasets. However, compiling such datasets is challenging and specific recommendations are missing. A committee of various stakeholders, including commercial AI developers, pathologists, and researchers, discussed key aspects and conducted extensive literature reviews on test datasets in pathology. Here, we summarize the results and derive general recommendations for the collection of test datasets. We address several questions: Which and how many images are needed? How to deal with low-prevalence subsets? How can potential bias be detected? How should datasets be reported? What are the regulatory requirements in different countries? The recommendations are intended to help AI developers demonstrate the utility of their products and to help regulatory agencies and end users verify reported performance measures. Further research is needed to formulate criteria for sufficiently representative test datasets so that AI solutions can operate with less user intervention and better support diagnostic workflows in the future.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (30)
  1. André Homeyer (6 papers)
  2. Christian Geißler (5 papers)
  3. Lars Ole Schwen (4 papers)
  4. Falk Zakrzewski (3 papers)
  5. Theodore Evans (1 paper)
  6. Klaus Strohmenger (2 papers)
  7. Max Westphal (5 papers)
  8. Roman David Bülow (1 paper)
  9. Michaela Kargl (1 paper)
  10. Aray Karjauv (2 papers)
  11. Isidre Munné-Bertran (1 paper)
  12. Carl Orge Retzlaff (1 paper)
  13. Adrià Romero-López (2 papers)
  14. Tomasz Sołtysiński (1 paper)
  15. Markus Plass (5 papers)
  16. Rita Carvalho (4 papers)
  17. Peter Steinbach (12 papers)
  18. Yu-Chia Lan (1 paper)
  19. Nassim Bouteldja (2 papers)
  20. David Haber (3 papers)
Citations (35)

Summary

We haven't generated a summary for this paper yet.