
Important New Developments in Arabographic Optical Character Recognition (OCR) (1703.09550v1)

Published 28 Mar 2017 in cs.CV and cs.DL

Abstract: The OpenITI team has achieved Optical Character Recognition (OCR) accuracy rates for classical Arabic-script texts in the high nineties. These numbers are based on our tests of seven different Arabic-script texts of varying quality and typefaces, totaling over 7,000 lines. These accuracy rates not only represent a distinct improvement over the actual accuracy rates of the various proprietary OCR options for classical Arabic-script texts, but, equally important, they are produced using open-source OCR software, thus enabling us to make this Arabic-script OCR technology freely available to the broader Islamic, Persian, and Arabic Studies communities.

Authors (4)
  1. Maxim Romanov (3 papers)
  2. Matthew Thomas Miller (4 papers)
  3. Sarah Bowen Savant (1 paper)
  4. Benjamin Kiessling (3 papers)
Citations (25)
