Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey (2312.11812v1)

Published 19 Dec 2023 in cs.CV and cs.AI

Abstract: Optical character recognition (OCR) is a vital process that involves the extraction of handwritten or printed text from scanned or printed images, converting it into a format that can be understood and processed by machines. This enables further data processing activities such as searching and editing. The automatic extraction of text through OCR plays a crucial role in digitizing documents, enhancing productivity, improving accessibility, and preserving historical records. This paper seeks to offer an exhaustive review of contemporary applications, methodologies, and challenges associated with Arabic Optical Character Recognition (OCR). A thorough analysis is conducted on prevailing techniques utilized throughout the OCR process, with a dedicated effort to discern the most efficacious approaches that demonstrate enhanced outcomes. To ensure a thorough evaluation, a meticulous keyword-search methodology is adopted, encompassing a comprehensive analysis of articles relevant to Arabic OCR, including both backward and forward citation reviews. In addition to presenting cutting-edge techniques and methods, this paper critically identifies research gaps within the realm of Arabic OCR. By highlighting these gaps, we shed light on potential areas for future exploration and development, thereby guiding researchers toward promising avenues in the field of Arabic OCR. The outcomes of this study provide valuable insights for researchers, practitioners, and stakeholders involved in Arabic OCR, ultimately fostering advancements in the field and facilitating the creation of more accurate and efficient OCR systems for the Arabic language.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (114)
  1. Amurd: Annotated multilingual receipts dataset for cross-lingual key information extraction and classification. arXiv preprint arXiv:2309.09800 .
  2. Tncr: Table net detection and classification dataset. Neurocomputing 473, 79–97.
  3. Attention-based fully gated cnn-bgru for russian handwritten text. Journal of Imaging 6, 141.
  4. Generator-retriever-generator: A novel approach to open-domain question answering. arXiv preprint arXiv:2307.11278 .
  5. Automated question-answer medical model based on deep learning technology, in: Proceedings of the 6th International Conference on Engineering & MIS 2020, pp. 1–8.
  6. Exploring the state of the art in legal qa systems. arXiv preprint arXiv:2304.06623 .
  7. Enhancing core image classification using generative adversarial networks (gans). arXiv e-prints , arXiv–2204.
  8. An approach to analysis of arabic text documents into text lines, words, and characters. Indonesian Journal of Electrical Engineering and Computer Science 26, 754–763.
  9. Novel deep convolutional neural network-based contextual recognition of arabic handwritten scripts. Entropy 23, 340.
  10. Handwritten urdu character recognition using one-dimensional blstm classifier. Neural Computing and Applications 31, 1143–1151.
  11. An information security model for an iot-enabled smart grid in the saudi energy sector. Computers and Electrical Engineering 105, 108491.
  12. An arabic manuscript regions detection, recognition and its applications for ocring. Transactions on Asian and Low-Resource Language Information Processing 22, 1–28.
  13. A novel approach to printed arabic optical character recognition. Arabian Journal for Science and Engineering 47, 2219–2237.
  14. A data base for arabic handwritten text recognition research, in: Proceedings eighth international workshop on frontiers in handwriting recognition, IEEE. pp. 485–489.
  15. Databases for recognition of handwritten arabic cheques. Pattern Recognition 36, 111–121.
  16. A review of arabic text recognition dataset. Asia-Pacific J. Inf. Technol. Multimedia 9, 69–81.
  17. Improving the accuracy for offline arabic digit recognition using sliding window approach. Iranian Journal of Science and Technology, Transactions of Electrical Engineering 44, 1633–1644.
  18. Printed arabic script recognition: A survey. International Journal of Advanced Computer Science and Applications 9.
  19. Arabic ocr evaluation tool, in: 2016 7th international conference on computer science and information technology (CSIT), IEEE. pp. 1–6.
  20. Handwritten arabic character recognition for children writing using convolutional neural network and stroke identification. Human-Centric Intelligent Systems , 1–13.
  21. A survey on the existing arabic optical character recognition and future trends. International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE) 7, 78–88.
  22. A recognition model for handwritten persian/arabic numbers based on optimized deep convolutional neural network. Multimedia Tools and Applications 82, 14557–14580.
  23. Ensemble deep transfer learning model for arabic (indian) handwritten digit recognition. Neural Computing and Applications 34, 705–719.
  24. Arabic handwritten recognition using deep learning: A survey. Arabian Journal for Science and Engineering 47, 9943–9963.
  25. Arabic handwriting recognition system using convolutional neural network. Neural Computing and Applications 33, 2249–2261.
  26. Generative adversarial network for an improved arabic handwritten characters recognition. International Journal of Advances in Soft Computing & Its Applications 14.
  27. A novel hybrid dl model for printed arabic word recognition based on gan. International Journal of Advanced Computer Science and Applications 14.
  28. Deep learning application for handwritten arabic word recognition, in: 2022 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE), IEEE. pp. 95–100.
  29. A survey on scanned receipts ocr and information extraction .
  30. Automatic processing of handwritten arabic forms using neural networks., in: IEC (Prague), pp. 313–317.
  31. A multiple feature/resolution scheme to arabic (indian) numerals recognition using hidden markov models. Signal Processing 89, 1176–1184.
  32. Deep-learning ensemble for offline arabic handwritten words recognition, in: 2019 14th International Conference on Computer Engineering and Systems (ICCES), IEEE. pp. 40–45.
  33. Qtid: Quran text image dataset. International Journal Of Advanced Computer Science And Applications 9.
  34. Quranic script optical text recognition using deep learning in iot systems. CMC-Comput. Mater. Contin 68, 1847–1858.
  35. Arabic natural language processing for qur’anic research: A systematic review. Artificial Intelligence Review 56, 6801–6854.
  36. Ocr post-processing error correction algorithm using google online spelling suggestion. arXiv preprint arXiv:1204.0191 .
  37. A printed paw image database of arabic language for document analysis and recognition. Journal of ICT Research & Applications 11.
  38. Novel perspectives for the management of multilingual and multialphabetic heritages through automatic knowledge extraction: The digitalmaktaba approach. Sensors 22, 3995.
  39. Recognition and classification of handwritten urdu numerals using deep learning techniques. Applied Sciences 13, 1624.
  40. Printed ottoman text recognition using synthetic data and data augmentation. International Journal on Document Analysis and Recognition (IJDAR) , 1–15.
  41. Towards accurate children’s arabic handwriting recognition via deep learning. Applied Sciences 13, 1692.
  42. Characters segmentation from arabic handwritten document images: Hybrid approach. International Journal of Advanced Computer Science and Applications 13.
  43. Arabic handwriting word recognition based on convolutional recurrent neural network, in: WITS 2020: Proceedings of the 6th International Conference on Wireless Technologies, Embedded, and Intelligent Systems, Springer. pp. 877–885.
  44. Printed arabic characters recognition using combined features and cnn classifier, in: 2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI), IEEE. pp. 1–5.
  45. A review of arabic document analysis methods, in: 2022 4th International Conference on Pattern Analysis and Intelligent Systems (PAIS), IEEE. pp. 1–7.
  46. Attention-based cnn-rnn arabic text recognition from natural scene images. Forecasting 3, 520–540.
  47. Automatic cnn-based arabic numeral spotting and handwritten digit recognition by using deep transfer learning in ottoman population registers. Applied Sciences 10, 5430.
  48. Smartatid: A mobile captured arabic text images dataset for multi-purpose recognition tasks, in: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), IEEE. pp. 120–125.
  49. Attention-based models for speech recognition. arXiv preprint arXiv:1506.07503 .
  50. A survey on arabic handwritten script recognition systems. International Journal of Artificial Intelligence and Machine Learning (IJAIML) 11, 1–17.
  51. A novel arabic ocr post-processing using rule-based and word context techniques. International Journal on Document Analysis and Recognition (IJDAR) 21, 77–89.
  52. A two-stage system for arabic handwritten digit recognition tested on a new large database., in: Artificial intelligence and pattern recognition, pp. 237–242.
  53. Segmentation of handwritten arabic graphemes using a directed convolutional neural network and mathematical morphology operations. Pattern Recognition 122, 108288.
  54. Exploring deep learning approaches to recognize handwritten arabic texts. IEEE Access 8, 89882–89898.
  55. Generative adversarial network based adaptive data augmentation for handwritten arabic text recognition. PeerJ Computer Science 8, e861.
  56. Generative vs. discriminative recognition models for off-line arabic handwriting. Sensors 18, 2786.
  57. Measuring text similarity based on structure and word embedding. Cognitive Systems Research 63.
  58. Continuous handwritten script recognition, in: Doermann, D., Tombre, K. (Eds.), Handbook of Document Image Processing and Recognition. Springer London, London, pp. 391–425. doi:10.1007/978-0-85729-859-1_12.
  59. Stacking ensemble model of deep learning and its application to persian/arabic handwritten digits recognition. Knowledge-Based Systems 220, 106940.
  60. A detailed analysis of optical character recognition technology. International Journal of Applied Mathematics Electronics and Computers , 244–249.
  61. Neural network estimation model to optimize timing and schedule of software projects, in: 2021 IEEE International Conference on Smart Information Systems and Technologies (SIST), IEEE. pp. 1–7.
  62. Cursive arabic handwritten word recognition system using majority voting and k-nn for feature descriptor selection. Multimedia Tools and Applications , 1–25.
  63. Icdar2019 competition on scanned receipt ocr and information extraction, in: 2019 International Conference on Document Analysis and Recognition (ICDAR), IEEE. pp. 1516–1520.
  64. A survey on optical character recognition system. arXiv preprint arXiv:1710.05703 .
  65. Out of vocabulary word detection and recovery in arabic handwritten text recognition. Pattern Recognition 93, 507–520.
  66. Deep learning for table detection and structure recognition: A survey. arXiv preprint arXiv:2211.08469 .
  67. Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6, 46019–46046.
  68. Introducing a very large dataset of handwritten farsi digits and a study on their varieties. Pattern recognition letters 28, 1133–1141.
  69. Persian optical character recognition using deep bidirectional long short-term memory. Applied Sciences 12, 11760.
  70. Hacdb: Handwritten arabic characters database for automatic character recognition, in: European workshop on visual information processing (EUVIP), IEEE. pp. 255–259.
  71. Kafd arabic font database. Pattern Recognition 47, 2231–2240.
  72. Ganmasker: A two-stage generative adversarial network for high-quality face mask removal. Sensors 23, 7094.
  73. Ae-lstm: Autoencoder with lstm-based intrusion detection in iot, in: 2022 International Telecommunications Conference (ITC-Egypt), IEEE. pp. 1–6.
  74. Khatt: An open arabic offline handwritten text database. Pattern Recognition 47, 1096–1112.
  75. Recognizing handwriting styles in a historical scanned document using scikit-fuzzy c-means clustering. arXiv preprint arXiv:2210.16780 .
  76. Handwritten arabic and roman word recognition using holistic approach. The Visual Computer 39, 2909–2932.
  77. Arabic machine reading comprehension on the holy qur’an using cl-arabert. Information Processing & Management 59, 103068.
  78. An effective combination of convolutional neural network and support vector machine classifier for arabic handwritten recognition. Autom. Control Comput. Sci. 57, 267–275. URL: https://doi.org/10.3103/S0146411623030069, doi:10.3103/S0146411623030069.
  79. A new strategy for arabic ocr: archigraphemes, letter blocks, script grammar, and shape synthesis, in: Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, pp. 93–96.
  80. Arabic-sos: segmentation, stemming, and orthography standardization for classical and pre-modern standard arabic, in: Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, pp. 27–32.
  81. Alnasikh: An arabic ocr system based on transformers, in: 2023 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), IEEE. pp. 74–81.
  82. Ocformer: A transformer-based model for arabic handwritten text recognition, in: 2021 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), IEEE. pp. 182–186.
  83. Conditional deep convolutional generative adversarial networks for isolated handwritten arabic character generation. Arabian Journal for Science and Engineering 47, 1309–1320.
  84. Optimized leaky relu for handwritten arabic character recognition using convolution neural networks. Multimedia Tools and Applications , 1–30.
  85. Segmentation techniques for recognition of arabic-like scripts: A comprehensive survey. Education and Information Technologies 21, 1225–1241.
  86. A survey of ocr evaluation tools and metrics, in: The 6th International Workshop on Historical Document Imaging and Processing, pp. 13–18.
  87. Survey of post-ocr processing approaches. ACM Computing Surveys (CSUR) 54, 1–37.
  88. Disease inference from health-related questions via sparse deep learning. IEEE Transactions on knowledge and Data Engineering 27, 2107–2119.
  89. Classification of handwritten names of cities and handwritten text recognition using various deep learning models. arXiv preprint arXiv:2102.04816 .
  90. Handwritten kazakh and russian (hkr) database for text recognition. Multimedia Tools and Applications 80, 33075–33097.
  91. An augmented reality for an arabic text reading and visualization assistant for the visually impaired. Multimedia Tools and Applications , 1–29.
  92. Ifn/enit-database of handwritten arabic words, in: Proc. of CIFED, Citeseer. pp. 127–136.
  93. Cascadetabnet: An approach for end to end table detection and structure recognition from image-based documents, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 572–573.
  94. Learning-free, divide and conquer text-line extraction algorithm for printed arabic text with diacritics. Journal of King Saud University-Computer and Information Sciences 34, 7699–7709.
  95. An efficient, font independent word and character segmentation algorithm for printed arabic text. Journal of King Saud University-Computer and Information Sciences 34, 1330–1344.
  96. Arabic handwriting data base for text recognition. Procedia Technology 11, 580–584.
  97. Scrutinization of urdu handwritten text recognition with machine learning approach, in: Emerging Technologies in Computer Engineering: Cognitive Computing and Intelligent IoT: 5th International Conference, ICETCE 2022, Jaipur, India, February 4–5, 2022, Revised Selected Papers, Springer. pp. 383–394.
  98. Ocr4all—an open-source tool providing a (semi-) automatic ocr workflow for historical printings. Applied Sciences 9, 4853.
  99. A segmentation-free approach to arabic and urdu ocr, in: Document recognition and retrieval XX, SPIE. pp. 215–226.
  100. A robust approach for arabic document images segmentation and indexation, in: International Conference on Digital Technologies and Applications, Springer. pp. 540–549.
  101. A database of printed jawi character image, in: 2015 Third International Conference on Image Information Processing (ICIIP), IEEE. pp. 56–59.
  102. Proposed deep learning system for arabic text detection and recognition, in: 2023 15th International Conference on Developments in eSystems Engineering (DeSE), IEEE. pp. 39–44.
  103. A survey of ocr applications. International Journal of Machine Learning and Computing 2, 314.
  104. On the performance analysis of various features and classifiers for handwritten devanagari word recognition. Neural Computing and Applications 35, 7509–7527.
  105. Database and evaluation protocols for arabic printed text recognition. DIUF-University of Fribourg-Switzerland 1.
  106. A database for degraded arabic historical manuscripts, in: 2017 6th International Conference on Electrical Engineering and Informatics (ICEEI), IEEE. pp. 1–6.
  107. Ocr as a service: an experimental evaluation of google docs ocr, tesseract, abbyy finereader, and transym, in: Advances in Visual Computing: 12th International Symposium, ISVC 2016, Las Vegas, NV, USA, December 12-14, 2016, Proceedings, Part I 12, Springer. pp. 735–746.
  108. Recognition of visual arabic scripting news ticker from broadcast stream. IEEE Access 10, 59189–59204.
  109. Kohtd: Kazakh offline handwritten text dataset. Signal Processing: Image Communication 108, 116827.
  110. Optimization of global production scheduling with deep reinforcement learning. Procedia Cirp 72, 1264–1269.
  111. Improving performance of autoencoder-based network anomaly detection on nsl-kdd dataset. IEEE Access 9, 140136–140146.
  112. Alif: A dataset for arabic embedded text recognition in tv broadcast, in: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), IEEE. pp. 1221–1225.
  113. A dataset for arabic text detection, tracking and recognition in news videos-activ, in: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), IEEE. pp. 996–1000.
  114. Open datasets and tools for arabic text detection and recognition in news video frames. Journal of Imaging 4, 32.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Mahmoud SalahEldin Kasem (4 papers)
  2. Mohamed Mahmoud (24 papers)
  3. Hyun-Soo Kang (2 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.