Javascript must be enabled to continue!
Experimental evaluation of Arabic OCR systems
View through CrossRef
PurposeThe aim of this paper is to experimentally evaluate the effectiveness of the state-of-the-art printed Arabic text recognition systems to determine open areas for future improvements. In addition, this paper proposes a standard protocol with a set of metrics for measuring the effectiveness of Arabic optical character recognition (OCR) systems to assist researchers in comparing different Arabic OCR approaches.Design/methodology/approachThis paper describes an experiment to automatically evaluate four well-known Arabic OCR systems using a set of performance metrics. The evaluation experiment is conducted on a publicly available printed Arabic dataset comprising 240 text images with a variety of resolution levels, font types, font styles and font sizes.FindingsThe experimental results show that the field of character recognition for printed Arabic still requires further research to reach an efficient text recognition method for Arabic script.Originality/valueTo the best of the authors’ knowledge, this is the first work that provides a comprehensive automated evaluation of Arabic OCR systems with respect to the characteristics of Arabic script and, in addition, proposes an evaluation methodology that can be used as a benchmark by researchers and therefore will contribute significantly to the enhancement of the field of Arabic script recognition.
Title: Experimental evaluation of Arabic OCR systems
Description:
PurposeThe aim of this paper is to experimentally evaluate the effectiveness of the state-of-the-art printed Arabic text recognition systems to determine open areas for future improvements.
In addition, this paper proposes a standard protocol with a set of metrics for measuring the effectiveness of Arabic optical character recognition (OCR) systems to assist researchers in comparing different Arabic OCR approaches.
Design/methodology/approachThis paper describes an experiment to automatically evaluate four well-known Arabic OCR systems using a set of performance metrics.
The evaluation experiment is conducted on a publicly available printed Arabic dataset comprising 240 text images with a variety of resolution levels, font types, font styles and font sizes.
FindingsThe experimental results show that the field of character recognition for printed Arabic still requires further research to reach an efficient text recognition method for Arabic script.
Originality/valueTo the best of the authors’ knowledge, this is the first work that provides a comprehensive automated evaluation of Arabic OCR systems with respect to the characteristics of Arabic script and, in addition, proposes an evaluation methodology that can be used as a benchmark by researchers and therefore will contribute significantly to the enhancement of the field of Arabic script recognition.
Related Results
الإعلام العربي ومساهمته في ترويج اللغة العربية بالمجتمع الماليزي دراسة وصفية تحليلية
الإعلام العربي ومساهمته في ترويج اللغة العربية بالمجتمع الماليزي دراسة وصفية تحليلية
This study, entitled “Contribution of Arabic Media in Disseminating Arabic Language in Malaysian Society” aims at discovering the efforts of the Arabic-Malaysian media and its role...
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
Many Languages are spoken in the world. The diversity of human languages and colors are sign of Allah, for those of knowledge (Al-Quran, 30:22). Although the Arabic language origin...
Arabic Natural Language Processing
Arabic Natural Language Processing
The Arabic language presents researchers and developers of natural language processing (NLP) applications for Arabic text and speech with serious challenges. The purpose of this ar...
Arabic Learning for Academic Purposes
Arabic Learning for Academic Purposes
This study aimed to determine the goal of teaching Arabic for Academic purposes. Teaching Arabic for non-Arabic speakers is generally divided into two types: Arabic language for li...
Using Diacritics in the Arabic Script of Malay to Scaffold Arab Postgraduate Students in Reading Malay Words
Using Diacritics in the Arabic Script of Malay to Scaffold Arab Postgraduate Students in Reading Malay Words
Purpose – This study aims to investigate the use of diacritics in the Arabic script of Malay to facilitate Arab postgraduate students of UKM to read the Malay words accurately. It ...
Effective Arabic Language Teaching Strategies in the Language Laboratory for Students of Darussalam Gontor Islamic Institution
Effective Arabic Language Teaching Strategies in the Language Laboratory for Students of Darussalam Gontor Islamic Institution
Language is an important tool for the life of civilized man. Through language, people can communicate with each other, and convey their intentions and feelings to others. The moder...
Implementasi Optical Character Recognition (OCR) untuk Otomasi Penghitungan Tagihan Listrik
Implementasi Optical Character Recognition (OCR) untuk Otomasi Penghitungan Tagihan Listrik
With the advancement of technology, automation in various aspects of life has become a necessity, including in the electricity billing system. The use of Optical Character Recognit...
It's All Connected: A Survey for Multimodal Arabic AI
It's All Connected: A Survey for Multimodal Arabic AI
Abstract
Multimodal AI integrates text, vision, and speech within unified reasoning frameworks, yet Arabic remains significantly underrepresented due to diglossia, ...

