
A Method for Arabic Handwritten Diacritics Characters

Optical Character Recognition (OCR) is the process of converting an image of a document into an editable format. People recognize characters without difficulty when reading papers or books. However, developing an OCR system that can read and recognize Arabic diacritic characters as well as a human does remains a problem. More specifically, the poor recognition rate of most optical diacritic character recognition systems is mainly attributed to failure to segment handwritten text correctly. To overcome this problem, we develop a method based on seven operations. It starts by finding the text-line height, then reads words from the line, and then identifies the diacritic regions. Segmentation is also applied during this operation by converting the text-line into a grayscale image and then a binary image. Moreover, we introduce a new model based on the k-nearest neighbors (KNN) algorithm to identify diacritics and segment characters. The KNN model is trained to predict the diacritic directly from the text-line. Finally, we offer an evaluation and discussion of optical diacritic character recognition.
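The steps named in the abstract (grayscale and binary conversion, text-line height, KNN prediction) can be illustrated with a minimal sketch. This is not the authors' implementation: the fixed threshold, the row-projection rule for finding text lines, the two-number feature vectors, and the helper names (`to_grayscale`, `binarize`, `text_line_spans`, `knn_predict`) are all assumptions chosen for illustration, and the abstract does not say which features the KNN model uses.

```python
import math
from collections import Counter


def to_grayscale(rgb_image):
    """Convert rows of (r, g, b) pixels to gray levels (BT.601 luma weights)."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in rgb_image]


def binarize(gray, threshold=128):
    """Mark ink as 1 where a pixel is darker than the threshold.

    The fixed threshold is an assumption; the abstract does not name one.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def text_line_spans(binary):
    """Return (top, bottom) row indices of runs of rows that contain ink.

    This is a simple horizontal-projection rule, assumed for illustration.
    """
    spans, start = [], None
    for y, row in enumerate(binary):
        has_ink = sum(row) > 0
        if has_ink and start is None:
            start = y
        elif not has_ink and start is not None:
            spans.append((start, y - 1))
            start = None
    if start is not None:
        spans.append((start, len(binary) - 1))
    return spans


def knn_predict(train, query, k=3):
    """Majority vote among the k training samples nearest to the query.

    train is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Toy 5x3 image: rows 1, 2 and 4 contain dark (ink) pixels.
W, B = (255, 255, 255), (0, 0, 0)
image = [[W, W, W], [W, B, W], [B, B, W], [W, W, W], [W, B, W]]
spans = text_line_spans(binarize(to_grayscale(image)))
heights = [bottom - top + 1 for top, bottom in spans]

# Hypothetical features: (relative vertical position, width/height ratio).
train = [((0.10, 1.0), "fatha"), ((0.15, 1.1), "fatha"),
         ((0.90, 1.0), "kasra"), ((0.85, 0.9), "kasra"),
         ((0.10, 0.5), "sukun")]
print(spans, heights, knn_predict(train, (0.12, 1.05)))
```

In this toy setup a mark near the top of the line lands among the "fatha" samples and one near the bottom among the "kasra" samples, which mirrors where those diacritics sit relative to the letter. A real system would need features extracted from the segmented diacritic regions rather than hand-written vectors.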

Related Results

Using Diacritics in the Arabic Script of Malay to Scaffold Arab Postgraduate Students in Reading Malay Words
Purpose – This study aims to investigate the use of diacritics in the Arabic script of Malay to help Arab postgraduate students at UKM read Malay words accurately. It ...
Automatic Diacritics Restoration for Tunisian Dialect
Modern Standard Arabic, as well as Arabic dialects, is usually written without diacritics. The absence of these marks constitutes a real problem in the automatic processin...
Pronunciation Errors in Arabic YouTube Videos Narrated by AI
Arabic has three long vowels /a:/, /u:/, /i:/ and three short vowels /a/, /u/, /i/, which are represented by diacritics marked over and under consonant letters. In words that have sh...
Arabic Language Teaching in Arabic Preparatory Schools
This study aims to highlight, describe and analyse the experiment conducted at the Arabic Preparatory School for Girls in Bandar Seri Begawan (SPABSB) and explore how it can be uti...
ON-LINE HANDWRITTEN ARABIC CHARACTER RECOGNITION BASED ON GENETIC ALGORITHM
On-line Arabic handwritten character recognition is one of the most challenging problems in the pattern recognition field. By now, printed Arabic character recognition and on-line Arab...
Segmentation techniques for Arabic handwritten: a review
Image segmentation refers to the process of partitioning a page into distinct sections. This technique aims to improve and transform the image's representation into a more coherent...
The Poem "The Arabic Language Laments Its Fortune Among Its People" by Hafiz Ibrahim: An Analytical Study
Many languages are spoken in the world. The diversity of human languages and colors is a sign of Allah for those of knowledge (Al-Quran, 30:22). Although the Arabic language origin...
