Javascript must be enabled to continue!
Exploring topological data analysis for information extraction: application to recognition of Arabic machine-printed numerals
View through CrossRef
AbstractThis manuscript explores the capability of topological data analysis (TDA) based on homology theory (HT: a subfield of algebraic topology) to extract relevant information for recognition of confusing Arabic machine-printed numerals. In fact, topological properties may significantly reduce the confusion between some numerals such as “1” and “4” in the context of small data sets. These two latter digits differ in the sense that digit 1 has no hole and digit 4 has one hole. Our contribution consists of evaluating the contribution of TDA with its invariant descriptors such as Betti numbers in machine-printed Arabic numerals recognition. Our investigation is driven by the following set of actions: (i) we extract Betti numbers invariant features of each numeral image and partition the ten numerals into three different clusters with respect to these features. (ii) We then perform a classification by assigning a test image to its corresponding cluster, and map this image to a numeral using dynamic-time warping as a metric defined in the Freemans’ chaincode space. We compared our proposed approach with major state-of-the-art methods depicting various ways of using TDA in character recognition. The advantages and limitations of TDA (including its pros and cons) are discussed further based on numeral recognition results.
Springer Science and Business Media LLC
Title: Exploring topological data analysis for information extraction: application to recognition of Arabic machine-printed numerals
Description:
AbstractThis manuscript explores the capability of topological data analysis (TDA) based on homology theory (HT: a subfield of algebraic topology) to extract relevant information for recognition of confusing Arabic machine-printed numerals.
In fact, topological properties may significantly reduce the confusion between some numerals such as “1” and “4” in the context of small data sets.
These two latter digits differ in the sense that digit 1 has no hole and digit 4 has one hole.
Our contribution consists of evaluating the contribution of TDA with its invariant descriptors such as Betti numbers in machine-printed Arabic numerals recognition.
Our investigation is driven by the following set of actions: (i) we extract Betti numbers invariant features of each numeral image and partition the ten numerals into three different clusters with respect to these features.
(ii) We then perform a classification by assigning a test image to its corresponding cluster, and map this image to a numeral using dynamic-time warping as a metric defined in the Freemans’ chaincode space.
We compared our proposed approach with major state-of-the-art methods depicting various ways of using TDA in character recognition.
The advantages and limitations of TDA (including its pros and cons) are discussed further based on numeral recognition results.
Related Results
Cak numerals
Cak numerals
Cak is a Luish language of the Tibeto-Burman language family and it is spoken mainly in the Naikhyongchari subdistrict of Bandarban district, Chittagong Hill Tracts (henceforth, CH...
DISCOVERING THE EFFECTIVENESS OF TEACHING METHODS IN TEACHING COMMUNICATIVE ARABIC AT SULTAN SHARIF ALI ISLAMIC UNIVERSITY: FACULTY OF ARABIC LANGUAGE AS CASE STUDY
DISCOVERING THE EFFECTIVENESS OF TEACHING METHODS IN TEACHING COMMUNICATIVE ARABIC AT SULTAN SHARIF ALI ISLAMIC UNIVERSITY: FACULTY OF ARABIC LANGUAGE AS CASE STUDY
This research aims to identify the effectiveness of the objectives of teaching communicative Arabic at the Faculty of Arabic Language at Sultan Sharif Ali Islamic University in the...
Numerals in Koracha
Numerals in Koracha
Abstract
The paper examines Koracha numerals to comprehend their internal structure and historical development. This study
has revealed several intriguing features within ...
Arabic Language Teaching in Arabic Preparatory Schools
Arabic Language Teaching in Arabic Preparatory Schools
This study aims to highlight, describe and analyse the experiment conducted at the Arabic Preparatory School for Girls in Bandar Seri Begawan (SPABSB) and explore how it can be uti...
COMPARING NUMERALS IN OROCHON AND EVENKI
COMPARING NUMERALS IN OROCHON AND EVENKI
The current article focuses upon the results of the numerals analysis and the ways of their forming in the Orochon language. The obtained data are compared with the existing data f...
DENOMINATING APPROXIMATE NUMBERS IN CLASSICAL CHINESE (WENYAN)
DENOMINATING APPROXIMATE NUMBERS IN CLASSICAL CHINESE (WENYAN)
Актуальность настоящего диахронического исследования обусловлена, во-первых, наличием одинаковых моделей обозначения приблизительных, дробных и кратных чисел в вэньяне; во-вторых, ...
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
Many Languages are spoken in the world. The diversity of human languages and colors are sign of Allah, for those of knowledge (Al-Quran, 30:22). Although the Arabic language origin...
Reprogrammable plasmonic topological insulators with ultrafast control
Reprogrammable plasmonic topological insulators with ultrafast control
Abstract
Topological photonics has revolutionized our understanding of light propagation, providing a remarkably robust way to manipulate light. Despite the intensive resea...

