
Exploring topological data analysis for information extraction: application to recognition of Arabic machine-printed numerals

Abstract: This manuscript explores the capability of topological data analysis (TDA), based on homology theory (HT, a subfield of algebraic topology), to extract relevant information for recognizing easily confused machine-printed Arabic numerals. Topological properties may significantly reduce the confusion between some numerals, such as “1” and “4”, when data sets are small: digit 1 has no hole, whereas digit 4 has one. Our contribution is an evaluation of what TDA, with invariant descriptors such as Betti numbers, adds to machine-printed Arabic numeral recognition. The investigation proceeds in two steps: (i) we extract Betti-number invariant features from each numeral image and partition the ten numerals into three clusters according to these features; (ii) we then classify a test image by assigning it to its corresponding cluster and mapping it to a numeral using dynamic time warping as a metric defined in the Freeman chain code space. We compare the proposed approach with major state-of-the-art methods that use TDA in various ways for character recognition. The advantages and limitations of TDA are then discussed in light of the numeral recognition results.
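Step (i) can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a binarized glyph stored as a list of rows (1 for ink, 0 for background) and computes the two Betti numbers relevant to a 2D image: b0, the number of connected ink components, and b1, the number of holes. The function names, the choice of 8-connectivity for ink and 4-connectivity for background, and the toy glyphs are all assumptions made for illustration.

```python
from collections import deque

# Assumed connectivity convention: 8-neighbours for ink, 4-neighbours for background.
N8 = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]
N4 = [(-1, 0), (0, -1), (0, 1), (1, 0)]


def count_components(grid, value, offsets):
    """Count connected regions of cells equal to `value` using breadth-first search."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != value or seen[r][c]:
                continue
            count += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for dy, dx in offsets:
                    ny, nx = y + dy, x + dx
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and not seen[ny][nx] and grid[ny][nx] == value:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return count


def betti_numbers(image):
    """Return (b0, b1) for a binary image given as a list of rows of 0/1."""
    cols = len(image[0])
    # Pad with background so the outer region forms exactly one component.
    padded = [[0] * (cols + 2)]
    padded += [[0] + list(row) + [0] for row in image]
    padded += [[0] * (cols + 2)]
    b0 = count_components(padded, 1, N8)
    # Every background component except the outer one is a hole.
    b1 = count_components(padded, 0, N4) - 1
    return b0, b1


# Toy glyphs (hypothetical, hand-drawn for this sketch).
ONE = [[0, 1, 0],
       [0, 1, 0],
       [0, 1, 0]]
FOUR = [[0, 0, 1, 0],
        [0, 1, 1, 0],
        [1, 0, 1, 0],
        [1, 1, 1, 1],
        [0, 0, 1, 0]]
EIGHT = [[1, 1, 1],
         [1, 0, 1],
         [1, 1, 1],
         [1, 0, 1],
         [1, 1, 1]]

for name, glyph in [("1", ONE), ("4", FOUR), ("8", EIGHT)]:
    print(name, betti_numbers(glyph))
```

On these toy glyphs the hole count alone separates the three shapes (zero, one, and two holes). The abstract does not say which numerals fall into which of its three clusters, so the grouping shown here is only an example of the idea.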
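Step (ii) compares chain codes with dynamic time warping. The sketch below is one possible reading, not the authors' method: it assumes each glyph contour has already been encoded as a Freeman chain code (a list of directions 0 to 7), and it uses the circular difference between directions as the local cost, which is an assumption. The template names and sequences are hypothetical and do not correspond to real numeral contours.

```python
def chain_distance(a, b):
    """Circular distance between two Freeman directions in 0..7 (assumed local cost)."""
    d = abs(a - b) % 8
    return min(d, 8 - d)


def dtw(seq_a, seq_b):
    """Classic dynamic time warping cost between two chain-code sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] is the best alignment cost of seq_a[:i] with seq_b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = chain_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # repeat an element of seq_b
                                    cost[i][j - 1],      # repeat an element of seq_a
                                    cost[i - 1][j - 1])  # advance both sequences
    return cost[n][m]


def nearest_template(query, templates):
    """Return the label of the template with the smallest DTW cost to `query`."""
    return min(templates, key=lambda label: dtw(query, templates[label]))


# Hypothetical templates belonging to one cluster.
templates = {
    "template_a": [0, 0, 2, 2],
    "template_b": [4, 4, 6, 6],
}
query = [0, 2, 2]
print(nearest_template(query, templates))
```

Restricting the comparison to templates from the test image's own cluster keeps the number of DTW evaluations small. Note that DTW can violate the triangle inequality, so it is a dissimilarity measure rather than a metric in the strict mathematical sense.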

Related Results

Cak numerals
Cak is a Luish language of the Tibeto-Burman language family and it is spoken mainly in the Naikhyongchari subdistrict of Bandarban district, Chittagong Hill Tracts (henceforth, CH...
Numerals in Koracha
The paper examines Koracha numerals to comprehend their internal structure and historical development. This study has revealed several intriguing features within ...
Arabic Language Teaching in Arabic Preparatory Schools
This study aims to highlight, describe and analyse the experiment conducted at the Arabic Preparatory School for Girls in Bandar Seri Begawan (SPABSB) and explore how it can be uti...
COMPARING NUMERALS IN OROCHON AND EVENKI
The current article focuses on the results of an analysis of numerals and the ways they are formed in the Orochon language. The data obtained are compared with the existing data f...
DENOMINATING APPROXIMATE NUMBERS IN CLASSICAL CHINESE (WENYAN)
The relevance of this diachronic study stems, first, from the existence of identical patterns for denoting approximate, fractional, and multiple numbers in Wenyan; second, ...
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
Many languages are spoken in the world. The diversity of human languages and colors is a sign of Allah for those of knowledge (Al-Quran, 30:22). Although the Arabic language origin...
Reprogrammable plasmonic topological insulators with ultrafast control
Topological photonics has revolutionized our understanding of light propagation, providing a remarkably robust way to manipulate light. Despite the intensive resea...