Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

AraSpell: A Deep Learning Approach for Arabic Spelling Correction

View through CrossRef
Abstract Spelling correction is the task of identifying spelling mistakes, typos, and grammatical mistakes in a given text and correcting them according to their context and grammatical structure. This work introduces "AraSpell," a framework for Arabic spelling correction using different seq2seq model architectures such as Recurrent Neural Network (RNN) and Transformer with artificial data generation for error injection, trained on more than 6.9 Million Arabic sentences. Thorough experimental studies provide empirical evidence of the effectiveness of the proposed approach, which achieved 4.8% and 1.11% word error rate (WER) and character error rate (CER), respectively, in comparison with labeled data of 29.72% WER and 5.03% CER. Our approach achieved 2.9% CER and 10.65% WER in comparison with labeled data of 10.02% CER and 50.94% WER. Both of these results are obtained on a test set of 100K sentences.
Springer Science and Business Media LLC
Title: AraSpell: A Deep Learning Approach for Arabic Spelling Correction
Description:
Abstract Spelling correction is the task of identifying spelling mistakes, typos, and grammatical mistakes in a given text and correcting them according to their context and grammatical structure.
This work introduces "AraSpell," a framework for Arabic spelling correction using different seq2seq model architectures such as Recurrent Neural Network (RNN) and Transformer with artificial data generation for error injection, trained on more than 6.
9 Million Arabic sentences.
Thorough experimental studies provide empirical evidence of the effectiveness of the proposed approach, which achieved 4.
8% and 1.
11% word error rate (WER) and character error rate (CER), respectively, in comparison with labeled data of 29.
72% WER and 5.
03% CER.
Our approach achieved 2.
9% CER and 10.
65% WER in comparison with labeled data of 10.
02% CER and 50.
94% WER.
Both of these results are obtained on a test set of 100K sentences.

Related Results

The contributions of reading and phonological awareness for spelling in grade three isiXhosa learners
The contributions of reading and phonological awareness for spelling in grade three isiXhosa learners
One factor, which is consistently highlighted in research on literacy, is the lack of understanding of how literacy develops in the Southern-Bantu languages. In particular, little ...
QALB: Qatar Arabic language bank
QALB: Qatar Arabic language bank
Automatic text correction has been attracting research attention for English and some other western languages. Applications for automatic text correction vary from improving langua...
Pan, Rickard, and Bjork (2021) Does spelling still matter—And if so, how should it be taught?
Pan, Rickard, and Bjork (2021) Does spelling still matter—And if so, how should it be taught?
A century ago, spelling skills were highly valued and widely taught in schools using traditional methods, such as weekly lists, drill exercises, and low- and high-stakes spelling t...
TEACHING SPELLING THROUGH GAMES
TEACHING SPELLING THROUGH GAMES
Games have been believed to be good media in assisting teaching for years. Games are believed can promote learning become more interesting. Many studies have been conducted on util...
Using an app-based screening tool to predict deficits in written word spelling at school entry
Using an app-based screening tool to predict deficits in written word spelling at school entry
IntroductionThe first year of schooling is crucial for the further development of spelling abilities in children, which makes early assessment and intervention essential. The aim o...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
قصيد”اللغة العربية تنعى حظها بين أهلها“ لحافظ ابراهيم: دراسة تحليلية
Many Languages are spoken in the world. The diversity of human languages and colors are sign of Allah, for those of knowledge (Al-Quran, 30:22). Although the Arabic language origin...

Back to Top