Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Fine-Tuned Hybrid Stacked CNN to Improve Bengali Handwritten Digit Recognition

View through CrossRef
Recognition of Bengali handwritten digits has several unique challenges, including the variation in writing styles, the different shapes and sizes of digits, the varying levels of noise, and the distortion in the images. Despite significant improvements, there is still room for further improvement in the recognition rate. By building datasets and developing models, researchers can advance state-of-the-art support, which can have important implications for various domains. In this paper, we introduce a new dataset of 5440 handwritten Bengali digit images acquired from a Bangladeshi University that is now publicly available. Both conventional machine learning and CNN models were used to evaluate the task. To begin, we scrutinized the results of the ML model used after integrating three image feature descriptors, namely Binary Pattern (LBP), Complete Local Binary Pattern (CLBP), and Histogram of Oriented Gradients (HOG), using principal component analysis (PCA), which explained 95% of the variation in these descriptors. Then, via a fine-tuning approach, we designed three customized CNN models and their stack to recognize Bengali handwritten digits. On handcrafted image features, the XGBoost classifier achieved the best accuracy at 85.29%, an ROC AUC score of 98.67%, and precision, recall, and F1 scores ranging from 85.08% to 85.18%, indicating that there was still room for improvement. On our own data, the proposed customized CNN models and their stack model surpassed all other models, reaching a 99.66% training accuracy and a 97.57% testing accuracy. In addition, to robustify our proposed CNN model, we used another dataset of Bengali handwritten digits obtained from the Kaggle repository. Our stack CNN model provided remarkable performance. It obtained a training accuracy of 99.26% and an almost equally remarkable testing accuracy of 96.14%. Without any rigorous image preprocessing, fewer epochs, and less computation time, our proposed CNN model performed the best and proved the most resilient throughout all of the datasets, which solidified its position at the forefront of the field.
Title: A Fine-Tuned Hybrid Stacked CNN to Improve Bengali Handwritten Digit Recognition
Description:
Recognition of Bengali handwritten digits has several unique challenges, including the variation in writing styles, the different shapes and sizes of digits, the varying levels of noise, and the distortion in the images.
Despite significant improvements, there is still room for further improvement in the recognition rate.
By building datasets and developing models, researchers can advance state-of-the-art support, which can have important implications for various domains.
In this paper, we introduce a new dataset of 5440 handwritten Bengali digit images acquired from a Bangladeshi University that is now publicly available.
Both conventional machine learning and CNN models were used to evaluate the task.
To begin, we scrutinized the results of the ML model used after integrating three image feature descriptors, namely Binary Pattern (LBP), Complete Local Binary Pattern (CLBP), and Histogram of Oriented Gradients (HOG), using principal component analysis (PCA), which explained 95% of the variation in these descriptors.
Then, via a fine-tuning approach, we designed three customized CNN models and their stack to recognize Bengali handwritten digits.
On handcrafted image features, the XGBoost classifier achieved the best accuracy at 85.
29%, an ROC AUC score of 98.
67%, and precision, recall, and F1 scores ranging from 85.
08% to 85.
18%, indicating that there was still room for improvement.
On our own data, the proposed customized CNN models and their stack model surpassed all other models, reaching a 99.
66% training accuracy and a 97.
57% testing accuracy.
In addition, to robustify our proposed CNN model, we used another dataset of Bengali handwritten digits obtained from the Kaggle repository.
Our stack CNN model provided remarkable performance.
It obtained a training accuracy of 99.
26% and an almost equally remarkable testing accuracy of 96.
14%.
Without any rigorous image preprocessing, fewer epochs, and less computation time, our proposed CNN model performed the best and proved the most resilient throughout all of the datasets, which solidified its position at the forefront of the field.

Related Results

OPTIMIZING CNN HYPERPARAMETERS FOR ENHANCED HANDWRITTEN DIGIT RECOGNITION ON CUSTOM DATASET: A SYSTEMATIC STUDY
OPTIMIZING CNN HYPERPARAMETERS FOR ENHANCED HANDWRITTEN DIGIT RECOGNITION ON CUSTOM DATASET: A SYSTEMATIC STUDY
Handwritten Digit Recognition is still an essential issue in artificial intelligence and pattern recognition. Convolutional Neural Networks (CNNs) have shown outstanding accuracy o...
Neural Machine Translation from Bengali Language to English language and vice-versa
Neural Machine Translation from Bengali Language to English language and vice-versa
Bengali ranks among the first ten spoken languages in the world with a native speaker numbering about 230 million people.  With UNESCO declaring 21st February as International Moth...
Digit Recognition Using Convolutional Neural Network
Digit Recognition Using Convolutional Neural Network
Digit detection using convolutional neural networks works as an exciting area of computer vision and machine learning. Developed to process and analyze visual data, they comprise s...
Farsi Digit Recognition Using GAN-Generated Data and Convolutional Neural Networks
Farsi Digit Recognition Using GAN-Generated Data and Convolutional Neural Networks
Handwritten digit recognition is one of the most active study areas in computer vision because to its numerous applications such as automatically identifying the digits in bank che...
A Web-based Intelligent Handwriting Education System for Autonomous Learning of Bengali Characters
A Web-based Intelligent Handwriting Education System for Autonomous Learning of Bengali Characters
In this paper, we describe a prototype of web-based intelligent handwriting education system for autonomous learning of Bengali characters. Bengali language is used by more than 21...
Implementasi Convolutional Neural Network dalam Mengenali Image Angka Tulisan Tangan
Implementasi Convolutional Neural Network dalam Mengenali Image Angka Tulisan Tangan
Abstract. Advances in information technology and artificial intelligence, particularly in the field of machine learning, have had a significant impact on various aspects of daily l...
CNN-RNN BASED HANDWRITTEN TEXT RECOGNITION
CNN-RNN BASED HANDWRITTEN TEXT RECOGNITION
At present most of the scripts are handwritten due to the ease of using a pen tip in place of a keyboard, hence errors are common due to illegibility of the human handwriting. To a...
Reform and Change in Early 20th Century Bengali Society: A Study of Chattopadhyay's Novel Nishkriti
Reform and Change in Early 20th Century Bengali Society: A Study of Chattopadhyay's Novel Nishkriti
The goal of this research is to examine the societal reforms and modifications that took place in early 20th-century Bengal as a result of the flourishing Bengali Renaissance, as p...

Back to Top