
OPTIMIZING CNN HYPERPARAMETERS FOR ENHANCED HANDWRITTEN DIGIT RECOGNITION ON CUSTOM DATASET: A SYSTEMATIC STUDY

Handwritten digit recognition remains an important problem in artificial intelligence and pattern recognition. Convolutional Neural Networks (CNNs) achieve outstanding accuracy on standardized datasets such as MNIST. However, overfitting and poor hyperparameter tuning can degrade CNN performance on noisy real-world data. For example, handwritten digits taken from realistic Sudoku boards exhibit variation and noise that are absent from benchmark datasets, which makes it difficult for CNN models to generalize to them. This study presents a systematic approach to improving handwritten digit recognition on custom datasets by tuning CNN hyperparameters. The primary dataset, derived from Kaggle’s “Sudoku Digit Classification”, comprises 70,000 grayscale images of the digits 1 through 9, plus zero, which represents empty cells. We use 10,000 of these images: 7,000 for training, 1,500 for validation, and 1,500 for testing. A personal handwritten digit dataset serves as the custom dataset; with only 2 images per class across 9 classes, it represents a real-world small-data setting. It is artificially extended using data augmentation techniques including rotation, scaling, flipping, shear transformation, brightness and contrast correction, and noise addition, which increase each class to 100 images. These techniques are intended to improve the model’s performance on unseen data and make it more generalizable. Training uses the Adam optimizer with a batch size of 32 and an initial learning rate of 0.001; these hyperparameters are tuned to optimize the model’s performance. The Kaggle dataset is used to train, validate, and test the model, and the custom dataset is used only for testing. After 30 epochs, the proposed model reaches a training accuracy of approximately 95% and a validation accuracy of up to 99%.
Testing accuracy of 97% on the Sudoku dataset indicates strong generalization to unseen handwritten digits. Testing accuracy is 94.44% on the personal dataset and 77.85% on the augmented personal dataset.
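The dataset figures in the abstract can be checked with a small sketch. The code below is illustrative only and is not the authors' pipeline: the function names, the seeded shuffle, and the assumption that the 100 images per class include the 2 originals are all hypothetical choices made for this example.

```python
import random


def split_indices(n_total=10_000, n_train=7_000, n_val=1_500, n_test=1_500, seed=0):
    """Shuffle indices 0..n_total-1 and cut them into three disjoint subsets.

    The sizes match the 7,000 / 1,500 / 1,500 split stated in the abstract;
    the seeded shuffle is an assumption, since the abstract does not say how
    the subset was drawn.
    """
    if n_train + n_val + n_test != n_total:
        raise ValueError("split sizes must sum to n_total")
    indices = list(range(n_total))
    random.Random(seed).shuffle(indices)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]
    return train, val, test


def augmentation_plan(originals_per_class=2, n_classes=9, target_per_class=100):
    """Count how many augmented images are needed to reach the target size.

    Assumes (hypothetically) that the target of 100 images per class
    includes the original images.
    """
    new_per_class = target_per_class - originals_per_class
    return {
        "total_before": originals_per_class * n_classes,
        "total_after": target_per_class * n_classes,
        "new_per_class": new_per_class,
        "variants_per_original": new_per_class // originals_per_class,
    }


def accuracy_pct(correct, total):
    """Accuracy as a percentage rounded to two decimals."""
    return round(100.0 * correct / total, 2)


if __name__ == "__main__":
    train, val, test = split_indices()
    print(len(train), len(val), len(test))
    print(augmentation_plan())
    print(accuracy_pct(17, 18))
```

Under these assumptions the personal dataset grows from 18 to 900 images, which requires 49 augmented variants per original image. The reported 94.44% on the 18-image personal set is numerically consistent with 17 correct predictions out of 18, although the abstract does not state the count.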

Related Results

Implementasi Convolutional Neural Network dalam Mengenali Image Angka Tulisan Tangan [Implementation of a Convolutional Neural Network for Recognizing Handwritten Digit Images]
Abstract. Advances in information technology and artificial intelligence, particularly in the field of machine learning, have had a significant impact on various aspects of daily l...
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
This review summarizes the evidence from six randomized controlled trials that judged the effectiveness of systematic review summaries on policymakers' decision making, or the most...
Filtering Approaches and Mish Activation Function Applied on Handwritten Chinese Character Recognition
Handwritten Chinese Characters (HCC) have recently received much attention as a global means of exchanging information and knowledge. The start of the information age has increased...
ON-LINE HANDWRITTEN ARABIC CHARACTER RECOGNITION BASED ON GENETIC ALGORITHM
On-line Arabic handwritten character recognition is one of the most challenging problems in pattern recognition field. By now, printed Arabic character recognition and on-line Arab...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Implementing handwritten text recognition using deep learning with TensorFlow: An MNIST dataset approach
Handwritten text recognition (HTR) is a pivotal technology with extensive applications in document digitization, postal automation, and educational tools. This paper delves into th...
Task instructions modulate unit-decade binding in two-digit number representation
Previous studies have found decomposed processes, as well as holistic processes, in the representation of two-digit numbers. The present study investigated the influence of task in...
Identifying Links Between Latent Memory and Speech Recognition Factors
Objectives: The link between memory ability and speech recognition accuracy is often examined by correlating summary measures of performance across various tasks, but i...
