Javascript must be enabled to continue!
OPTIMIZING CNN HYPERPARAMETERS FOR ENHANCED HANDWRITTEN DIGIT RECOGNITION ON CUSTOM DATASET: A SYSTEMATIC STUDY
View through CrossRef
Handwritten Digit Recognition is still an essential issue in artificial intelligence and pattern recognition. Convolutional Neural Networks (CNNs) have shown outstanding accuracy on standardized datasets such as MNIST. Still, overfitting and incorrect hyperparameter tuning can cause CNNs to perform worse when applied to noisy real-world data. For example, handwritten digits taken from realistic Sudoku boards provide variation and noise that are not present in benchmark datasets, making it difficult to generalize CNN models to these datasets. The present study gives a systematic approach to improve handwritten digit recognition on custom datasets by improving CNN hyperparameters. The primary dataset derived from Kaggle’s “Sudoku Digit Classification” comprises 70,000 grayscale images of the digits 1 through 9, including zero representing empty cells. we use 10,000 images for this study, 7000 for training, 1500 for validation, and 1500 for testing. A personal handwritten digit dataset is a custom dataset that has only 2 images per class, and 9 classes represent the real world with small data. It is artificially extended using various kinds of data augmentation techniques, including rotation, scaling, flipping, shear transformation, brightness and contrast correction, and noise addition. Data augmentation increases each class to 100 images. These techniques enhance the model’s performance on unknown data and help it to become more generalizable. Training occurs using an Adam optimizer, a batch size of 32, and an initial learning rate of 0.001. To optimize the model’s performance, these hyperparameters are properly tuned. The Kaggle dataset is used to train, validate, and test the model on unseen data, and the custom unseen dataset is used for testing. The proposed model indicated great potential for accurate handwritten digit recognition with a training accuracy of approximately 95% and a validation accuracy of up to 99% after 30 epochs. Strong generalization over unseen handwritten digits is determined by testing accuracy of 97% on the Sudoku dataset. Testing accuracy on the personal dataset is 94.44%, and testing accuracy on the augmented personal dataset is 77.85%.
Akademik Çalışmalar Derneği
Title: OPTIMIZING CNN HYPERPARAMETERS FOR ENHANCED HANDWRITTEN DIGIT RECOGNITION ON CUSTOM DATASET: A SYSTEMATIC STUDY
Description:
Handwritten Digit Recognition is still an essential issue in artificial intelligence and pattern recognition.
Convolutional Neural Networks (CNNs) have shown outstanding accuracy on standardized datasets such as MNIST.
Still, overfitting and incorrect hyperparameter tuning can cause CNNs to perform worse when applied to noisy real-world data.
For example, handwritten digits taken from realistic Sudoku boards provide variation and noise that are not present in benchmark datasets, making it difficult to generalize CNN models to these datasets.
The present study gives a systematic approach to improve handwritten digit recognition on custom datasets by improving CNN hyperparameters.
The primary dataset derived from Kaggle’s “Sudoku Digit Classification” comprises 70,000 grayscale images of the digits 1 through 9, including zero representing empty cells.
we use 10,000 images for this study, 7000 for training, 1500 for validation, and 1500 for testing.
A personal handwritten digit dataset is a custom dataset that has only 2 images per class, and 9 classes represent the real world with small data.
It is artificially extended using various kinds of data augmentation techniques, including rotation, scaling, flipping, shear transformation, brightness and contrast correction, and noise addition.
Data augmentation increases each class to 100 images.
These techniques enhance the model’s performance on unknown data and help it to become more generalizable.
Training occurs using an Adam optimizer, a batch size of 32, and an initial learning rate of 0.
001.
To optimize the model’s performance, these hyperparameters are properly tuned.
The Kaggle dataset is used to train, validate, and test the model on unseen data, and the custom unseen dataset is used for testing.
The proposed model indicated great potential for accurate handwritten digit recognition with a training accuracy of approximately 95% and a validation accuracy of up to 99% after 30 epochs.
Strong generalization over unseen handwritten digits is determined by testing accuracy of 97% on the Sudoku dataset.
Testing accuracy on the personal dataset is 94.
44%, and testing accuracy on the augmented personal dataset is 77.
85%.
Related Results
Digit Recognition Using Convolutional Neural Network
Digit Recognition Using Convolutional Neural Network
Digit detection using convolutional neural networks works as an exciting area of computer vision and machine learning. Developed to process and analyze visual data, they comprise s...
Sensitivity Analysis of the Hyperparameters of CNN for Precipitation Downscaling
Sensitivity Analysis of the Hyperparameters of CNN for Precipitation Downscaling
<p>Nowadays, a convolutional neural network (CNN), which is a kind of deep neural network has been shown to have high applicability to precipitation downscaling in pr...
A Fine-Tuned Hybrid Stacked CNN to Improve Bengali Handwritten Digit Recognition
A Fine-Tuned Hybrid Stacked CNN to Improve Bengali Handwritten Digit Recognition
Recognition of Bengali handwritten digits has several unique challenges, including the variation in writing styles, the different shapes and sizes of digits, the varying levels of ...
Farsi Digit Recognition Using GAN-Generated Data and Convolutional Neural Networks
Farsi Digit Recognition Using GAN-Generated Data and Convolutional Neural Networks
Handwritten digit recognition is one of the most active study areas in computer vision because to its numerous applications such as automatically identifying the digits in bank che...
Implementasi Convolutional Neural Network dalam Mengenali Image Angka Tulisan Tangan
Implementasi Convolutional Neural Network dalam Mengenali Image Angka Tulisan Tangan
Abstract. Advances in information technology and artificial intelligence, particularly in the field of machine learning, have had a significant impact on various aspects of daily l...
CNN-RNN BASED HANDWRITTEN TEXT RECOGNITION
CNN-RNN BASED HANDWRITTEN TEXT RECOGNITION
At present most of the scripts are handwritten due to the ease of using a pen tip in place of a keyboard, hence errors are common due to illegibility of the human handwriting. To a...
Custom order entry for Parkinson’s medications in the hospital improves timely administration: an analysis of over 31,000 medication doses
Custom order entry for Parkinson’s medications in the hospital improves timely administration: an analysis of over 31,000 medication doses
BackgroundPatients with Parkinson’s disease (PD) are at increased risk for hospital acquired complications. Deviations from home medication schedules and delays in administration a...
Investigating the effects of hyperparameter sensitivity on machine learning algorithms for PV forecasting
Investigating the effects of hyperparameter sensitivity on machine learning algorithms for PV forecasting
Machine Learning (ML) models have been introduced in the past, and users have debated whether to tune the hyperparameters of the models. This study investigates the effects of tuni...

