Deteriorated image classification model for Malayalam palm leaf manuscripts
The document image classification method presented in this paper focuses on six categories of Malayalam palm leaf manuscripts. The proposed approach consists of three phases: dataset analysis, in which a Bag of Words repository is built; recognition; and classification using a voting approach. In the dataset analysis phase, the palm leaf manuscripts are first subjected to pre-processing and subjective analysis to create the Bag of Words repository. Next, the textual components of the manuscripts are extracted and recognized using Tesseract 4 OCR, with both default and self-adapted training sets, and a deep learning algorithm. In the third phase, the Bag of Words approach categorizes the palm leaf manuscripts through a voting process based on the textual components recognized by OCR. Experiments evaluated the proposed approach with and without voting, varying the size of the Bag of Words, with default and self-adapted training datasets, using Tesseract OCR and a deep learning model. The results indicate that the Bag of Words technique with Tesseract OCR performs comparably with and without voting. For document classification, an overall accuracy of 83% without voting and 84.5% with voting is achieved, with an F-score of 0.90 in both cases, using Tesseract OCR. Overall, the trial-wise experiments with the Bag of Words show the proposed approach to be highly generalizable, offering a reliable way to classify deteriorated handwritten Malayalam palm leaf manuscripts.
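The classification phase described above can be illustrated with a minimal sketch. The abstract does not specify how the repository is built or how votes are cast, so everything here is an assumption made for illustration: the function names `build_bag_of_words` and `classify_by_voting`, the rule of keeping the most frequent words per category, the one-vote-per-matching-word scheme, and the placeholder category names and English tokens (real input would be Malayalam text returned by OCR).

```python
from collections import Counter


def build_bag_of_words(labelled_texts, size):
    """Build one bag per category from the `size` most frequent words.

    Assumption: frequency-based selection; the paper may select words differently.
    """
    bags = {}
    for category, texts in labelled_texts.items():
        counts = Counter(word for text in texts for word in text.split())
        bags[category] = {word for word, _ in counts.most_common(size)}
    return bags


def classify_by_voting(recognized_words, bags):
    """Each recognized word casts one vote for every category whose bag contains it.

    Returns the category with the most votes, or None if no word matched.
    Ties are resolved in favor of the category that received a vote first.
    """
    votes = Counter()
    for word in recognized_words:
        for category, bag in bags.items():
            if word in bag:
                votes[category] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Hypothetical training text for two placeholder categories.
labelled = {
    "medicine": ["herb root oil herb", "oil fever herb"],
    "astrology": ["star planet moon", "planet star sign"],
}
bags = build_bag_of_words(labelled, size=3)

# Words as they might come back from OCR for one manuscript image.
predicted = classify_by_voting(["herb", "oil", "star"], bags)
print(predicted)
```

Varying the `size` argument corresponds loosely to the experiments that vary the size of the Bag of Words; a run without voting would need a different decision rule, which the abstract does not describe.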
SAGE Publications

