Balochi Text Segmentation for Establishing Balochi OCR

Optical character recognition (OCR) is considered the fastest method of data entry; the intelligent conversion of handwritten text is called handwritten text recognition. Many languages already have OCR systems, but some still lack one. Balochi is one of the national languages of Pakistan, and most of its speakers live in the Baluchistan province of Pakistan. Balochi computing is in its infancy and needs attention on many fronts to reach the level of computational support that other languages enjoy. This paper investigates the relationship between Balochi and other languages that have adopted the Arabic script, and proposes a segmentation algorithm that splits Balochi text paragraphs into lines, lines into words, and words into characters. The algorithm has been adapted and fine-tuned to reach an accuracy of 95%. The segmentation algorithm will contribute to the development of a complete OCR and handwriting recognition system for the Balochi language.
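The abstract does not say how the segmentation is carried out. As a rough illustration only, the paragraph-to-line, line-to-word, and word-to-character cascade can be sketched with projection profiles, a common technique for printed text that is assumed here and is not claimed to be the paper's method. The function names (`runs`, `segment_lines`, `segment_columns`), the 0/1 image representation, and the gap thresholds are all hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]   # binarized page: 1 = ink, 0 = background (assumed format)
Span = Tuple[int, int]    # half-open interval [start, end)


def runs(profile: List[int], min_gap: int = 1) -> List[Span]:
    """Return the intervals where the profile is non-zero.

    Intervals separated by fewer than `min_gap` empty positions are merged.
    """
    spans: List[Span] = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i                      # a run of ink begins
        elif value == 0 and start is not None:
            spans.append((start, i))       # the run ends at the first empty position
            start = None
    if start is not None:
        spans.append((start, len(profile)))

    merged: List[Span] = []
    for s, e in spans:
        if merged and s - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], e)  # gap too small: same unit
        else:
            merged.append((s, e))
    return merged


def segment_lines(img: Image) -> List[Span]:
    """Split a page into text lines using the horizontal projection profile."""
    return runs([sum(row) for row in img])


def segment_columns(img: Image, top: int, bottom: int, min_gap: int) -> List[Span]:
    """Split one text line (rows top..bottom) using the vertical projection profile.

    A larger `min_gap` yields words; `min_gap=1` yields unconnected components.
    """
    width = len(img[0])
    profile = [sum(img[r][c] for r in range(top, bottom)) for c in range(width)]
    return runs(profile, min_gap)


if __name__ == "__main__":
    page = [
        [0, 0, 0, 0, 0, 0, 0, 0],
        [1, 1, 0, 0, 0, 1, 0, 1],
        [1, 1, 0, 0, 0, 1, 0, 1],
        [0, 0, 0, 0, 0, 0, 0, 0],
        [0, 1, 1, 1, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0],
    ]
    for top, bottom in segment_lines(page):
        words = segment_columns(page, top, bottom, min_gap=2)
        chars = segment_columns(page, top, bottom, min_gap=1)
        print((top, bottom), words, chars)
```

Because Arabic-script text is cursive and written right to left, blank-column gaps separate only unconnected components; splitting joined letters inside a word would need additional rules that this sketch does not attempt.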

Related Results

A Grammar of Modern Standard Balochi
Balochi is an Iranian language spoken in Pakistan, Iran, Afghanistan, the Gulf States (particularly Oman and the United Arab Emirates), Turkmenistan, India, and East Africa. Inform...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Linguistic Links between Balochi and Urdu Language
The Balochi language holds a significant position among the languages spoken in Pakistan. The accent of the Balochi language is categorized into two variations, Eastern Balochi acce...
Bounds on the sum of broadcast domination number and strong metric dimension of graphs
Let [Formula: see text] be a connected graph of order at least two with vertex set [Formula: see text]. For [Formula: see text], let [Formula: see text] denote the length of an [Fo...
بلوچی میں سعادت حسن منٹو کے افسانوں کے تراجم کا تاریخی و تنقیدی جائزہ
The translation of short stories from various languages into Balochi has played a pivotal role in shaping modern Balochi literature, with Urdu being a particularly significant sour...
Optical character recognition based document image quality assessment
Optical Character Recognition (OCR) systems play a crucial role in digitizing documents. However, their performance significantly deteriorates when handling low-quality images. Eve...