Javascript must be enabled to continue!
Improving the Accuracy of Text Classification using Stemming Method, A Case of Informal Indonesian Conversation
View through CrossRef
Abstract
As social beings, humans always interact with one another using either verbal or non-verbal language. Language is an arbitrary sound-symbol system, which is used by members of a community to cooperate, interact, and identify themselves. Indonesian language is classified into two categories, namely formal and non-formal. The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter tends to deviate it. In daily communication, however, non-formal language is more intensively used because they are more practical and easier to understand. With this tendency, non-formal language causes problems in linguistic computation because most linguistic computations use formal standard languages that already have standardized rules. This research aims to develop a dynamic Indonesian closed corpus related to airline ticket reservation, namely "Incorbiz". The "Incorbiz" will be used as stemming tool for formal and non-formal Indonesian. Text processing, text normalization, and auto-update data were proposed in this research. This research also compared two stemming techniques i.e. "Sastrawi" and "Incorbiz" to process the 30-sample dataset. The algorithm used to process the classification is Support Vector Machine (SVM). The data used to develop the "Incorbiz" were taken from conversations between customer service staff and consumers in airline ticket reservations. The result showed that "Incorbiz" had higher accuracy than "Sastrawi" on 0.89 and 0.67, respectively.
Springer Science and Business Media LLC
Title: Improving the Accuracy of Text Classification using Stemming Method, A Case of Informal Indonesian Conversation
Description:
Abstract
As social beings, humans always interact with one another using either verbal or non-verbal language.
Language is an arbitrary sound-symbol system, which is used by members of a community to cooperate, interact, and identify themselves.
Indonesian language is classified into two categories, namely formal and non-formal.
The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter tends to deviate it.
In daily communication, however, non-formal language is more intensively used because they are more practical and easier to understand.
With this tendency, non-formal language causes problems in linguistic computation because most linguistic computations use formal standard languages that already have standardized rules.
This research aims to develop a dynamic Indonesian closed corpus related to airline ticket reservation, namely "Incorbiz".
The "Incorbiz" will be used as stemming tool for formal and non-formal Indonesian.
Text processing, text normalization, and auto-update data were proposed in this research.
This research also compared two stemming techniques i.
e.
"Sastrawi" and "Incorbiz" to process the 30-sample dataset.
The algorithm used to process the classification is Support Vector Machine (SVM).
The data used to develop the "Incorbiz" were taken from conversations between customer service staff and consumers in airline ticket reservations.
The result showed that "Incorbiz" had higher accuracy than "Sastrawi" on 0.
89 and 0.
67, respectively.
Related Results
Funkcije komunikacijski relevantne šutnje u njemačkome
Funkcije komunikacijski relevantne šutnje u njemačkome
Additionally, this chapter presents research of silence with review of main aspects of papers in the field of conversational analysis, ethnography of communication and metaphor of ...
Improving the accuracy of text classification using stemming method, a case of non-formal Indonesian conversation
Improving the accuracy of text classification using stemming method, a case of non-formal Indonesian conversation
Abstract
Background
Stemming has long been used in data pre-processing to retrieve information by tracking affixed words ...
Improving the Accuracy of Text Classification using Stemming Method, A Case of Non-Formal Indonesian Conversation
Improving the Accuracy of Text Classification using Stemming Method, A Case of Non-Formal Indonesian Conversation
Abstract
Background: Stemming has long been used in data pre-processing to retrieve information by tracking affixed words back into their root. In an Indonesian setting, ex...
Improving the Accuracy of Text Classification using Stemming Method, A Case of Non-formal Indonesian Conversation
Improving the Accuracy of Text Classification using Stemming Method, A Case of Non-formal Indonesian Conversation
Abstract
Stemming has long been used in data pre-processing in information retrieval, which aims to make affix words into root words. However, there are not many stemming m...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct
Introduction
Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Computer-Mediated Chat
Computer-Mediated Chat
The technical apparatus is, then, being made at home with the rest of our world. And that's a thing that's routinely being done, and it's the source of the failure of technocratic ...

