Javascript must be enabled to continue!
Peculiarities of the Arabic Language Processing: Morphological Modeling
View through CrossRef
The paper deals with the features of morphological modeling of the Arabic language based on the definition of the specifics of its formalization. Morphological modeling is one of the key stages of automatic text analysis and includes tools for building a word form to a stem, root, definition of a part of speech, automatic construction (generation) of a given word form, etc. The objectives of the study are interdisciplinary in nature and include both the theoretical aspects of studying the features of the Arabic language, which are most relevant for its automatic processing, and the study of existing morphological analyzers and determining the specifics of their work. The practical part is based on testing the CAMeL TOOLS, one of the advantages of which is its comprehensive nature, which allows both preprocessing of text and solving applied problems, including sentiment analysis. The criteria for selecting examples for testing took into account the features of the Arabic language, which are difficult for its formalization (segmentation of functional words with continuous spelling, morphological and lexical homonymy, etc.). The variability of the generalized concept of "the Arabic language" is taken into account, which combines classical Arabic, Modern Standard Arabic and modern Arabic dialects. Testing tools for morphological modeling allows us to draw conclusions about the need to improve the terminological apparatus, the variability of which is noted in the description of word forms. Such kind of variation (divergence from the concepts accepted in general linguistics) potentially leads to a distortion of the results of lexico-semantic analysis. During the analysis, some gaps were noted related to the definition of part-of-speech belonging, the description of word forms, etc. The results of the study are relevant both for linguistic research and for improving the development of software applications aimed at processing the Arabic text.
Saint Petersburg State University
Title: Peculiarities of the Arabic Language Processing: Morphological Modeling
Description:
The paper deals with the features of morphological modeling of the Arabic language based on the definition of the specifics of its formalization.
Morphological modeling is one of the key stages of automatic text analysis and includes tools for building a word form to a stem, root, definition of a part of speech, automatic construction (generation) of a given word form, etc.
The objectives of the study are interdisciplinary in nature and include both the theoretical aspects of studying the features of the Arabic language, which are most relevant for its automatic processing, and the study of existing morphological analyzers and determining the specifics of their work.
The practical part is based on testing the CAMeL TOOLS, one of the advantages of which is its comprehensive nature, which allows both preprocessing of text and solving applied problems, including sentiment analysis.
The criteria for selecting examples for testing took into account the features of the Arabic language, which are difficult for its formalization (segmentation of functional words with continuous spelling, morphological and lexical homonymy, etc.
).
The variability of the generalized concept of "the Arabic language" is taken into account, which combines classical Arabic, Modern Standard Arabic and modern Arabic dialects.
Testing tools for morphological modeling allows us to draw conclusions about the need to improve the terminological apparatus, the variability of which is noted in the description of word forms.
Such kind of variation (divergence from the concepts accepted in general linguistics) potentially leads to a distortion of the results of lexico-semantic analysis.
During the analysis, some gaps were noted related to the definition of part-of-speech belonging, the description of word forms, etc.
The results of the study are relevant both for linguistic research and for improving the development of software applications aimed at processing the Arabic text.
Related Results
Perfect in the Old Uighur Language
Perfect in the Old Uighur Language
The article discusses the semantic nature of the Turkic perfect, its semantic zone limitations and possible grammar tools of expressing Perfect in the Old Uighur language. Goals. T...
Porphyry, Universal Soul and the Arabic Plotinus
Porphyry, Universal Soul and the Arabic Plotinus
Scholars working in the field of Graeco-Arabic Neoplatonism often discuss the role Porphyry, the editor of Plotinus, must be credited with in the formation of the Arabic Plotinianc...
Lead and Tin in Arabic Alchemy
Lead and Tin in Arabic Alchemy
The present article is devoted to two issues. The first is the identification of lead and tin in medieval Arabic alchemy. The second is the investigation of whether Arabic alchemis...
Code-Mixing in the Conversation of Northern Khmer Speakers in Thailand: A Case Study of Teenagers and Middle-Aged Northern Khmer Speakers in Buriram Province
Code-Mixing in the Conversation of Northern Khmer Speakers in Thailand: A Case Study of Teenagers and Middle-Aged Northern Khmer Speakers in Buriram Province
This study aims to examine the linguistic performance of code-mixing by Northern Khmer (NK) teenagers in Buriram Province while conversing with NK middle-aged speakers in their com...
Information Processing in Dendritic Trees
Information Processing in Dendritic Trees
This review considers the input-output behavior of neurons with dendritic trees, with an emphasis on questions of information processing. The parts of this review are (1) a brief h...
Visual processing abilities associated with piano music sight-reading expertise
Visual processing abilities associated with piano music sight-reading expertise
Visual processing expertise in musicians has traditionally focused on the difference between expert and non-expert music sight-readers. More generally, differences between musician...
The movement to promote an ethnic language in American schools: The Korean community in the New York–New Jersey area
The movement to promote an ethnic language in American schools: The Korean community in the New York–New Jersey area
This paper examines a New York Korean immigrants’ movement to promote the Korean language in American schools. This movement includes the efforts of Korean community leaders to inc...
ACQUISITION: PROCESS, STRATEGY, PROBLEM IN FOREIGN LANGUAGE LEARNING
ACQUISITION: PROCESS, STRATEGY, PROBLEM IN FOREIGN LANGUAGE LEARNING
This paper explained the pattern of overcoming difficulties of foreign language learning, identifying based one previous study in learning English as a foreign language. Yet, it al...