Javascript must be enabled to continue!
Tonal spaces of vector language models
View through CrossRef
Objectives. Tonality as a positive-negative mood is a key parameter of text analysis and generation algorithms. Existing machine learning methods encode tonality in computationally extensive and uninterpretable ways which hampers the development of the corresponding applications. The work aims to solve this problem for Russian language.
Methods. Pre-trained vector language models are used to encode words as vectors in multidimensional spaces. In these spaces, the tonality corresponds to a specific direction which optimally discriminates the positive and negative prototypes. The tonality of word is then determined by its projection onto this direction. Adding a tonal vector to a key word defines a one-dimensional subspace, containing its positive and negative associations.
Results. The algorithm is tested on GloVe and FastText machine language models, encoding individual Russian words and morphemes with vectors in 300-dimensional space. Commonly used verbs and nouns served as key words. The average reliability of the found tonal associations estimates as 80 %.
Conclusion. The results indicate the applicability of pre-trained vector language models for fast and interpretable working with tonal information. The developed approach is applicable for the tasks of aspect-based sentiment analysis, as well as for the machine generation of object-oriented texts with a required tonality. Generalization of the tonal axis to the triple of Osgood's semantic factors allows expanding the method to a full range of affectively-semantic information.
United Institute of Informatics Problems of the National Academy of Sciences of Belarus
Title: Tonal spaces of vector language models
Description:
Objectives.
Tonality as a positive-negative mood is a key parameter of text analysis and generation algorithms.
Existing machine learning methods encode tonality in computationally extensive and uninterpretable ways which hampers the development of the corresponding applications.
The work aims to solve this problem for Russian language.
Methods.
Pre-trained vector language models are used to encode words as vectors in multidimensional spaces.
In these spaces, the tonality corresponds to a specific direction which optimally discriminates the positive and negative prototypes.
The tonality of word is then determined by its projection onto this direction.
Adding a tonal vector to a key word defines a one-dimensional subspace, containing its positive and negative associations.
Results.
The algorithm is tested on GloVe and FastText machine language models, encoding individual Russian words and morphemes with vectors in 300-dimensional space.
Commonly used verbs and nouns served as key words.
The average reliability of the found tonal associations estimates as 80 %.
Conclusion.
The results indicate the applicability of pre-trained vector language models for fast and interpretable working with tonal information.
The developed approach is applicable for the tasks of aspect-based sentiment analysis, as well as for the machine generation of object-oriented texts with a required tonality.
Generalization of the tonal axis to the triple of Osgood's semantic factors allows expanding the method to a full range of affectively-semantic information.
Related Results
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
A Touch of Space Weather - Outreach project for visually impaired students
A Touch of Space Weather - Outreach project for visually impaired students
<p><em><span data-preserver-spaces="true">'A Touch of Space Weather' is a project that brings space weather science into...
Algebraic models of tonal function
Algebraic models of tonal function
This thesis presents an algebraic model of tonal function. This model is based on several pillars. The first one is a mathematical formalization of the musical universe, from pure ...
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Abstract
Funding Acknowledgements
Type of funding sources: None.
INTRODUCTION Patients with heart failure (HF)...
Toward a comprehensive framework for tonal analysis: Yangru tone in Southern Min
Toward a comprehensive framework for tonal analysis: Yangru tone in Southern Min
Abstract
This study establishes a comprehensive framework for analyzing tone as an important but complex mechanism in East and Southeast Asian tonal languages. Th...
An investigation of acoustic cues to tonal registers and voicing in Donglei Kam
An investigation of acoustic cues to tonal registers and voicing in Donglei Kam
The Kam language has experienced historical tonal splits, resulting in the development of a complex tonal system. However, there is still limited knowledge regarding the acoustic c...
Tonal priming is resistant to changes in pitch height
Tonal priming is resistant to changes in pitch height
Research on tonal priming has consistently shown that tonally expected events are processed more efficiently and has confirmed that the locus of the effect is cognitive rather than...

