Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Articulatory‐to‐Acoustic Conversion of Mandarin Emotional Speech Based on PSO‐LSSVM

View through CrossRef
The production of emotional speech is determined by the movement of the speaker’s tongue, lips, and jaw. In order to combine articulatory data and acoustic data of speakers, articulatory‐to‐acoustic conversion of emotional speech has been studied. In this paper, parameters of LSSVM model have been optimized using the PSO method, and the optimized PSO‐LSSVM model was applied to the articulatory‐to‐acoustic conversion. The root mean square error (RMSE) and mean Mel‐cepstral distortion (MMCD) have been used to evaluate the results of conversion; the evaluated result illustrates that MMCD of MFCC is 1.508 dB, and RMSE of the second formant (F2) is 25.10 Hz. The results of this research can be further applied to the feature fusion of emotion speech recognition to improve the accuracy of emotion recognition.
Title: Articulatory‐to‐Acoustic Conversion of Mandarin Emotional Speech Based on PSO‐LSSVM
Description:
The production of emotional speech is determined by the movement of the speaker’s tongue, lips, and jaw.
In order to combine articulatory data and acoustic data of speakers, articulatory‐to‐acoustic conversion of emotional speech has been studied.
In this paper, parameters of LSSVM model have been optimized using the PSO method, and the optimized PSO‐LSSVM model was applied to the articulatory‐to‐acoustic conversion.
The root mean square error (RMSE) and mean Mel‐cepstral distortion (MMCD) have been used to evaluate the results of conversion; the evaluated result illustrates that MMCD of MFCC is 1.
508 dB, and RMSE of the second formant (F2) is 25.
10 Hz.
The results of this research can be further applied to the feature fusion of emotion speech recognition to improve the accuracy of emotion recognition.

Related Results

Abnormal Status Detection of Catenary Based on TSNE Dimensionality Reduction Method and IGWO-LSSVM Model
Abnormal Status Detection of Catenary Based on TSNE Dimensionality Reduction Method and IGWO-LSSVM Model
Background: Catenary is a crucial component of an electrified railroad's traction power supply system. There is a considerable incidence of abnormal status and failures due to prol...
KESETARAAN HANYU SHUIPING KAOSHI LEVEL I-IV DENGAN CEFR PADA KETERAMPILAN BERBICARA BAHASA MANDARIN
KESETARAAN HANYU SHUIPING KAOSHI LEVEL I-IV DENGAN CEFR PADA KETERAMPILAN BERBICARA BAHASA MANDARIN
AbstrakSejalan dengan derasnya perkembangan era disrupsi teknologi revolusi industri 4.0, pembelajaran bahasa asing dituntut untuk mempunyai standar tingkat penguasaan sebagai acua...
Experimental Study on the Acoustic Characteristics of “Similar” Vowels in Mandarin Learners
Experimental Study on the Acoustic Characteristics of “Similar” Vowels in Mandarin Learners
Abstract The rapid globalization of regions with different languages necessitates more advanced non-native tongue proficiency among individuals who need to communica...
Analysis of Factors Influencing the Mandarin Proficiency of Ethnic Minority College Students
Analysis of Factors Influencing the Mandarin Proficiency of Ethnic Minority College Students
This study focuses on determining the factors that influence Mandarin proficiency of ethnic minority college students of Hulunbuir University in border minority areas of China. We ...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Abstract 14986: A Randomized Trial of Statins to Reduce Vascular Endothelial Inflammation in Psoriasis
Abstract 14986: A Randomized Trial of Statins to Reduce Vascular Endothelial Inflammation in Psoriasis
Introduction: Psoriasis (PsO) is a chronic skin disease associated with increased CV risk. Systemic and vascular endothelial inflammation in PsO is highly prevalent and...

Back to Top