Javascript must be enabled to continue!
Articulatory‐to‐Acoustic Conversion of Mandarin Emotional Speech Based on PSO‐LSSVM
View through CrossRef
The production of emotional speech is determined by the movement of the speaker’s tongue, lips, and jaw. In order to combine articulatory data and acoustic data of speakers, articulatory‐to‐acoustic conversion of emotional speech has been studied. In this paper, parameters of LSSVM model have been optimized using the PSO method, and the optimized PSO‐LSSVM model was applied to the articulatory‐to‐acoustic conversion. The root mean square error (RMSE) and mean Mel‐cepstral distortion (MMCD) have been used to evaluate the results of conversion; the evaluated result illustrates that MMCD of MFCC is 1.508 dB, and RMSE of the second formant (F2) is 25.10 Hz. The results of this research can be further applied to the feature fusion of emotion speech recognition to improve the accuracy of emotion recognition.
Title: Articulatory‐to‐Acoustic Conversion of Mandarin Emotional Speech Based on PSO‐LSSVM
Description:
The production of emotional speech is determined by the movement of the speaker’s tongue, lips, and jaw.
In order to combine articulatory data and acoustic data of speakers, articulatory‐to‐acoustic conversion of emotional speech has been studied.
In this paper, parameters of LSSVM model have been optimized using the PSO method, and the optimized PSO‐LSSVM model was applied to the articulatory‐to‐acoustic conversion.
The root mean square error (RMSE) and mean Mel‐cepstral distortion (MMCD) have been used to evaluate the results of conversion; the evaluated result illustrates that MMCD of MFCC is 1.
508 dB, and RMSE of the second formant (F2) is 25.
10 Hz.
The results of this research can be further applied to the feature fusion of emotion speech recognition to improve the accuracy of emotion recognition.
Related Results
Abnormal Status Detection of Catenary Based on TSNE Dimensionality
Reduction Method and IGWO-LSSVM Model
Abnormal Status Detection of Catenary Based on TSNE Dimensionality
Reduction Method and IGWO-LSSVM Model
Background:
Catenary is a crucial component of an electrified railroad's traction power supply system.
There is a considerable incidence of abnormal status and failures due to prol...
KESETARAAN HANYU SHUIPING KAOSHI LEVEL I-IV DENGAN CEFR PADA KETERAMPILAN BERBICARA BAHASA MANDARIN
KESETARAAN HANYU SHUIPING KAOSHI LEVEL I-IV DENGAN CEFR PADA KETERAMPILAN BERBICARA BAHASA MANDARIN
AbstrakSejalan dengan derasnya perkembangan era disrupsi teknologi revolusi industri 4.0, pembelajaran bahasa asing dituntut untuk mempunyai standar tingkat penguasaan sebagai acua...
Experimental Study on the Acoustic Characteristics of “Similar” Vowels in Mandarin Learners
Experimental Study on the Acoustic Characteristics of “Similar” Vowels in Mandarin Learners
Abstract
The rapid globalization of regions with different languages necessitates more advanced non-native tongue proficiency among individuals who need to communica...
PEMBELAJARAN BAHASA MANDARIN ANAK USIA DINI DI YAYASAN PENDIDIKAN ISLAM AR-RAHMAH
PEMBELAJARAN BAHASA MANDARIN ANAK USIA DINI DI YAYASAN PENDIDIKAN ISLAM AR-RAHMAH
Bahasa Mandarin telah dijadikan sebagai salah satu mata pelajaran di sekolah-sekolah yang ada di Indonesia termasuk TK/PAUD.Namun demikian, banyak kendala yang dihadapi oleh pihak ...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Analysis of Factors Influencing the Mandarin Proficiency of Ethnic Minority College Students
Analysis of Factors Influencing the Mandarin Proficiency of Ethnic Minority College Students
This study focuses on determining the factors that influence Mandarin proficiency of ethnic minority college students of Hulunbuir University in border minority areas of China. We ...

