Javascript must be enabled to continue!
Neuronal and behavioral affective perceptions of human and naturalness-reduced emotional prosodies
View through CrossRef
Artificial voices are nowadays embedded into our daily lives with latest neural voices approaching human voice consistency (naturalness). Nevertheless, behavioral, and neuronal correlates of the perception of less naturalistic emotional prosodies are still misunderstood. In this study, we explored the acoustic tendencies that define naturalness from human to synthesized voices. Then, we created naturalness-reduced emotional utterances by acoustic editions of human voices. Finally, we used Event-Related Potentials (ERP) to assess the time dynamics of emotional integration when listening to both human and synthesized voices in a healthy adult sample. Additionally, listeners rated their perceptions for valence, arousal, discrete emotions, naturalness, and intelligibility. Synthesized voices were characterized by less lexical stress (i.e., reduced difference between stressed and unstressed syllables within words) as regards duration and median pitch modulations. Besides, spectral content was attenuated toward lower F2 and F3 frequencies and lower intensities for harmonics 1 and 4. Both psychometric and neuronal correlates were sensitive to naturalness reduction. (1) Naturalness and intelligibility ratings dropped with emotional utterances synthetization, (2) Discrete emotion recognition was impaired as naturalness declined, consistent with P200 and Late Positive Potentials (LPP) being less sensitive to emotional differentiation at lower naturalness, and (3) Relative P200 and LPP amplitudes between prosodies were modulated by synthetization. Nevertheless, (4) Valence and arousal perceptions were preserved at lower naturalness, (5) Valence (arousal) ratings correlated negatively (positively) with Higuchi’s fractal dimension extracted on neuronal data under all naturalness perturbations, (6) Inter-Trial Phase Coherence (ITPC) and standard deviation measurements revealed high inter-individual heterogeneity for emotion perception that is still preserved as naturalness reduces. Notably, partial between-participant synchrony (low ITPC), along with high amplitude dispersion on ERPs at both early and late stages emphasized miscellaneous emotional responses among subjects. In this study, we highlighted for the first time both behavioral and neuronal basis of emotional perception under acoustic naturalness alterations. Partial dependencies between ecological relevance and emotion understanding outlined the modulation but not the annihilation of emotional integration by synthetization.
Title: Neuronal and behavioral affective perceptions of human and naturalness-reduced emotional prosodies
Description:
Artificial voices are nowadays embedded into our daily lives with latest neural voices approaching human voice consistency (naturalness).
Nevertheless, behavioral, and neuronal correlates of the perception of less naturalistic emotional prosodies are still misunderstood.
In this study, we explored the acoustic tendencies that define naturalness from human to synthesized voices.
Then, we created naturalness-reduced emotional utterances by acoustic editions of human voices.
Finally, we used Event-Related Potentials (ERP) to assess the time dynamics of emotional integration when listening to both human and synthesized voices in a healthy adult sample.
Additionally, listeners rated their perceptions for valence, arousal, discrete emotions, naturalness, and intelligibility.
Synthesized voices were characterized by less lexical stress (i.
e.
, reduced difference between stressed and unstressed syllables within words) as regards duration and median pitch modulations.
Besides, spectral content was attenuated toward lower F2 and F3 frequencies and lower intensities for harmonics 1 and 4.
Both psychometric and neuronal correlates were sensitive to naturalness reduction.
(1) Naturalness and intelligibility ratings dropped with emotional utterances synthetization, (2) Discrete emotion recognition was impaired as naturalness declined, consistent with P200 and Late Positive Potentials (LPP) being less sensitive to emotional differentiation at lower naturalness, and (3) Relative P200 and LPP amplitudes between prosodies were modulated by synthetization.
Nevertheless, (4) Valence and arousal perceptions were preserved at lower naturalness, (5) Valence (arousal) ratings correlated negatively (positively) with Higuchi’s fractal dimension extracted on neuronal data under all naturalness perturbations, (6) Inter-Trial Phase Coherence (ITPC) and standard deviation measurements revealed high inter-individual heterogeneity for emotion perception that is still preserved as naturalness reduces.
Notably, partial between-participant synchrony (low ITPC), along with high amplitude dispersion on ERPs at both early and late stages emphasized miscellaneous emotional responses among subjects.
In this study, we highlighted for the first time both behavioral and neuronal basis of emotional perception under acoustic naturalness alterations.
Partial dependencies between ecological relevance and emotion understanding outlined the modulation but not the annihilation of emotional integration by synthetization.
Related Results
Improved emotion differentiation under reduced acoustic variability of speech in autism
Improved emotion differentiation under reduced acoustic variability of speech in autism
Abstract
Background
Socio-emotional impairments are among the diagnostic criteria for autism spectrum disorder (ASD), but the actual knowledge has s...
Metabolically induced neuronal differentiation
Metabolically induced neuronal differentiation
In recent years, several neuronal differentiation protocols were published that circumvent the requirement of embryoid body (EB) formation under serum-deprivation and simplified me...
Relationship of Tree Stand Heterogeneity and Forest Naturalness
Relationship of Tree Stand Heterogeneity and Forest Naturalness
The aim of our study was to investigate if compositional (tree species richness) and structural (vertical structure, age-structure, patterns of canopy closure) heterogeneity of the...
Affective Forecasting: the Effects of Immune Neglect and Surrogation
Affective Forecasting: the Effects of Immune Neglect and Surrogation
Studies of affective forecasting examine people’s ability to predict (forecast) their emotional (affective) responses to future events. Affective forecasts underlie nearly all deci...
Daniela Fenu Foerch: interview by Márcia Fusaro and Ana Maria Haddad Baptista
Daniela Fenu Foerch: interview by Márcia Fusaro and Ana Maria Haddad Baptista
EccoS Journal: Dr Foerch thank you very much for this interview. Could you start telling us about your professional background and what the WeFEEL project is?
Daniela Fenu Foerch:...
Astrocytes improve neuronal health after cisplatin treatment through mitochondrial transfer
Astrocytes improve neuronal health after cisplatin treatment through mitochondrial transfer
AbstractNeurodegenerative disorders, including chemotherapy-induced cognitive impairment, are associated with neuronal mitochondrial dysfunction. Cisplatin, a commonly used chemoth...
Investigation of Colour Naturalness in Lighting: A Comparative Study
Investigation of Colour Naturalness in Lighting: A Comparative Study
This paper is concerned with improving the acceptability of LED light sources, since their long life and high efficiency have contributed to their widespread adoption in many appli...
The Self-Cultivation Realm and Natural Value in Zhuangzi’s Concept of Zhenren 真人
The Self-Cultivation Realm and Natural Value in Zhuangzi’s Concept of Zhenren 真人
Adopting a comparative philosophical approach and engaging in textual analysis, this paper reveals that the concept of Zhenren 真人—as the embodiment of Zhuangzi’s ideal personality—...

