Javascript must be enabled to continue!
Voice clones sound realistic but not (yet) hyperrealistic
View through CrossRef
AI-generated voices are increasingly prevalent in our lives, via virtual assistants, automated customer service, and voice-overs. With increased availability and affordability of AI-generated voices, we need to examine how humans perceive them. Recently, an intriguing effect was reported in AI-generated faces, where such face images were perceived as more human than images of real humans - a “hyperrealism effect.” Here, we tested whether a “hyperrealism effect” also exists for AI-generated voices. We investigated the extent to which AI-generated voices sound real to human listeners, and whether listeners can accurately distinguish between human and AI-generated voices. We also examined perceived social trait characteristics (trustworthiness and dominance) of human and AI-generated voices. We tested these questions using AI-generated voices generated with and without a specific human counterpart (i.e., voice clones, and voices generated from the latent space of a large voice model).We find that voice clones can sound as real as human voices, making it difficult for listeners to distinguish between them. However, we did not observe a hyperrealism effect. Both types of AI-generated voices were evaluated as more dominant than human voices, with some AI-generated voices also being perceived as more trustworthy. These findings raise questions for future research: Can hyperrealistic voices be created with more advanced technology, or is the lack of a hyperrealism effect due to differences between voice and face (image) perception? Our findings also highlight the potential for AI-generated voices to misinform and fraud, alongside opportunities to use realistic AI-generated voices for beneficial purposes.
Title: Voice clones sound realistic but not (yet) hyperrealistic
Description:
AI-generated voices are increasingly prevalent in our lives, via virtual assistants, automated customer service, and voice-overs.
With increased availability and affordability of AI-generated voices, we need to examine how humans perceive them.
Recently, an intriguing effect was reported in AI-generated faces, where such face images were perceived as more human than images of real humans - a “hyperrealism effect.
” Here, we tested whether a “hyperrealism effect” also exists for AI-generated voices.
We investigated the extent to which AI-generated voices sound real to human listeners, and whether listeners can accurately distinguish between human and AI-generated voices.
We also examined perceived social trait characteristics (trustworthiness and dominance) of human and AI-generated voices.
We tested these questions using AI-generated voices generated with and without a specific human counterpart (i.
e.
, voice clones, and voices generated from the latent space of a large voice model).
We find that voice clones can sound as real as human voices, making it difficult for listeners to distinguish between them.
However, we did not observe a hyperrealism effect.
Both types of AI-generated voices were evaluated as more dominant than human voices, with some AI-generated voices also being perceived as more trustworthy.
These findings raise questions for future research: Can hyperrealistic voices be created with more advanced technology, or is the lack of a hyperrealism effect due to differences between voice and face (image) perception? Our findings also highlight the potential for AI-generated voices to misinform and fraud, alongside opportunities to use realistic AI-generated voices for beneficial purposes.
Related Results
Centaurs transitioning to JFCs: thermal and dynamical evolution
Centaurs transitioning to JFCs: thermal and dynamical evolution
<p>1- Context</p>
<p>Jupiter-family Comets are continuously replenished from their outer solar system reservoirs. Before they enter the in...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Brain mechanism of unfamiliar and familiar voice processing: an activation likelihood estimation meta-analysis
Brain mechanism of unfamiliar and familiar voice processing: an activation likelihood estimation meta-analysis
Interpersonal communication through vocal information is very important for human society. During verbal interactions, our vocal cord vibrations convey important information regard...
Selection Criteria and Performance of Energycane Clones (Saccharum spp. × S. spontaneum) for Biomass Production Under Tropical and Sub-tropical Conditions
Selection Criteria and Performance of Energycane Clones (Saccharum spp. × S. spontaneum) for Biomass Production Under Tropical and Sub-tropical Conditions
The urgent need to reduce our reliance on oil and at the same time reduce carbon emissions, has triggered the search for alternative energy sources such as biofuels. New technologi...
Evaluation of Clonal Variability in Shoot Coppicing Ability and in vitro Responses of Dalbergia sissoo Roxb
Evaluation of Clonal Variability in Shoot Coppicing Ability and in vitro Responses of Dalbergia sissoo Roxb
Summary
Clonal variations were observed amongst 12 clones of Dalbergia sissoo belonging to four states (U.P, Uttaranchal, Haryana and Rajasthan) of India, represent...
Therapeutic approach to steroid-resistant asthma
Therapeutic approach to steroid-resistant asthma
Abstract
Background
To control therapy-resistant eosinophilia, synergistic effects of CTLA4-Ig and glucocorticoid was investigat...

