
Synthesis of severely pathological voices

This paper studies the acoustic and perceptual correlates of five severely pathological voice qualities: bicyclic, rough/breathy, rough/bicyclic, strained/breathy, and breathy/bicyclic. The work continues a previous study [J. Acoust. Soc. Am. 94, 1782 (A) (1993)]. An analysis-by-synthesis approach, using the KLSYN88 synthesizer, is applied to ten speech waveforms obtained from the VA Hospital in West LA. Preliminary results indicate that the synthesizer’s diplophonia parameter (DI) is useful in synthesizing bicyclic voices. Other severe disorders can be synthesized in one of three ways: (1) simultaneous and equal use of the parameters needed to synthesize milder cases of the pathologies; for example, rough/breathy voices are synthesized with a time-varying F0, characteristic of rough voices, in combination with a high amplitude of aspiration noise, needed for the perception of breathiness; (2) increased use of a single set of parameters appropriate for a milder pathology; for example, a rough/bicyclic voice is synthesized with a time-varying F0 and very little DI; and (3) sequential use of parameters appropriate for two different qualities; for example, synthesizing a strained/breathy voice requires varying the open-quotient parameter in time to match the acoustic and perceptual correlates of breathiness in one time interval and those of the strained quality in the other. These results will be discussed in terms of whether the acoustic and perceptual features are independent or correlated.
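The three combination strategies in the abstract can be pictured as frame-by-frame parameter tracks. The following is only a rough sketch and not the authors' procedure: it does not call KLSYN88 or any real synthesizer, the `Frame` class and helper functions are hypothetical, and every numeric value is an illustrative placeholder rather than a setting reported in the study. The field names follow the usual KLSYN88 labels (F0, AH for aspiration amplitude, OQ for open quotient, DI for diplophonia). Treating a larger open quotient as the breathy setting and a smaller one as the strained setting is an assumption made here, since the abstract states only that the parameter is varied in time.

```python
import math
from dataclasses import dataclass


@dataclass
class Frame:
    """One synthesis frame; fields mirror KLSYN88 parameter labels."""
    f0: float  # fundamental frequency (Hz)
    ah: float  # amplitude of aspiration noise (dB)
    oq: float  # open quotient (percent of the glottal period)
    di: float  # diplophonia (percent)


def wobbling_f0(i, base=120.0, depth=8.0):
    """Illustrative time-varying F0; base, depth and period are placeholders."""
    return base + depth * math.sin(2.0 * math.pi * i / 7.0)


def rough_breathy(n):
    """Strategy 1: simultaneous use of a time-varying F0 and high aspiration."""
    return [Frame(f0=wobbling_f0(i), ah=60.0, oq=50.0, di=0.0) for i in range(n)]


def rough_bicyclic(n):
    """Strategy 2: stronger F0 variation with only a small amount of DI."""
    return [Frame(f0=wobbling_f0(i, depth=16.0), ah=0.0, oq=50.0, di=5.0)
            for i in range(n)]


def strained_breathy(n):
    """Strategy 3: sequential use; OQ switches between two time intervals.

    Assumption: high OQ stands in for breathy, low OQ for strained.
    """
    half = n // 2
    return [Frame(f0=120.0, ah=0.0, oq=75.0 if i < half else 30.0, di=0.0)
            for i in range(n)]
```

Calling `strained_breathy(10)` under these assumptions gives five frames with the higher open quotient followed by five with the lower one, which is the "one interval, then the other" pattern that strategy (3) describes.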

Related Results

Voice clones sound realistic but not (yet) hyperrealistic
AI-generated voices are increasingly prevalent in our lives, via virtual assistants, automated customer service, and voice-overs. With increased availability and affordability of A...
Acoustic analysis and synthesis of pathological voice qualities
An analysis-by-synthesis approach was adopted to classify the acoustic and perceptual features of three pathological voice qualities: breathy, strained, and rough. One hundred and ...
Trait impressions from voices are formed rapidly within 400ms of exposure
When seeing a face or hearing a voice, perceivers readily form first impressions of a person’s characteristics – are they trustworthy, do they seem aggressive? One of the key claim...
Both personal and shared taste shape impressions from voices and faces
Voices elicit rich first impressions of what the person we are hearing might be like. Research stresses that these impressions from voices are shared across different listeners, su...
Voice and International Studies
The call to include different voices in international studies has been made from heterogenous vantage points. Critical theoretical traditions—like feminist, postcolonial, and decol...
Remembering Voices
A key focus of the Hearing the Voice Phenomenological Interview is the question of when participants’ voices started. This chapter explores some of the impl...
Clinical outcomes of implantation of posterior chamber phakic intraocular lens for pathologic and non-pathologic myopia
Background: To compare the clinical outcomes of posterior chamber phakic intraocular lens (pIOL) implantation for non-pathological myopia and pathological myopia. ...
The Observatory
This thesis investigation looks at how transformative heritage stories linked to abandoned architectural sites can be reawakened through an allegorical architectu...
