Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Acoustic analysis and synthesis of pathological voice qualities

View through CrossRef
An analysis-by-synthesis approach was adopted to classify the acoustic and perceptual features of three pathological voice qualities: breathy, strained, and rough. One hundred and sixty waveforms of the vowel /open aye/ spoken by female and male subjects with pathological voice qualities were obtained from the VA Hospital in West LA. The temporal and spectral features of the waveforms were studied and the results were used in synthesizing the utterances using the Klatt formant synthesizer. Preliminary results on 30 breathy and strained voices indicate that the perception of ‘‘pathological’’ breathiness is mainly related to: (1) a large open quotient of the glottal waveform (OQ) and (2) the amplitude of aspiration noise (AH) relative to that of voicing (AV) with female voices exhibiting a larger (AH–AV) than male voices. For some voices, it was also necessary to introduce extra poles to the vocal-tract transfer function to achieve a better spectral match. Synthesis of strained voices required a lower OQ than that needed for normal voices and, in some cases, amplitude and/or frequency modulation was introduced to achieve a better match in the time domain. The synthetic voices were judged perceptually by clinicians to be of high quality. The results will be discussed in terms of the effects of different vibratory patterns of the vocal folds on the acoustic speech waveform.
Title: Acoustic analysis and synthesis of pathological voice qualities
Description:
An analysis-by-synthesis approach was adopted to classify the acoustic and perceptual features of three pathological voice qualities: breathy, strained, and rough.
One hundred and sixty waveforms of the vowel /open aye/ spoken by female and male subjects with pathological voice qualities were obtained from the VA Hospital in West LA.
The temporal and spectral features of the waveforms were studied and the results were used in synthesizing the utterances using the Klatt formant synthesizer.
Preliminary results on 30 breathy and strained voices indicate that the perception of ‘‘pathological’’ breathiness is mainly related to: (1) a large open quotient of the glottal waveform (OQ) and (2) the amplitude of aspiration noise (AH) relative to that of voicing (AV) with female voices exhibiting a larger (AH–AV) than male voices.
For some voices, it was also necessary to introduce extra poles to the vocal-tract transfer function to achieve a better spectral match.
Synthesis of strained voices required a lower OQ than that needed for normal voices and, in some cases, amplitude and/or frequency modulation was introduced to achieve a better match in the time domain.
The synthetic voices were judged perceptually by clinicians to be of high quality.
The results will be discussed in terms of the effects of different vibratory patterns of the vocal folds on the acoustic speech waveform.

Related Results

Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
How to speak and vocal hygiene
How to speak and vocal hygiene
An abnormal tongue shape, pitch difference or voice quality can lead to difficulty communicating effectively. Common among teachers are voice issues, which can be uncomfortable and...
Voice in marketing interactions
Voice in marketing interactions
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT REQUEST OF AUTHOR.] Voice is an intrinsic feature of marketing interactions that varies among individual agents, across encounte...
Acoustic cloaking design based on penetration manipulation with combination acoustic metamaterials
Acoustic cloaking design based on penetration manipulation with combination acoustic metamaterials
The acoustic wave transmission manipulation ability is the most important performance for the acoustic metamaterials. To manipulate the acoustic transmission, the combination acous...
Future automobile driving space voice interaction: adapt to the driving scenarios and user personalities
Future automobile driving space voice interaction: adapt to the driving scenarios and user personalities
This paper investigates in-car voice interaction, where in-car voice assistants are becoming a common form of interaction in the car. However, voice assistants are unable to natura...
Telepractice program in voice therapy for primary school teachers: A Pilot study
Telepractice program in voice therapy for primary school teachers: A Pilot study
Background: Teaching is an occupation where teachers consistently use their voices. However, excessive voice use causes voice disorders, especially in primary school teachers. Ther...

Back to Top