Javascript must be enabled to continue!
Acoustic analysis and synthesis of pathological voice qualities
View through CrossRef
An analysis-by-synthesis approach was adopted to classify the acoustic and perceptual features of three pathological voice qualities: breathy, strained, and rough. One hundred and sixty waveforms of the vowel /open aye/ spoken by female and male subjects with pathological voice qualities were obtained from the VA Hospital in West LA. The temporal and spectral features of the waveforms were studied and the results were used in synthesizing the utterances using the Klatt formant synthesizer. Preliminary results on 30 breathy and strained voices indicate that the perception of ‘‘pathological’’ breathiness is mainly related to: (1) a large open quotient of the glottal waveform (OQ) and (2) the amplitude of aspiration noise (AH) relative to that of voicing (AV) with female voices exhibiting a larger (AH–AV) than male voices. For some voices, it was also necessary to introduce extra poles to the vocal-tract transfer function to achieve a better spectral match. Synthesis of strained voices required a lower OQ than that needed for normal voices and, in some cases, amplitude and/or frequency modulation was introduced to achieve a better match in the time domain. The synthetic voices were judged perceptually by clinicians to be of high quality. The results will be discussed in terms of the effects of different vibratory patterns of the vocal folds on the acoustic speech waveform.
Acoustical Society of America (ASA)
Title: Acoustic analysis and synthesis of pathological voice qualities
Description:
An analysis-by-synthesis approach was adopted to classify the acoustic and perceptual features of three pathological voice qualities: breathy, strained, and rough.
One hundred and sixty waveforms of the vowel /open aye/ spoken by female and male subjects with pathological voice qualities were obtained from the VA Hospital in West LA.
The temporal and spectral features of the waveforms were studied and the results were used in synthesizing the utterances using the Klatt formant synthesizer.
Preliminary results on 30 breathy and strained voices indicate that the perception of ‘‘pathological’’ breathiness is mainly related to: (1) a large open quotient of the glottal waveform (OQ) and (2) the amplitude of aspiration noise (AH) relative to that of voicing (AV) with female voices exhibiting a larger (AH–AV) than male voices.
For some voices, it was also necessary to introduce extra poles to the vocal-tract transfer function to achieve a better spectral match.
Synthesis of strained voices required a lower OQ than that needed for normal voices and, in some cases, amplitude and/or frequency modulation was introduced to achieve a better match in the time domain.
The synthetic voices were judged perceptually by clinicians to be of high quality.
The results will be discussed in terms of the effects of different vibratory patterns of the vocal folds on the acoustic speech waveform.
Related Results
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Brain mechanism of unfamiliar and familiar voice processing: an activation likelihood estimation meta-analysis
Brain mechanism of unfamiliar and familiar voice processing: an activation likelihood estimation meta-analysis
Interpersonal communication through vocal information is very important for human society. During verbal interactions, our vocal cord vibrations convey important information regard...
How to speak and vocal hygiene
How to speak and vocal hygiene
An abnormal tongue shape, pitch difference or voice quality can lead to difficulty communicating effectively. Common among teachers are voice issues, which can be uncomfortable and...
Voice in marketing interactions
Voice in marketing interactions
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT REQUEST OF AUTHOR.] Voice is an intrinsic feature of marketing interactions that varies among individual agents, across encounte...
Applications Of Acoustic Image Logs
Applications Of Acoustic Image Logs
Abstract
Acoustic image logs have been acquired in the Barua/Motatan and Mara fields as a part of the information acquisition program implemented by Maraven, S.A....
From Voice Signals to Voice Maps
From Voice Signals to Voice Maps
Abstract
This article is intended as an introductory tutorial for technically inclined clinicians, vocologists and voice pedagogues who want ...

