Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Speech and Prosodic Processing for Assistive Technology

View through CrossRef
A speaker's utterance may convey different meanings to a hearer than what the speaker intended. Such ambiguities can be resolved by emphasizing accents at different positions. In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance. In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention. The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks. We propose using the Focus to Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence. We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components. An implementation of the system is described and the evaluation results on the CMU Communicator (CMU–COM) dataset are presented.
Title: Speech and Prosodic Processing for Assistive Technology
Description:
A speaker's utterance may convey different meanings to a hearer than what the speaker intended.
Such ambiguities can be resolved by emphasizing accents at different positions.
In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance.
In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention.
The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks.
We propose using the Focus to Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence.
We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components.
An implementation of the system is described and the evaluation results on the CMU Communicator (CMU–COM) dataset are presented.

Related Results

Lists, Spatial Practice and Assistive Technologies for the Blind
Lists, Spatial Practice and Assistive Technologies for the Blind
IntroductionSupermarkets are functionally challenging environments for people with vision impairments. A supermarket is likely to house an average of 45,000 products in a median fl...
Assistive activity technology as symbolic expressions of the self
Assistive activity technology as symbolic expressions of the self
BACKGROUND: Different types of assistive technologies can support participation for people with disability; nonetheless, technology can break with peoples self-image, sometimes res...
Assistive Technology
Assistive Technology
Considering the vicious cycle of exclusion that students with special needs are often trapped in— lacking the means for equal participation in education, society, and mainstream de...
Discontinuous noun phrases in Vietnamese
Discontinuous noun phrases in Vietnamese
Since Vietnamese is an isolating language, word order plays an important role in identifying the function of a particular word. Yet in some contexts word order may be flexible espe...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...

Back to Top