Javascript must be enabled to continue!
Speech and Prosodic Processing for Assistive Technology
View through CrossRef
A speaker's utterance may convey different meanings to a hearer than what the speaker intended. Such ambiguities can be resolved by emphasizing accents at different positions. In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance. In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention. The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks. We propose using the Focus to Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence. We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components. An implementation of the system is described and the evaluation results on the CMU Communicator (CMU–COM) dataset are presented.
Title: Speech and Prosodic Processing for Assistive Technology
Description:
A speaker's utterance may convey different meanings to a hearer than what the speaker intended.
Such ambiguities can be resolved by emphasizing accents at different positions.
In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance.
In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention.
The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks.
We propose using the Focus to Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence.
We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components.
An implementation of the system is described and the evaluation results on the CMU Communicator (CMU–COM) dataset are presented.
Related Results
Lists, Spatial Practice and Assistive Technologies for the Blind
Lists, Spatial Practice and Assistive Technologies for the Blind
IntroductionSupermarkets are functionally challenging environments for people with vision impairments. A supermarket is likely to house an average of 45,000 products in a median fl...
Primary caregivers’ perspectives on the use of assistive technologies in children with physical disabilities: a qualitative systematic review
Primary caregivers’ perspectives on the use of assistive technologies in children with physical disabilities: a qualitative systematic review
Background/Aims Assistive technologies are critical for promoting independence and participation in children with physical disabilities. Assistive technology refers to any item, eq...
Assistive activity technology as symbolic expressions of the self
Assistive activity technology as symbolic expressions of the self
BACKGROUND: Different types of assistive technologies can support participation for people with disability; nonetheless, technology can break with peoples self-image, sometimes res...
Assistive Technology
Assistive Technology
Considering the vicious cycle of exclusion that students with special needs are often trapped in— lacking the means for equal participation in education, society, and mainstream de...
Discontinuous noun phrases in Vietnamese
Discontinuous noun phrases in Vietnamese
Since Vietnamese is an isolating language, word order plays an important role in identifying the function of a particular word. Yet in some contexts word order may be flexible espe...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...
Working memory capacity predicts sensitivity to prosodic structure
Working memory capacity predicts sensitivity to prosodic structure
Listeners vary in the perception and interpretation of speech prosody (the variations inintonation, loudness, and rhythm of spoken language). The source of this variability isunkno...

