Speech and Prosodic Processing for Assistive Technology

A speaker's utterance may convey a meaning to the hearer that differs from what the speaker intended. Such ambiguities can be resolved by placing emphasis (accent) at different positions in the utterance. In human communication, speakers emphasize the focused part of an utterance to mark the important content and reduce ambiguity. Our Focus to Emphasize Tone (FET) system determines how a speaker's utterances are influenced by focus and by the speaker's intention. We investigate the relationships among focus information, speaker's intention, and prosodic phenomena in order to recognize intonation patterns and annotate sentences with prosodic marks. The proposed FET analysis comprises: (i) generating constraints for foci, speaker's intention, and prosodic features; (ii) defining the intonation patterns; and (iii) labelling a sentence with a set of prosodic marks. We also design an FET structure that supports this analysis and contains focus, speaker's-intention, and prosodic components. We describe an implementation of the system and present evaluation results on the CMU Communicator (CMU–COM) dataset.

Related Results

Assistive activity technology as symbolic expressions of the self
BACKGROUND: Different types of assistive technologies can support participation for people with disability; nonetheless, technology can break with people's self-image, sometimes res...
Assistive Technology
Considering the vicious cycle of exclusion that students with special needs are often trapped in— lacking the means for equal participation in education, society, and mainstream de...
Discontinuous noun phrases in Vietnamese
Since Vietnamese is an isolating language, word order plays an important role in identifying the function of a particular word. Yet in some contexts word order may be flexible espe...
Overt and implicit prosody contribute to neurophysiological responses previously attributed to grammatical processing
Abstract: Recent neurophysiological research suggests that slow cortical activity tracks hierarchical syntactic structure during online sentence processing. Here we tested an alterna...
Overt and covert prosody are reflected in neurophysiological responses previously attributed to grammatical processing
Abstract: Recent neurophysiological research suggests that slow cortical activity tracks hierarchical syntactic structure during online sentence processing (e.g., Ding, Melloni, Zhan...
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
Background: According to Vygotsky’s sociocultural theory, private speech functions both as a tool for thought regulation and as a transitional form between outer and inner speech. ...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
