
Speech and Prosodic Processing for Assistive Technology

A hearer may take a different meaning from an utterance than the speaker intended. Such ambiguities can be resolved by placing accents at different positions: in human communication, speakers emphasize the focus part of an utterance to mark the important content and reduce ambiguity. Our Focus-to-Emphasize Tone (FET) system determines how focus and the speaker's intention influence an utterance. We investigate the relationships among focus information, speaker's intention, and prosodic phenomena in order to recognize intonation patterns and annotate a sentence with prosodic marks. The proposed FET analysis comprises three steps: (i) generating the constraints for foci, speaker's intention, and prosodic features; (ii) defining the intonation patterns; and (iii) labelling a sentence with a set of prosodic marks. We also design an FET structure that supports this analysis and holds the focus, speaker's-intention, and prosodic components. We describe an implementation of the system and present evaluation results on the CMU Communicator (CMU–COM) dataset.
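The abstract does not give the constraint rules, the intonation patterns, or the set of prosodic marks, so the following is only a minimal sketch of what the labelling step of such an analysis might look like. Every name here (`FETStructure`, `annotate`, `render`) is hypothetical, and the marks are borrowed from the ToBI convention (`H*` pitch accent, `L-L%` falling and `H-H%` rising boundary) as an assumption, not taken from the paper.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Assumed label inventory (ToBI-style); the paper's own mark set is not given in the abstract.
PITCH_ACCENT = "H*"
BOUNDARY_BY_INTENTION = {"statement": "L-L%", "question": "H-H%"}


@dataclass
class FETStructure:
    """Hypothetical container for the focus, intention, and prosodic components."""
    words: list[str]
    focus: set[int]                      # indices of the focused words
    intention: str                       # "statement" or "question" in this sketch
    marks: dict[int, str] = field(default_factory=dict)
    boundary: str = ""


def annotate(words: list[str], focus: set[int], intention: str) -> FETStructure:
    """Place a pitch accent on each focused word and pick a boundary tone from the intention."""
    if intention not in BOUNDARY_BY_INTENTION:
        raise ValueError(f"unknown intention: {intention!r}")
    if any(not 0 <= i < len(words) for i in focus):
        raise IndexError("focus index outside the sentence")
    fet = FETStructure(list(words), set(focus), intention)
    fet.marks = {i: PITCH_ACCENT for i in sorted(fet.focus)}
    fet.boundary = BOUNDARY_BY_INTENTION[intention]
    return fet


def render(fet: FETStructure) -> str:
    """Return the sentence with inline prosodic marks and a final boundary tone."""
    tokens = [f"{w}[{fet.marks[i]}]" if i in fet.marks else w
              for i, w in enumerate(fet.words)]
    return " ".join(tokens) + " " + fet.boundary


sentence = ["I", "want", "a", "flight", "to", "Boston"]
print(render(annotate(sentence, {5}, "statement")))  # I want a flight to Boston[H*] L-L%
print(render(annotate(sentence, {3}, "question")))   # I want a flight[H*] to Boston H-H%
```

Moving the focus index changes which word carries the accent, which is the disambiguation mechanism the abstract describes; a real system would derive both the focus and the intention from its constraints rather than take them as arguments.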

IOS Press

Narupiyakul, Lalita; Keselj, Vlado; Cercone, Nick; Sirinaovakul, Booncharoen

Frontiers in Artificial Intelligence and Applications

2025



