Javascript must be enabled to continue!
Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
View through CrossRef
This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility and talker-identification effort. So far, the deliberate activation of specific perceptual and cognitive processes by listeners to optimize their task performance remained largely unexamined. Spoken sentences selected as stimuli were either clean or degraded due to background noise or bandpass filtering. Stimuli were presented via three horizontally positioned loudspeakers: In a non-spatial mode, both talkers were presented through a central loudspeaker; in a spatial mode, each talker was presented through the central or a talker-specific lateral loudspeaker. Participants identified talkers via speeded keypresses and afterwards provided subjective ratings (speech quality, speech intelligibility, voice similarity, talker-identification effort). In the spatial mode, presentations at lateral loudspeaker locations entailed quicker behavioral responses, which were significantly slower in comparison to a talker-localization task. Under clean speech, response times globally increased in the spatial vs. non-spatial mode (across all locations); these “response time switch costs,” presumably being caused by repeated switching of spatial auditory attention between different locations, diminished under degraded speech. No significant effects of spatialization on subjective ratings were found. The results suggested that when listeners could utilize task-relevant auditory cues about talker location, they continued to rely on voice recognition instead of localization of talker sound sources as primary response strategy. Besides, the presence of speech degradations may have led to increased cognitive control, which in turn compensated for incurring response time switch costs.
Frontiers Media SA
Title: Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
Description:
This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario).
Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility and talker-identification effort.
So far, the deliberate activation of specific perceptual and cognitive processes by listeners to optimize their task performance remained largely unexamined.
Spoken sentences selected as stimuli were either clean or degraded due to background noise or bandpass filtering.
Stimuli were presented via three horizontally positioned loudspeakers: In a non-spatial mode, both talkers were presented through a central loudspeaker; in a spatial mode, each talker was presented through the central or a talker-specific lateral loudspeaker.
Participants identified talkers via speeded keypresses and afterwards provided subjective ratings (speech quality, speech intelligibility, voice similarity, talker-identification effort).
In the spatial mode, presentations at lateral loudspeaker locations entailed quicker behavioral responses, which were significantly slower in comparison to a talker-localization task.
Under clean speech, response times globally increased in the spatial vs.
non-spatial mode (across all locations); these “response time switch costs,” presumably being caused by repeated switching of spatial auditory attention between different locations, diminished under degraded speech.
No significant effects of spatialization on subjective ratings were found.
The results suggested that when listeners could utilize task-relevant auditory cues about talker location, they continued to rely on voice recognition instead of localization of talker sound sources as primary response strategy.
Besides, the presence of speech degradations may have led to increased cognitive control, which in turn compensated for incurring response time switch costs.
Related Results
Cometary Physics Laboratory: spectrophotometric experiments
Cometary Physics Laboratory: spectrophotometric experiments
<p><strong><span dir="ltr" role="presentation">1. Introduction</span></strong&...
Xie, Liu, & Jaeger (2020). Cross-talker generalization during foreign-accented speech perception
Xie, Liu, & Jaeger (2020). Cross-talker generalization during foreign-accented speech perception
Speech perception depends on the ability to generalize previously experienced input effectively across talkers. How such cross-talker generalization is achieved has remained an ope...
Recognizing voices through a cochlear implant: A systematic review
Recognizing voices through a cochlear implant: A systematic review
Objective: Some cochlear implant (CI) users report having difficulty accessing indexical information in the speech signal, presumably due to the transformation from acoustic to ele...
Talker variability facilitates the statistical learning of speech sounds
Talker variability facilitates the statistical learning of speech sounds
Natural speech contains many sources of acoustic variability both within and between talkers, which challenges speech recognition in some contexts but may facilitate language under...
Neural Speech-Tracking During Selective Attention: A Spatially Realistic Audiovisual Study
Neural Speech-Tracking During Selective Attention: A Spatially Realistic Audiovisual Study
Abstract
Paying attention to a target talker in multi-talker scenarios is associated with its more accurate neural-tracking relative to competing non-target speech....
Effects of talker uncertainty I: Auditory word recognition
Effects of talker uncertainty I: Auditory word recognition
The production and resulting acoustic composition of spoken words vary as functions of individual talker characteristics. However, the effects of talker differences on auditory wor...
Why are listeners hindered by talker variability?
Why are listeners hindered by talker variability?
AbstractThough listeners readily recognize speech from a variety of talkers, accommodating talker variability comes at a cost: Myriad studies have shown that listeners are slower t...
Talker and accent familiarity yield advantages for voice identity perception: a voice sorting study
Talker and accent familiarity yield advantages for voice identity perception: a voice sorting study
Familiarity benefits in voice identity perception have been frequently described in the literature. Typically, studies have contrasted listeners who were either familiar or unfamil...

