Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Talker variability facilitates the statistical learning of speech sounds

View through CrossRef
Natural speech contains many sources of acoustic variability both within and between talkers, which challenges speech recognition in some contexts but may facilitate language understanding in novel listening situations. Despite this ubiquitous variability, most previous studies that have examined the ability to extract sound patterns in speech—known as statistical learning—have used highly controlled, artificial, monotonic streams of syllables. Thus, it is unknown whether variability in speech may help or hinder statistical learning – an important question to resolve if statistical learning does indeed play a role in the segmentation of naturally spoken language, as widely theorized. Here, we assessed whether the use of naturally produced, variable speech sounds produced by multiple talkers benefits or impairs statistical learning, including the ability to generalize patterns to a novel talker. During training, participants listened to approximately 12 minutes of continuous speech made up of repeating trisyllabic words, spoken either by a single talker (single-talker condition) or four talkers speaking for three minutes each (multiple-talker condition). Post-training, all participants completed three assessments of learning: (1) an explicit familiarity rating task, (2) an explicit forced-choice recognition task, and (3) an implicit syllable target detection task. Results indicated that participants in both training conditions showed evidence of statistical learning across all assessments, providing an important demonstration that statistical learning is robust to additional variability in the speech signal. Further, in both the forced-choice recognition task and target detection task, participants in the multiple-talker condition showed evidence of facilitated statistical learning, particularly when listening to a novel talker. In the familiarity rating task, performance was comparable between conditions; however, participants trained with multiple talkers were less likely to conflate word familiarity with talker voice familiarity. Overall, these results suggest that training with multiple talkers can improve aspects of statistical learning across multiple measures of learning.
Title: Talker variability facilitates the statistical learning of speech sounds
Description:
Natural speech contains many sources of acoustic variability both within and between talkers, which challenges speech recognition in some contexts but may facilitate language understanding in novel listening situations.
Despite this ubiquitous variability, most previous studies that have examined the ability to extract sound patterns in speech—known as statistical learning—have used highly controlled, artificial, monotonic streams of syllables.
Thus, it is unknown whether variability in speech may help or hinder statistical learning – an important question to resolve if statistical learning does indeed play a role in the segmentation of naturally spoken language, as widely theorized.
Here, we assessed whether the use of naturally produced, variable speech sounds produced by multiple talkers benefits or impairs statistical learning, including the ability to generalize patterns to a novel talker.
During training, participants listened to approximately 12 minutes of continuous speech made up of repeating trisyllabic words, spoken either by a single talker (single-talker condition) or four talkers speaking for three minutes each (multiple-talker condition).
Post-training, all participants completed three assessments of learning: (1) an explicit familiarity rating task, (2) an explicit forced-choice recognition task, and (3) an implicit syllable target detection task.
Results indicated that participants in both training conditions showed evidence of statistical learning across all assessments, providing an important demonstration that statistical learning is robust to additional variability in the speech signal.
Further, in both the forced-choice recognition task and target detection task, participants in the multiple-talker condition showed evidence of facilitated statistical learning, particularly when listening to a novel talker.
In the familiarity rating task, performance was comparable between conditions; however, participants trained with multiple talkers were less likely to conflate word familiarity with talker voice familiarity.
Overall, these results suggest that training with multiple talkers can improve aspects of statistical learning across multiple measures of learning.

Related Results

Xie, Liu, & Jaeger (2020). Cross-talker generalization during foreign-accented speech perception
Xie, Liu, & Jaeger (2020). Cross-talker generalization during foreign-accented speech perception
Speech perception depends on the ability to generalize previously experienced input effectively across talkers. How such cross-talker generalization is achieved has remained an ope...
Recognizing voices through a cochlear implant: A systematic review
Recognizing voices through a cochlear implant: A systematic review
Objective: Some cochlear implant (CI) users report having difficulty accessing indexical information in the speech signal, presumably due to the transformation from acoustic to ele...
Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Pr...
Neural Speech-Tracking During Selective Attention: A Spatially Realistic Audiovisual Study
Neural Speech-Tracking During Selective Attention: A Spatially Realistic Audiovisual Study
Abstract Paying attention to a target talker in multi-talker scenarios is associated with its more accurate neural-tracking relative to competing non-target speech....
Effects of talker uncertainty I: Auditory word recognition
Effects of talker uncertainty I: Auditory word recognition
The production and resulting acoustic composition of spoken words vary as functions of individual talker characteristics. However, the effects of talker differences on auditory wor...
Why are listeners hindered by talker variability?
Why are listeners hindered by talker variability?
AbstractThough listeners readily recognize speech from a variety of talkers, accommodating talker variability comes at a cost: Myriad studies have shown that listeners are slower t...
Talker and accent familiarity yield advantages for voice identity perception: a voice sorting study
Talker and accent familiarity yield advantages for voice identity perception: a voice sorting study
Familiarity benefits in voice identity perception have been frequently described in the literature. Typically, studies have contrasted listeners who were either familiar or unfamil...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Back to Top