Javascript must be enabled to continue!
Audiovisual Speech Perception and the McGurk Effect
View through CrossRef
Research on visual and audiovisual speech information has profoundly influenced the fields of psycholinguistics, perception psychology, and cognitive neuroscience. Visual speech findings have provided some of most the important human demonstrations of our new conception of the perceptual brain as being supremely multimodal. This “multisensory revolution” has seen a tremendous growth in research on how the senses integrate, cross-facilitate, and share their experience with one another.
The ubiquity and apparent automaticity of multisensory speech has led many theorists to propose that the speech brain is agnostic with regard to sense modality: it might not know or care from which modality speech information comes. Instead, the speech function may act to extract supramodal informational patterns that are common in form across energy streams. Alternatively, other theorists have argued that any common information existent across the modalities is minimal and rudimentary, so that multisensory perception largely depends on the observer’s associative experience between the streams. From this perspective, the auditory stream is typically considered primary for the speech brain, with visual speech simply appended to its processing. If the utility of multisensory speech is a consequence of a supramodal informational coherence, then cross-sensory “integration” may be primarily a consequence of the informational input itself. If true, then one would expect to see evidence for integration occurring early in the perceptual process, as well in a largely complete and automatic/impenetrable manner. Alternatively, if multisensory speech perception is based on associative experience between the modal streams, then no constraints on how completely or automatically the senses integrate are dictated. There is behavioral and neurophysiological research supporting both perspectives.
Much of this research is based on testing the well-known McGurk effect, in which audiovisual speech information is thought to integrate to the extent that visual information can affect what listeners report hearing. However, there is now good reason to believe that the McGurk effect is not a valid test of multisensory integration. For example, there are clear cases in which responses indicate that the effect fails, while other measures suggest that integration is actually occurring. By mistakenly conflating the McGurk effect with speech integration itself, interpretations of the completeness and automaticity of multisensory may be incorrect. Future research should use more sensitive behavioral and neurophysiological measures of cross-modal influence to examine these issues.
Title: Audiovisual Speech Perception and the McGurk Effect
Description:
Research on visual and audiovisual speech information has profoundly influenced the fields of psycholinguistics, perception psychology, and cognitive neuroscience.
Visual speech findings have provided some of most the important human demonstrations of our new conception of the perceptual brain as being supremely multimodal.
This “multisensory revolution” has seen a tremendous growth in research on how the senses integrate, cross-facilitate, and share their experience with one another.
The ubiquity and apparent automaticity of multisensory speech has led many theorists to propose that the speech brain is agnostic with regard to sense modality: it might not know or care from which modality speech information comes.
Instead, the speech function may act to extract supramodal informational patterns that are common in form across energy streams.
Alternatively, other theorists have argued that any common information existent across the modalities is minimal and rudimentary, so that multisensory perception largely depends on the observer’s associative experience between the streams.
From this perspective, the auditory stream is typically considered primary for the speech brain, with visual speech simply appended to its processing.
If the utility of multisensory speech is a consequence of a supramodal informational coherence, then cross-sensory “integration” may be primarily a consequence of the informational input itself.
If true, then one would expect to see evidence for integration occurring early in the perceptual process, as well in a largely complete and automatic/impenetrable manner.
Alternatively, if multisensory speech perception is based on associative experience between the modal streams, then no constraints on how completely or automatically the senses integrate are dictated.
There is behavioral and neurophysiological research supporting both perspectives.
Much of this research is based on testing the well-known McGurk effect, in which audiovisual speech information is thought to integrate to the extent that visual information can affect what listeners report hearing.
However, there is now good reason to believe that the McGurk effect is not a valid test of multisensory integration.
For example, there are clear cases in which responses indicate that the effect fails, while other measures suggest that integration is actually occurring.
By mistakenly conflating the McGurk effect with speech integration itself, interpretations of the completeness and automaticity of multisensory may be incorrect.
Future research should use more sensitive behavioral and neurophysiological measures of cross-modal influence to examine these issues.
Related Results
Audiovisual Speech Perception in Aging Cochlear Implant Users and Age-Matched Non-Implanted Adults
Audiovisual Speech Perception in Aging Cochlear Implant Users and Age-Matched Non-Implanted Adults
Objectives. Older typical-hearing adults without a cochlear-implant (CI) have been found to exhibit greater multisensory benefits when identifying audiovisual speech than younger n...
Les illusions McGurk dans la parole : 25 ans de recherches
Les illusions McGurk dans la parole : 25 ans de recherches
Summary : The McGurk illusions in speech: 25years of research.
When presented with an auditory /b/ dubbed onto a visual /g/, listeners perceive sometimes a fused phoneme like ...
McGurk doesn’t work: Individual differences and task demands explain the McGurk illusion
McGurk doesn’t work: Individual differences and task demands explain the McGurk illusion
Visual speech cues play an important role in speech recognition, and the McGurk effect is a classic demonstration of this. In the original McGurk and MacDonald (1976) experiment, 9...
Artivismo, activismo y sin autoría audiovisual: el caso del colectivo Cine sin Autor (CsA) / Artivism, activism and audiovisual authorship: the case of Cine sin Autor (CsA)
Artivismo, activismo y sin autoría audiovisual: el caso del colectivo Cine sin Autor (CsA) / Artivism, activism and audiovisual authorship: the case of Cine sin Autor (CsA)
Resumen: El cine sin autor es un proceso de producción audiovisual que problematiza algunas de las ideas canónicas o fijas de la traslación de la autoridad desde un individuo hasta...
The McGurk Effect Across Languages
The McGurk Effect Across Languages
The McGurk effect denotes a phenomenon of speech perception where a listener attends to mismatched audio and visual stimuli and perceives an illusory third sound, typically a confl...
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
Background: According to Vygotsky’s sociocultural theory, private speech functions both as a tool for thought regulation and as a transitional form between outer and inner speech. ...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...

