Language

English

Publication Date

10-15-2025

Journal

The Journal of Neuroscience

DOI

10.1523/JNEUROSCI.1037-25.2025

PMID

40930977

PMCID

PMC12528836

PubMedCentral® Posted Date

9-10-2025

PubMedCentral® Full Text Version

Post-print

Abstract

Human speech perception is multisensory, integrating auditory information from the talker's voice with visual information from the talker's face. BOLD fMRI studies have implicated the superior temporal gyrus (STG) in processing auditory speech and the superior temporal sulcus (STS) in integrating auditory and visual speech, but as an indirect hemodynamic measure, fMRI is limited in its ability to track the rapid neural computations underlying speech perception. Using stereoelectroencephalography (sEEG) electrodes, we directly recorded from the STG and STS in 42 epilepsy patients (25F, 17M). Participants identified single words presented in auditory, visual, and audiovisual formats with and without added auditory noise. Seeing the talker's face provided a strong perceptual benefit, improving perception of noisy speech in every participant. Neurally, a subpopulation of electrodes concentrated in mid-posterior STG and STS responded to both auditory speech (latency 71 ms) and visual speech (109 ms). Significant multisensory enhancement was observed, especially in the upper bank of the STS: compared with auditory-only speech, the response latency for audiovisual speech was 40% faster and the response amplitude was 18% larger. In contrast, STG showed neither faster nor larger multisensory responses. Surprisingly, STS response latencies for audiovisual speech were significantly faster than those in the STG (50 ms vs 64 ms), suggesting a parallel pathway model in which the STG plays the primary role in auditory-only speech perception, while the STS takes the lead in audiovisual speech perception. Together with fMRI, sEEG provides converging evidence that STS plays a key role in multisensory integration.

Keywords

Humans, Male, Female, Speech Perception, Temporal Lobe, Adult, Visual Perception, Electroencephalography, Young Adult, Photic Stimulation, Acoustic Stimulation, Magnetic Resonance Imaging, Stereotaxic Techniques, Adolescent, Middle Aged, Auditory Perception, Brain Mapping, audiovisual, multisensory, sEEG, temporal cortex

Published Open-Access

yes

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.