Language
English
Publication Date
10-15-2025
Journal
The Journal of Neuroscience
DOI
10.1523/JNEUROSCI.1037-25.2025
PMID
40930977
PMCID
PMC12528836
PubMedCentral® Posted Date
9-10-2025
PubMedCentral® Full Text Version
Post-print
Abstract
Human speech perception is multisensory, integrating auditory information from the talker's voice with visual information from the talker's face. BOLD fMRI studies have implicated the superior temporal gyrus (STG) in processing auditory speech and the superior temporal sulcus (STS) in integrating auditory and visual speech, but as an indirect hemodynamic measure, fMRI is limited in its ability to track the rapid neural computations underlying speech perception. Using stereoelectroencephalography (sEEG) electrodes, we directly recorded from the STG and STS in 42 epilepsy patients (25F, 17M). Participants identified single words presented in auditory, visual, and audiovisual formats with and without added auditory noise. Seeing the talker's face provided a strong perceptual benefit, improving perception of noisy speech in every participant. Neurally, a subpopulation of electrodes concentrated in mid-posterior STG and STS responded to both auditory speech (latency 71 ms) and visual speech (109 ms). Significant multisensory enhancement was observed, especially in the upper bank of the STS: compared with auditory-only speech, the response latency for audiovisual speech was 40% faster and the response amplitude was 18% larger. In contrast, STG showed neither faster nor larger multisensory responses. Surprisingly, STS response latencies for audiovisual speech were significantly faster than those in the STG (50 ms vs 64 ms), suggesting a parallel pathway model in which the STG plays the primary role in auditory-only speech perception, while the STS takes the lead in audiovisual speech perception. Together with fMRI, sEEG provides converging evidence that STS plays a key role in multisensory integration.
Keywords
Humans, Male, Female, Speech Perception, Temporal Lobe, Adult, Visual Perception, Electroencephalography, Young Adult, Photic Stimulation, Acoustic Stimulation, Magnetic Resonance Imaging, Stereotaxic Techniques, Adolescent, Middle Aged, Auditory Perception, Brain Mapping, audiovisual, multisensory, sEEG, temporal cortex
Published Open-Access
yes
Recommended Citation
Zhang, Yue; Magnotti, John F; Zhang, Xiang; et al., "Stereoelectroencephalography Reveals Neural Signatures of Multisensory Integration in the Human Superior Temporal Sulcus during Audiovisual Speech Perception" (2025). Faculty and Staff Publications. 4905.
https://digitalcommons.library.tmc.edu/baylor_docs/4905
Included in
Health Services Research Commons, Medical Cell Biology Commons, Medical Molecular Biology Commons, Medical Specialties Commons, Microbiology Commons