Neural Measures of Pitch Processing in EEG Responses to Running Speech

被引:10
|
作者
Bachmann, Florine L. [1 ]
MacDonald, Ewen N. [2 ]
Hjortkjaer, Jens [1 ,3 ]
机构
[1] Tech Univ Denmark, Dept Hlth Technol, Hearing Syst Sect, Lyngby, Denmark
[2] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON, Canada
[3] Copenhagen Univ Hosp Amager & Hvidovre, Danish Res Ctr Magnet Resonance, Ctr Funct & Diagnost Imaging & Res, Copenhagen, Denmark
关键词
neural tracking; subcortical; running speech; auditory brainstem response; temporal response function; encoding model; EEG; AUDITORY BRAIN-STEM; FREQUENCY-FOLLOWING RESPONSES; FUNDAMENTAL-FREQUENCY; HEARING-LOSS; REPRESENTATION; PERCEPTION; TRACKING; AGE;
D O I
10.3389/fnins.2021.738408
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Linearized encoding models are increasingly employed to model cortical responses to running speech. Recent extensions to subcortical responses suggest clinical perspectives, potentially complementing auditory brainstem responses (ABRs) or frequency-following responses (FFRs) that are current clinical standards. However, while it is well-known that the auditory brainstem responds both to transient amplitude variations and the stimulus periodicity that gives rise to pitch, these features co-vary in running speech. Here, we discuss challenges in disentangling the features that drive the subcortical response to running speech. Cortical and subcortical electroencephalographic (EEG) responses to running speech from 19 normal-hearing listeners (12 female) were analyzed. Using forward regression models, we confirm that responses to the rectified broadband speech signal yield temporal response functions consistent with wave V of the ABR, as shown in previous work. Peak latency and amplitude of the speech-evoked brainstem response were correlated with standard click-evoked ABRs recorded at the vertex electrode (Cz). Similar responses could be obtained using the fundamental frequency (F0) of the speech signal as model predictor. However, simulations indicated that dissociating responses to temporal fine structure at the F0 from broadband amplitude variations is not possible given the high co-variance of the features and the poor signal-to-noise ratio (SNR) of subcortical EEG responses. In cortex, both simulations and data replicated previous findings indicating that envelope tracking on frontal electrodes can be dissociated from responses to slow variations in F0 (relative pitch). Yet, no association between subcortical F0-tracking and cortical responses to relative pitch could be detected. These results indicate that while subcortical speech responses are comparable to click-evoked ABRs, dissociating pitch-related processing in the auditory brainstem may be challenging with natural speech stimuli.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Piano training enhances the neural processing of pitch and improves speech perception in Mandarin-speaking children
    Nan, Yun
    Liu, Li
    Geiser, Eveline
    Shu, Hua
    Gong, Chen Chen
    Dong, Qi
    Gabrieli, John D. E.
    Desimone, Robert
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (28) : E6630 - E6639
  • [22] The neural basis of sublexical speech and corresponding nonspeech processing: A combined EEG-MEG study
    Kuuluvainen, Soila
    Nevalainen, Paivi
    Sorokin, Alexander
    Mittag, Maria
    Partanen, Eino
    Putkinen, Vesa
    Seppanen, Miia
    Kahkonen, Seppo
    Kujala, Teija
    BRAIN AND LANGUAGE, 2014, 130 : 19 - 32
  • [23] Pitch detection and formant analysis of Arabic speech processing
    Cherif, A
    Bouafif, L
    Dabbabi, T
    APPLIED ACOUSTICS, 2001, 62 (10) : 1129 - 1140
  • [24] Pitch Processing of Speech: Comparison of Psychoacoustic and Electrophysiological Data
    Wrzosek, Malgorzata
    Maculewicz, Justyna
    Hafke-Dys, Honorata
    Nowik, Agnieszka
    Preis, Anna
    Kroliczak, Grzegorz
    ARCHIVES OF ACOUSTICS, 2013, 38 (03) : 375 - 381
  • [25] Optimal pitch bases expansions in speech signal processing
    Nickel, RM
    Oswal, SP
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1885 - 1889
  • [26] Speech transmission index from running speech: A neural network approach
    Li, FF
    Cox, TJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 113 (04): : 1999 - 2008
  • [27] Information Geometry Theoretic Measures for Characterizing Neural Information Processing from Simulated EEG Signals
    Hua, Jia-Chen
    Kim, Eun-jin
    He, Fei
    ENTROPY, 2024, 26 (03)
  • [28] Neural specializations for speech and pitch: moving beyond the dichotomies
    Zatorre, Robert J.
    Gandour, Jackson T.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1493) : 1087 - 1104
  • [29] The neural bases underlying pitch processing difficulties
    Foxton, Jessica M.
    Weisz, Nathan
    Bauchet-Lecaignard, Francoise
    Delpuech, Claude
    Bertrand, Olivier
    NEUROIMAGE, 2009, 45 (04) : 1305 - 1313
  • [30] Decoding Envelope and Frequency-Following EEG Responses to Continuous Speech Using Deep Neural Networks
    Thornton, Mike D.
    Mandic, Danilo P.
    Reichenbach, Tobias J.
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 700 - 716