Overt speech decoding from cortical activity: a comparison of different linear methods

被引:1
|
作者
Le Godais, Gael [1 ]
Roussel, Philemon [1 ]
Bocquelet, Florent [1 ]
Aubert, Marc [1 ]
Kahane, Philippe [1 ,2 ]
Chabardes, Stephan [1 ,3 ]
Yvert, Blaise [1 ]
机构
[1] Univ Grenoble Alpes, Grenoble Inst Neurosci, INSERM, U1216, Grenoble, France
[2] CHU Grenoble Alpes, Dept Neurol, Grenoble, France
[3] Univ Grenoble Alpes, CHU Grenoble Alpes, Clinatec, Grenoble, France
来源
基金
欧盟地平线“2020”;
关键词
decoding; ECoG; brain-computer interface; linear methods; speech prostheses; intracranial recordings; articulatory synthesis; HUMAN SENSORIMOTOR CORTEX; ORGANIZATION;
D O I
10.3389/fnhum.2023.1124065
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
IntroductionSpeech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct speech audio signal frame by frame on a millisecond-timescale. Such approaches require fast computation. In this respect, linear decoder are good candidates and have been widely used in motor BCIs. Yet, they have been very seldomly studied for speech reconstruction, and never for reconstruction of articulatory movements from intracranial activity. Here, we compared vanilla linear regression, ridge-regularized linear regressions, and partial least squares regressions for offline decoding of overt speech from cortical activity. MethodsTwo decoding paradigms were investigated: (1) direct decoding of acoustic vocoder features of speech, and (2) indirect decoding of vocoder features through an intermediate articulatory representation chained with a real-time-compatible DNN-based articulatory-to-acoustic synthesizer. Participant's articulatory trajectories were estimated from an electromagnetic-articulography dataset using dynamic time warping. The accuracy of the decoders was evaluated by computing correlations between original and reconstructed features. ResultsWe found that similar performance was achieved by all linear methods well above chance levels, albeit without reaching intelligibility. Direct and indirect methods achieved comparable performance, with an advantage for direct decoding. DiscussionFuture work will address the development of an improved neural speech decoder compatible with fast frame-by-frame speech reconstruction from ongoing activity at a millisecond timescale.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Decoding Speech from Cortical Surface Electrical Activity
    Moses, David A.
    Liu, Patricia M.
    Chang, Edward F.
    NEW ENGLAND JOURNAL OF MEDICINE, 2021, 385 (16):
  • [2] A quantitative comparison of linear and non-linear models of motor cortical activity for the encoding and decoding of arm motions
    Gao, Y
    Black, MJ
    Bienenstock, E
    Wu, W
    Donoghue, JP
    1ST INTERNATIONAL IEEE EMBS CONFERENCE ON NEURAL ENGINEERING 2003, CONFERENCE PROCEEDINGS, 2003, : 189 - 192
  • [3] COMPARISON OF DIFFERENT SPEECH ENHANCEMENT METHODS ON RECOGNITION OF NOISY SPEECH
    AHMED, MS
    ALMARZOUG, AM
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1994, 19 (01): : 45 - 56
  • [4] Decoding speech intent from non-frontal cortical areas
    Prakash, Prashanth Ravi
    Lei, Tianhao
    Flint, Robert D.
    Hsieh, Jason K.
    Fitzgerald, Zachary
    Mugler, Emily
    Templer, Jessica
    Goldrick, Matthew A.
    Tate, Matthew C.
    Rosenow, Joshua
    Glaser, Joshua
    Slutzky, Marc W.
    JOURNAL OF NEURAL ENGINEERING, 2025, 22 (01)
  • [5] Comparisons between Linear and Nonlinear Methods for Decoding Motor Cortical Activities of Monkey
    Xu, Kai
    Wang, Yueming
    Zhang, Shaomin
    Zhao, Ting
    Wang, Yiwen
    Chen, Weidong
    Zheng, Xiaoxiang
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 4207 - 4210
  • [6] Decoding of temporal intervals from cortical ensemble activity
    Lebedev, Mikhail A.
    O'Doherty, Joseph E.
    Nicolelis, Miguel A. L.
    JOURNAL OF NEUROPHYSIOLOGY, 2008, 99 (01) : 166 - 186
  • [7] A comparison of normals' and aphasics' ability to plan respiratory activity in overt and covert speech
    Hoole, P
    Ziegler, W
    SPEECH PRODUCTION: MOTOR CONTROL, BRAIN RESEARCH AND FLUENCY DISORDERS, 1997, 1146 : 205 - 211
  • [8] Comparison of different cortical current density methods for electric source reconstruction of distributed cortical activity: A simulation study
    Sick, C
    Huppertz, HJ
    Kristeva-Feige, R
    NEUROIMAGE, 2001, 13 (06) : S246 - S246
  • [9] Cortical linear encoding and decoding of sounds: Similarities and differences between naturalistic speech and music listening
    Simon, Adele
    Bech, Soren
    Loquet, Gerard
    Ostergaard, Jan
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2024, 59 (08) : 2059 - 2074
  • [10] Decoding grasp and speech signals from the cortical grasp circuit in a tetraplegic human
    Wandelt, Sarah K.
    Kellis, Spencer
    Bjanes, David A.
    Pejsa, Kelsie
    Lee, Brian
    Liu, Charles
    Andersen, Richard A.
    NEURON, 2022, 110 (11) : 1777 - +