Overt speech decoding from cortical activity: a comparison of different linear methods

Authors
Le Godais, Gael [1]
Roussel, Philemon [1]
Bocquelet, Florent [1]
Aubert, Marc [1]
Kahane, Philippe [1, 2]
Chabardes, Stephan [1, 3]
Yvert, Blaise [1]
Affiliations
[1] Univ Grenoble Alpes, Grenoble Inst Neurosci, INSERM, U1216, Grenoble, France
[2] CHU Grenoble Alpes, Dept Neurol, Grenoble, France
[3] Univ Grenoble Alpes, CHU Grenoble Alpes, Clinatec, Grenoble, France
Funding
EU Horizon 2020;
Keywords
decoding; ECoG; brain-computer interface; linear methods; speech prostheses; intracranial recordings; articulatory synthesis; human sensorimotor cortex; organization;
DOI
10.3389/fnhum.2023.1124065
Chinese Library Classification
Q189 [Neuroscience];
Discipline classification code
071006 ;
Abstract
Introduction: Speech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct the speech audio signal frame by frame on a millisecond timescale. Such approaches require fast computation. In this respect, linear decoders are good candidates and have been widely used in motor BCIs. Yet they have seldom been studied for speech reconstruction, and never for reconstruction of articulatory movements from intracranial activity. Here, we compared vanilla linear regression, ridge-regularized linear regression, and partial least squares regression for offline decoding of overt speech from cortical activity.

Methods: Two decoding paradigms were investigated: (1) direct decoding of acoustic vocoder features of speech, and (2) indirect decoding of vocoder features through an intermediate articulatory representation chained with a real-time-compatible DNN-based articulatory-to-acoustic synthesizer. Participant's articulatory trajectories were estimated from an electromagnetic-articulography dataset using dynamic time warping. The accuracy of the decoders was evaluated by computing correlations between original and reconstructed features.

Results: All linear methods achieved similar performance, well above chance levels, albeit without reaching intelligibility. Direct and indirect methods achieved comparable performance, with an advantage for direct decoding.

Discussion: Future work will address the development of an improved neural speech decoder compatible with fast frame-by-frame speech reconstruction from ongoing activity at a millisecond timescale.
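The evaluation described in the abstract (fit a linear decoder, then correlate original and reconstructed features) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' pipeline: the data are synthetic, the array sizes, noise level, and ridge penalty are arbitrary, and the helper names `fit_ridge`, `predict`, and `featurewise_correlation` are hypothetical. Partial least squares is omitted to keep the sketch short.

```python
import numpy as np


def fit_ridge(X, Y, alpha=0.0):
    """Closed-form ridge regression; alpha=0 reduces to ordinary least squares."""
    X_mean, Y_mean = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - X_mean, Y - Y_mean
    # Solve (Xc^T Xc + alpha * I) W = Xc^T Yc for the weight matrix W.
    gram = Xc.T @ Xc + alpha * np.eye(X.shape[1])
    W = np.linalg.solve(gram, Xc.T @ Yc)
    b = Y_mean - X_mean @ W
    return W, b


def predict(X, W, b):
    """Frame-by-frame linear prediction of the target features."""
    return X @ W + b


def featurewise_correlation(Y_true, Y_pred):
    """Pearson correlation between original and reconstructed features, per feature."""
    yt = Y_true - Y_true.mean(axis=0)
    yp = Y_pred - Y_pred.mean(axis=0)
    return (yt * yp).sum(axis=0) / np.sqrt((yt ** 2).sum(axis=0) * (yp ** 2).sum(axis=0))


# Synthetic stand-ins (assumed): "neural" features X and "vocoder" features Y.
rng = np.random.default_rng(0)
n_frames, n_neural, n_targets = 2000, 40, 5
X = rng.standard_normal((n_frames, n_neural))
W_true = rng.standard_normal((n_neural, n_targets))
Y = X @ W_true + 5.0 * rng.standard_normal((n_frames, n_targets))

# Train on the first 1500 frames, evaluate on the held-out remainder.
split = 1500
results = {}
for name, alpha in [("ols", 0.0), ("ridge", 100.0)]:
    W, b = fit_ridge(X[:split], Y[:split], alpha)
    results[name] = featurewise_correlation(Y[split:], predict(X[split:], W, b))
    print(name, "mean correlation:", round(float(results[name].mean()), 3))
```

Prediction here is a single matrix product per frame, which is the property that makes linear decoders attractive for real-time use. On real recordings the penalty `alpha` would normally be chosen by cross-validation rather than fixed.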
Pages: 13