Overt speech decoding from cortical activity: a comparison of different linear methods

Cited by: 1

Authors:

Le Godais, Gael [1]

Roussel, Philemon [1]

Bocquelet, Florent [1]

Aubert, Marc [1]

Kahane, Philippe [1,2]

Chabardes, Stephan [1,3]

Yvert, Blaise [1]

Affiliations:

[1] Univ Grenoble Alpes, Grenoble Inst Neurosci, INSERM, U1216, Grenoble, France

[2] CHU Grenoble Alpes, Dept Neurol, Grenoble, France

[3] Univ Grenoble Alpes, CHU Grenoble Alpes, Clinatec, Grenoble, France

Source:

Frontiers in Human Neuroscience | 2023 / Vol. 17

Funding:

European Union Horizon 2020;

Keywords:

decoding; ECoG; brain-computer interface; linear methods; speech prostheses; intracranial recordings; articulatory synthesis; HUMAN SENSORIMOTOR CORTEX; ORGANIZATION;

DOI:

10.3389/fnhum.2023.1124065

Chinese Library Classification (CLC) number:

Q189 [Neuroscience];

Discipline classification code:

071006 ;

Abstract:

Introduction: Speech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct the speech audio signal frame by frame on a millisecond timescale. Such approaches require fast computation. In this respect, linear decoders are good candidates and have been widely used in motor BCIs. Yet, they have seldom been studied for speech reconstruction, and never for reconstruction of articulatory movements from intracranial activity. Here, we compared vanilla linear regression, ridge-regularized linear regressions, and partial least squares regressions for offline decoding of overt speech from cortical activity.

Methods: Two decoding paradigms were investigated: (1) direct decoding of acoustic vocoder features of speech, and (2) indirect decoding of vocoder features through an intermediate articulatory representation chained with a real-time-compatible DNN-based articulatory-to-acoustic synthesizer. Participant's articulatory trajectories were estimated from an electromagnetic-articulography dataset using dynamic time warping. The accuracy of the decoders was evaluated by computing correlations between original and reconstructed features.

Results: We found that similar performance was achieved by all linear methods well above chance levels, albeit without reaching intelligibility. Direct and indirect methods achieved comparable performance, with an advantage for direct decoding.

Discussion: Future work will address the development of an improved neural speech decoder compatible with fast frame-by-frame speech reconstruction from ongoing activity at a millisecond timescale.
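The comparison described in the abstract (fit several linear decoders, then score each by the correlation between original and reconstructed features) can be illustrated with a minimal sketch. This is not the authors' pipeline: the data below are synthetic, the array sizes, noise level, and ridge penalty `alpha=50.0` are arbitrary assumptions, the helper names (`fit_linear`, `predict`, `feature_correlations`) are hypothetical, and partial least squares is left out to keep the example short.

```python
import numpy as np

def fit_linear(X, Y, alpha=0.0):
    """Least-squares decoder; alpha > 0 adds a ridge penalty (intercept unpenalized)."""
    x_mean, y_mean = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - x_mean, Y - y_mean
    if alpha == 0.0:
        # Vanilla linear regression
        W = np.linalg.lstsq(Xc, Yc, rcond=None)[0]
    else:
        # Ridge closed form: (X'X + alpha*I)^-1 X'Y
        W = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ Yc)
    b = y_mean - x_mean @ W
    return W, b

def predict(X, W, b):
    return X @ W + b

def feature_correlations(Y_true, Y_pred):
    """Pearson correlation between original and reconstructed values, per feature."""
    yt = Y_true - Y_true.mean(axis=0)
    yp = Y_pred - Y_pred.mean(axis=0)
    return (yt * yp).sum(axis=0) / np.sqrt((yt ** 2).sum(axis=0) * (yp ** 2).sum(axis=0))

# Synthetic stand-ins (assumptions): X plays the role of neural features per frame,
# Y the role of vocoder or articulatory features to reconstruct.
rng = np.random.default_rng(0)
n_frames, n_neural, n_feat = 600, 40, 5
X = rng.standard_normal((n_frames, n_neural))
W_true = rng.standard_normal((n_neural, n_feat))
Y = X @ W_true + 5.0 * rng.standard_normal((n_frames, n_feat))

split = 400  # first 400 frames for training, the rest for testing
results = {}
for name, alpha in [("ols", 0.0), ("ridge", 50.0)]:
    W, b = fit_linear(X[:split], Y[:split], alpha)
    results[name] = feature_correlations(Y[split:], predict(X[split:], W, b))
    print(name, "mean test correlation:", results[name].mean().round(3))
```

Once fitted, such a decoder predicts each frame with a single matrix multiplication, which is why linear methods suit the fast frame-by-frame reconstruction the abstract targets.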

Pages: 13
