Real-time decoding of question-and-answer speech dialogue using human cortical activity

Cited by: 144

Authors:

Moses, David A. [1,2]

Leonard, Matthew K. [1,2]

Makin, Joseph G. [1,2]

Chang, Edward F. [1,2]

Affiliations:

[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA

[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA

Source:

NATURE COMMUNICATIONS | 2019 / Vol. 10 / Issue 1

Keywords:

HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR;

DOI:

10.1038/s41467-019-10994-4

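Chinese Library Classification (CLC):

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [Natural Sciences, General];

Discipline classification codes: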

07 ; 0710 ; 09 ;

Abstract:

Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
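
The context-integration idea in the abstract can be illustrated with a small Bayesian update. The following is only a minimal sketch, not the authors' implementation: the function name `integrate_context`, the table `answer_given_question`, the uniform prior over questions, and all numbers are hypothetical and chosen for illustration.

```python
def integrate_context(question_likelihoods, answer_likelihoods, answer_given_question):
    """Combine decoded question likelihoods with answer likelihoods.

    Assumptions (not taken from the paper): questions have a uniform prior,
    and answer_given_question[i][j] is a hand-specified probability that
    answer j is a plausible response to question i.
    """
    # Normalize question likelihoods into a distribution over questions.
    q_total = sum(question_likelihoods)
    q_post = [q / q_total for q in question_likelihoods]

    # Context prior over answers: p(a_j) = sum_i p(a_j | q_i) * p(q_i).
    n_answers = len(answer_likelihoods)
    prior = [
        sum(q_post[i] * answer_given_question[i][j] for i in range(len(q_post)))
        for j in range(n_answers)
    ]

    # Posterior over answers is proportional to prior times likelihood.
    unnormalized = [p * l for p, l in zip(prior, answer_likelihoods)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Hypothetical example: 2 questions, 3 answers.
# Answers 0 and 1 are plausible only for question 0; answer 2 only for question 1.
answer_given_question = [
    [0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0],
]
question_likelihoods = [0.9, 0.1]     # decoder favors question 0
answer_likelihoods = [0.3, 0.3, 0.4]  # neural evidence alone slightly favors answer 2

posterior = integrate_context(
    question_likelihoods, answer_likelihoods, answer_given_question
)
print(posterior)
```

In this toy case the answer likelihoods alone would pick answer 2, but the decoded question context shifts most of the probability mass to answers 0 and 1, which are the only plausible responses to the favored question.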

Pages: 14

50 items in total

[41] High-Performance Real-Time Human Activity Recognition Using Machine Learning
Thottempudi, Pardhu
Acharya, Biswaranjan
Moreira, Fernando
MATHEMATICS, 2024, 12 (22)
[42] Characterizing Covert Articulation in Apraxic Speech Using Real-time MRI
Hagedorn, Christina
Proctor, Michael
Goldstein, Louis
Tempini, Maria Luisa Gorno
Narayanan, Shrikanth S.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1050 - 1053
[43] Real-time implementation of speech recognition using RISC processor core
Chang, CT
Chang, CT
Yang, HL
Chang, HT
NINTH ANNUAL IEEE INTERNATIONAL ASIC CONFERENCE AND EXHIBIT, PROCEEDINGS, 1996, : 231 - 234
[44] Real-time binaural target speech extraction using phase unwrapping
Saito E.
Kawamura A.
IEEJ Transactions on Electronics, Information and Systems, 2021, 141 (10) : 1077 - 1086
[45] Learning patterns of activity using real-time tracking
Stauffer, C
Grimson, WEL
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 747 - 757
[46] Real-time recognition of activity using temporal templates
Bobick, A
Davis, J
THIRD IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV '96, PROCEEDINGS, 1996, : 39 - 42
[47] REAL-TIME SPEECH ENHANCEMENT SYSTEM USING ENVELOPE EXPANSION TECHNIQUE
CLARKSON, PM
BAHGAT, S
ELECTRONICS LETTERS, 1989, 25 (17) : 1186 - 1188
[48] Real-time Gesture Animation Generation from Speech for Virtual Human Interaction
Rebol, Manuel
Guetl, Christian
Pietroszek, Krzysztof
EXTENDED ABSTRACTS OF THE 2021 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'21), 2021,
[49] INFLUENCE OF HUMAN-FACTORS ON PERFORMANCE OF A REAL-TIME SPEECH RECOGNITION SYSTEM
CARPENTER, BE
LAVINGTON, SH
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01) : 42 - 45
[50] A Real-Time Human Imitation System Using Kinect
Ou, Yongsheng
Hu, Jianbing
Wang, Zhiyang
Fu, Yiqun
Wu, Xinyu
Li, Xiaoyun
INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2015, 7 (05) : 587 - 600