Real-time decoding of question-and-answer speech dialogue using human cortical activity

Cited by: 144
Authors
Moses, David A. [1, 2]
Leonard, Matthew K. [1, 2]
Makin, Joseph G. [1, 2]
Chang, Edward F. [1, 2]
Affiliations
[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
Keywords
HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR;
DOI
10.1038/s41467-019-10994-4
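Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09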
Abstract
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
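The contextual integration step described in the abstract can be illustrated with a small Bayesian sketch. This is not the authors' implementation: the function names, the example questions and answers, and all probability values below are hypothetical, and the sketch assumes that each question makes its plausible answers equally likely a priori.

```python
def normalize(weights):
    """Scale a dict of non-negative weights so the values sum to 1."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return {key: value / total for key, value in weights.items()}


def context_prior(question_probs, answers_given_question):
    """Answer prior: sum over questions of P(question) * P(answer | question).

    Assumption: answers plausible for a question are equally likely,
    and implausible answers get zero probability.
    """
    prior = {}
    for question, p_question in question_probs.items():
        plausible = answers_given_question[question]
        for answer in plausible:
            prior[answer] = prior.get(answer, 0.0) + p_question / len(plausible)
    return prior


def integrate_context(answer_likelihoods, prior):
    """Posterior over answers, proportional to likelihood times prior."""
    unnormalized = {
        answer: likelihood * prior.get(answer, 0.0)
        for answer, likelihood in answer_likelihoods.items()
    }
    return normalize(unnormalized)


# Hypothetical decoded question probabilities (toy values).
question_probs = normalize(
    {"How is your room?": 0.8, "Which instrument do you like?": 0.2}
)

# Hypothetical mapping from each question to its plausible answers.
answers_given_question = {
    "How is your room?": ["bright", "dark"],
    "Which instrument do you like?": ["piano", "violin"],
}

# Hypothetical answer likelihoods from the neural decoder alone.
answer_likelihoods = {"bright": 0.30, "dark": 0.10, "piano": 0.35, "violin": 0.25}

prior = context_prior(question_probs, answers_given_question)
posterior = integrate_context(answer_likelihoods, prior)

best_without_context = max(answer_likelihoods, key=answer_likelihoods.get)
best_with_context = max(posterior, key=posterior.get)
```

In this toy case the decoder alone would favor "piano", but because the decoded question is most likely about the room, the context-weighted posterior favors "bright" instead.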
Pages: 14