Real-time decoding of question-and-answer speech dialogue using human cortical activity

被引:144
|
作者
Moses, David A. [1 ,2 ]
Leonard, Matthew K. [1 ,2 ]
Makin, Joseph G. [1 ,2 ]
Chang, Edward F. [1 ,2 ]
机构
[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
关键词
HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR;
D O I
10.1038/s41467-019-10994-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Real-time optical imaging of cortical interictal activity in primates and humans
    Haglund, MM
    Hochman, DW
    EPILEPSIA, 2005, 46 : 270 - 271
  • [22] Real-time analysis of preprotachykinin promoter activity in single cortical neurons
    Walker, PD
    Andrade, R
    Quinn, JP
    Bannon, MJ
    JOURNAL OF NEUROCHEMISTRY, 2000, 75 (02) : 882 - 885
  • [23] Using Speech Recognition for Real-Time Captioning of Multiple Speakers
    Wald, Mike
    Bain, Keith
    IEEE MULTIMEDIA, 2008, 15 (04) : 56 - 57
  • [24] SENSORIMOTOR ADAPTATION OF SPEECH USING REAL-TIME ARTICULATORY RESYNTHESIS
    Berry, Jeff
    North, Cassandra
    Johnson, Michael T.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Real-time imaging of cortical and subcortical control of muscle sympathetic nerve activity in awake human subjects
    James, Cheree
    Macefield, Vaughan G.
    Henderson, Luke A.
    NEUROIMAGE, 2013, 70 : 59 - 65
  • [26] Real-time processing of speech signals using networked computers
    Pendse, R
    Yip, AW
    Hoyer, EA
    40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 802 - 805
  • [27] Real-time evaluation of human secretin receptor activity using cytosensor microphysiometry
    Ng, SSM
    Pang, RTK
    Chow, BKC
    Cheng, CHK
    JOURNAL OF CELLULAR BIOCHEMISTRY, 1999, 72 (04) : 517 - 527
  • [28] An Efficient Technique for Real-Time Human Activity Classification Using Accelerometer Data
    Biagetti, Giorgio
    Crippa, Paolo
    Falaschetti, Laura
    Orcioni, Simone
    Turchetti, Claudio
    INTELLIGENT DECISION TECHNOLOGIES 2016, PT I, 2016, 56 : 425 - 434
  • [29] Real-Time Multi-Modal Human-Robot Collaboration Using Gestures and Speech
    Chen, Haodong
    Leu, Ming C.
    Yin, Zhaozheng
    JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2022, 144 (10):
  • [30] Decoding Of Articulatory Gestures During Word Production Using Speech Motor And Premotor Cortical Activity
    Mugler, Emily M.
    Goldrick, Matthew
    Rosenow, Joshua M.
    Tate, Matthew C.
    Slutzky, Marc W.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 5339 - 5342