Real-time decoding of question-and-answer speech dialogue using human cortical activity

Cited by: 144
Authors
Moses, David A. [1 ,2 ]
Leonard, Matthew K. [1 ,2 ]
Makin, Joseph G. [1 ,2 ]
Chang, Edward F. [1 ,2 ]
Affiliations
[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
Keywords
HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR
DOI
10.1038/s41467-019-10994-4
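Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09
Abstract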
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
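The contextual integration step described in the abstract can be illustrated with a small Bayesian sketch. This is not the authors' implementation: the function names (`context_prior`, `integrate_context`), the question-to-answer table, and all numbers are hypothetical, chosen only to show how decoded question likelihoods could reweight answer priors before they are combined with the answer likelihoods.

```python
# Minimal sketch of context integration, assuming a simple Bayesian update.
# All names and numbers below are hypothetical, not taken from the paper.

def context_prior(question_probs, answer_given_question):
    """Answer prior P(a) = sum over q of P(a | q) * P(q | neural data)."""
    total = sum(question_probs)
    q = [p / total for p in question_probs]  # normalize question likelihoods
    n_answers = len(answer_given_question[0])
    return [
        sum(q[i] * answer_given_question[i][j] for i in range(len(q)))
        for j in range(n_answers)
    ]

def integrate_context(answer_likelihoods, prior):
    """Posterior over answers, proportional to likelihood times prior."""
    unnorm = [lik * p for lik, p in zip(answer_likelihoods, prior)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

# Hypothetical toy setup: 2 questions, 3 answers.
# Row i gives P(answer | question i); answers 0 and 1 are only plausible
# for question 0, and answer 2 is only plausible for question 1.
answer_given_question = [
    [0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0],
]
question_probs = [0.9, 0.1]            # decoded question likelihoods
answer_likelihoods = [0.35, 0.25, 0.4]  # answer likelihoods from neural data alone

prior = context_prior(question_probs, answer_given_question)
posterior = integrate_context(answer_likelihoods, prior)

best_without_context = answer_likelihoods.index(max(answer_likelihoods))
best_with_context = posterior.index(max(posterior))
```

In this toy case the answer likelihoods alone favor answer 2, but because the decoded question makes answer 2 implausible, the context-weighted posterior favors answer 0 instead.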
Pages: 14
Related papers
50 in total
  • [1] Real-time decoding of question-and-answer speech dialogue using human cortical activity
    Moses, David A.
    Leonard, Matthew K.
    Makin, Joseph G.
    Chang, Edward F.
    NATURE COMMUNICATIONS, 2019, 10
  • [2] mimir: A Market-Based Real-Time Question and Answer Service
    Hsieh, Gary
    Counts, Scott
    CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2009, : 769 - 778
  • [3] MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech
    Havtorn, Jakob D.
    Latko, Jan
    Edin, Joakim
    Borgholt, Lasse
    Maaloe, Lars
    Belgrano, Lorenzo
    Jacobsen, Nicolai F.
    Sdun, Regitze
    Agic, Zeljko
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2370 - 2380
  • [4] Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity
    Moses, David A.
    Mesgarani, Nima
    Leonard, Matthew K.
    Chang, Edward F.
    JOURNAL OF NEURAL ENGINEERING, 2016, 13 (05)
  • [5] Integration of Speech and Text Processing Modules into a Real-Time Dialogue System
    Ptacek, Jan
    Ircing, Pavel
    Spousta, Miroslav
    Romportl, Jan
    Loose, Zdenek
    Cinkova, Silvie
    Relano Gil, Jose
    Santos, Raul
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 552 - +
  • [6] Real-time imaging of human cortical activity evoked by painful esophageal stimulation
    Hobson, AR
    Furlong, PL
    Worthen, SF
    Hillebrand, A
    Barnes, GR
    Singh, KD
    Aziz, Q
    GASTROENTEROLOGY, 2005, 128 (03) : 610 - 619
  • [7] Real-time classification of auditory sentences using evoked cortical activity in humans
    Moses, David A.
    Leonard, Matthew K.
    Chang, Edward F.
    JOURNAL OF NEURAL ENGINEERING, 2018, 15 (03)
  • [8] Real-time decoding of nonstationary neural activity in motor cortex
    Wu, Wei
    Hatsopoulos, Nicholas G.
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2008, 16 (03) : 213 - 222
  • [9] REAL-TIME SPEECH CODING AND DECODING FOR GSM SYSTEM AND ITS IMPLEMENT IN VC
    Wan, Guojin
    Xu, Qingyi
    Xiao, Jing
    Lu, Sheng
    PROCEEDINGS OF 2011 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND APPLICATION, ICCTA2011, 2011, : 848 - 852
  • [10] Real-time Human Activity Recognition
    Albukhary, N.
    Mustafah, Y. M.
    6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260