Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

被引:0
|
作者
Feng, Sheng [1 ]
Liu, Heyang [1 ]
Wang, Yu [1 ,2 ]
Wang, Yanfeng [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
speech neuroprosthesis; end-to-end; brain-computer interface; large-vocabulary continuous decoding; SPEECH;
D O I
10.21437/Interspeech.2024-382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a groundbreaking end-to-end (E2E) framework for decoding invasive brain signals, marking a significant advancement in the field of speech neuroprosthesis. Our methodology leverages the comprehensive reasoning abilities of large language models (LLMs) to facilitate direct decoding. By fully integrating LLMs, we achieve results comparable to the state-of-the-art cascade models. Our findings underscore the immense potential of E2E frameworks in speech neuroprosthesis, particularly as the technology behind brain-computer interfaces (BCIs) and the availability of relevant datasets continue to evolve. This work not only showcases the efficacy of combining LLMs with E2E decoding for enhancing speech neuroprosthesis but also sets a new direction for future research in BCI applications, underscoring the impact of LLMs in decoding complex neural signals for communication restoration. Code will be made available at https://github.com/FsFrancis15/BrainLLM.
引用
收藏
页码:1495 / 1499
页数:5
相关论文
共 50 条
  • [1] A Lightweight End-to-End Neural Networks for Decoding of Motor Imagery Brain Signal
    Lee, Hyeon Kyu
    Myoung, Ji-Soo
    Choi, Young-Seok
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 411 - 413
  • [2] JOINT ENDPOINTING AND DECODING WITH END-TO-END MODELS
    Chang, Shuo-Yiin
    Prabhavalkar, Rohit
    He, Yanzhang
    Sainath, Tara N.
    Simko, Gabor
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5626 - 5630
  • [3] TOWARDS END-TO-END SPOKEN LANGUAGE UNDERSTANDING
    Serdyuk, Dmitriy
    Wang, Yongqiang
    Fuegen, Christian
    Kumar, Anuj
    Liu, Baiyang
    Bengio, Yoshua
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5754 - 5758
  • [4] On the Uses of Large Language Models to Design End-to-end Learning Semantic Communication
    Wang, Ying
    Sun, Zhuo
    Fan, Jinpo
    Ma, Hao
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [5] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
    Shao, Hao
    Hu, Yuxuan
    Wang, Letian
    Song, Guanglu
    Waslander, Steven L.
    Liu, Yu
    Li, Hongsheng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15120 - 15130
  • [6] A Streaming End-to-End Framework For Spoken Language Understanding
    Potdar, Nihal
    Avila, Anderson R.
    Xing, Chao
    Wang, Dong
    Cao, Yiran
    Chen, Xiao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3906 - 3914
  • [7] Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model
    Pradeep, Ronak
    Lin, Jimmy
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 78 - 86
  • [8] LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
    Chen, Xi
    Zhang, Songyang
    Bai, Qibing
    Chen, Kai
    Nakamura, Satoshi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6976 - 6987
  • [9] An automatic end-to-end chemical synthesis development platform powered by large language models
    Ruan, Yixiang
    Lu, Chenyin
    Xu, Ning
    He, Yuchen
    Chen, Yixin
    Zhang, Jian
    Xuan, Jun
    Pan, Jianzhang
    Fang, Qun
    Gao, Hanyu
    Shen, Xiaodong
    Ye, Ning
    Zhang, Qiang
    Mo, Yiming
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [10] Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models
    Kanda, Naoyuki
    Lu, Xugang
    Kawai, Hisashi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1023 - 1034