Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

被引：0

作者：

Feng, Sheng ^{[1
]}

Liu, Heyang ^{[1
]}

Wang, Yu ^{[1
,2
]}

Wang, Yanfeng ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China

[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

来源：

INTERSPEECH 2024 | 2024年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

speech neuroprosthesis; end-to-end; brain-computer interface; large-vocabulary continuous decoding; SPEECH;

D O I：

10.21437/Interspeech.2024-382

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we introduce a groundbreaking end-to-end (E2E) framework for decoding invasive brain signals, marking a significant advancement in the field of speech neuroprosthesis. Our methodology leverages the comprehensive reasoning abilities of large language models (LLMs) to facilitate direct decoding. By fully integrating LLMs, we achieve results comparable to the state-of-the-art cascade models. Our findings underscore the immense potential of E2E frameworks in speech neuroprosthesis, particularly as the technology behind brain-computer interfaces (BCIs) and the availability of relevant datasets continue to evolve. This work not only showcases the efficacy of combining LLMs with E2E decoding for enhancing speech neuroprosthesis but also sets a new direction for future research in BCI applications, underscoring the impact of LLMs in decoding complex neural signals for communication restoration. Code will be made available at https://github.com/FsFrancis15/BrainLLM.

引用

页码：1495 / 1499

页数：5

共 50 条

[1] A Lightweight End-to-End Neural Networks for Decoding of Motor Imagery Brain Signal
Lee, Hyeon Kyu
Myoung, Ji-Soo
Choi, Young-Seok
2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 411 - 413
[2] JOINT ENDPOINTING AND DECODING WITH END-TO-END MODELS
Chang, Shuo-Yiin
Prabhavalkar, Rohit
He, Yanzhang
Sainath, Tara N.
Simko, Gabor
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5626 - 5630
[3] TOWARDS END-TO-END SPOKEN LANGUAGE UNDERSTANDING
Serdyuk, Dmitriy
Wang, Yongqiang
Fuegen, Christian
Kumar, Anuj
Liu, Baiyang
Bengio, Yoshua
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5754 - 5758
[4] On the Uses of Large Language Models to Design End-to-end Learning Semantic Communication
Wang, Ying
Sun, Zhuo
Fan, Jinpo
Ma, Hao
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[5] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
Shao, Hao
Hu, Yuxuan
Wang, Letian
Song, Guanglu
Waslander, Steven L.
Liu, Yu
Li, Hongsheng
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15120 - 15130
[6] A Streaming End-to-End Framework For Spoken Language Understanding
Potdar, Nihal
Avila, Anderson R.
Xing, Chao
Wang, Dong
Cao, Yiran
Chen, Xiao
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3906 - 3914
[7] Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model
Pradeep, Ronak
Lin, Jimmy
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 78 - 86
[8] LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
Chen, Xi
Zhang, Songyang
Bai, Qibing
Chen, Kai
Nakamura, Satoshi
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6976 - 6987
[9] An automatic end-to-end chemical synthesis development platform powered by large language models
Ruan, Yixiang
Lu, Chenyin
Xu, Ning
He, Yuchen
Chen, Yixin
Zhang, Jian
Xuan, Jun
Pan, Jianzhang
Fang, Qun
Gao, Hanyu
Shen, Xiaodong
Ye, Ning
Zhang, Qiang
Mo, Yiming
NATURE COMMUNICATIONS, 2024, 15 (01)
[10] Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models
Kanda, Naoyuki
Lu, Xugang
Kawai, Hisashi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1023 - 1034

← 1 2 3 4 5 →