Frozen Language Model Helps ECG Zero-Shot Learning

被引：0

作者：

Li, Jun ^{[1
]}

Liu, Che ^{[2
,3
]}

Cheng, Sibo ^{[3
]}

Arcucci, Rossella ^{[2
,3
]}

Hong, Shenda ^{[4
,5
]}

机构：

[1] Jilin Univ, Coll Elect Sci & Engn, Changchun, Peoples R China

[2] Imperial Coll London, Dept Earth Sci & Engn, London SW7 2AZ, England

[3] Imperial Coll London, Data Sci Inst, Dept Comp, London, England

[4] Peking Univ, Natl Inst Hlth Data Sci, Beijing, Peoples R China

[5] Peking Univ, Inst Med Technol, Hlth Sci Ctr, Beijing, Peoples R China

来源：

MEDICAL IMAGING WITH DEEP LEARNING, VOL 227 | 2023年 / 227卷

基金：

中国国家自然科学基金;

关键词：

Multimodal self-supervised learning; Zero-shot learning; Language model; ECG; Signal processing; MYOCARDIAL-INFARCTION; SIGNALS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The electrocardiogram (ECG) is one of the most commonly used non-invasive, convenient medical monitoring tools that assist in the clinical diagnosis of heart diseases. Recently, deep learning (DL) techniques, particularly self-supervised learning (SSL), have demonstrated great potential in the classification of ECG. SSL pre-training has achieved competitive performance with only a small amount of annotated data after fine-tuning. However, current SSL methods rely on the availability of annotated data and are unable to predict labels not existing in fine-tuning datasets. To address this challenge, we propose Multimodal ECG-Text Self-supervised pre-training (METS), the first work to utilize the auto-generated clinical reports to guide ECG SSL pre-training. We use a trainable ECG encoder and a frozen language model to embed paired ECG and automatically machine-generated clinical reports separately. The SSL aims to maximize the similarity between paired ECG and auto-generated report while minimize the similarity between ECG and other reports. In downstream classification tasks, METS achieves around 10% improvement in performance without using any annotated data via zero-shot classification, compared to other supervised and SSL baselines that rely on annotated data. Furthermore, METS achieves the highest recall and F1 scores on the MIT-BIH dataset, despite MIT-BIH containing different classes of ECG compared to the pre-trained dataset. The extensive experiments have demonstrated the advantages of using ECG-Text multimodal self-supervised learning in terms of generalizability, effectiveness, and efficiency.

引用

页码：402 / 415

页数：14

共 50 条

[21] Learning to Model Relationships for Zero-Shot Video Classification
Gao, Junyu
Zhang, Tianzhu
Xu, Changsheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3476 - 3491
[22] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
Rahman, Shafin
Khan, Salman
Porikli, Fatih
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
[23] Zero-shot Visual Question Answering with Language Model Feedback
Du, Yifan
Li, Junyi
Tang, Tianyi
Zhao, Wayne Xin
Wen, Ji-Rong
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9268 - 9281
[24] Zero-Shot ECG Diagnosis with Large Language Models and Retrieval-Augmented Generation
Yu, Han
Guo, Peikun
Sano, Akane
MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 650 - 663
[25] Zero-shot Model Diagnosis
Luo, Jinqi
Wang, Zhaoning
Wu, Chen Henry
Huang, Dong
De la Torre, Fernando
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11631 - 11640
[26] Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning
Wang, Ziyang
Gou, Yunhao
Li, Jingjing
Zhu, Lei
Shen, Heng Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1019 - 1030
[27] Thinking Like an Author: A Zero-Shot Learning Approach to Keyphrase Generation with Large Language Model
Wang, Siyu
Dai, Shengran
Jiang, Jianhui
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT III, ECML PKDD 2024, 2024, 14943 : 335 - 350
[28] Learning semantic ambiguities for zero-shot learning
Hanouti, Celina
Le Borgne, Herve
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40745 - 40759
[29] Learning semantic ambiguities for zero-shot learning
Celina Hanouti
Hervé Le Borgne
Multimedia Tools and Applications, 2023, 82 : 40745 - 40759
[30] Practical Aspects of Zero-Shot Learning
Saad, Elie
Paprzycki, Marcin
Ganzha, Maria
COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 88 - 95

← 1 2 3 4 5 →