PneumoLLM: Harnessing the power of large language model for pneumoconiosis diagnosis

被引:3
|
作者
Song, Meiyue [1 ,2 ]
Wang, Jiarui [3 ]
Yu, Zhihua [4 ]
Wang, Jiaxin [6 ]
Yang, Le [5 ]
Lu, Yuting [3 ]
Li, Baicun [7 ]
Wang, Xue [8 ,9 ]
Wang, Xiaoxu [3 ]
Huang, Qinghua [10 ]
Li, Zhijun [11 ,12 ]
Kanellakis, Nikolaos I. [13 ,14 ,15 ]
Liu, Jiangfeng [1 ,16 ,17 ]
Wang, Jing [1 ,2 ]
Wang, Binglu [3 ]
Yang, Juntao [1 ,16 ,17 ]
机构
[1] Peking Union Med Coll, Chinese Acad Med Sci, Sch Basic Med, Inst Basic Med Sci, Beijing 100005, Peoples R China
[2] State Key Lab Resp Hlth & Multimorbid, Beijing 100005, Peoples R China
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
[4] Jinneng Holding Coal Ind Grp Co Ltd, Occupat Dis Precaut Clin, Datong 037001, Shanxi, Peoples R China
[5] Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
[6] Changan Univ, Sch Elect & Control Engn, Xian 710064, Shaanxi, Peoples R China
[7] Chinese Acad Med Sci, China Japan Friendship Hosp, Inst Resp Med,Natl Clin Res Ctr Resp Dis, Ctr Resp Med,Natl Ctr Resp Med, Beijing 100020, Peoples R China
[8] Harbin Med Univ, Dept Resp, Affiliated Hosp 2, Harbin 150086, Heilongjiang, Peoples R China
[9] Harbin Med Univ, Internal Med, Harbin 150081, Heilongjiang, Peoples R China
[10] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[11] Shanghai YangZhi Rehabil Hosp, Translat Res Ctr, Shanghai Sunshine Rehabil Ctr, Shanghai 201619, Peoples R China
[12] Tongji Univ, Sch Mech Engn, Shanghai 201804, Peoples R China
[13] Univ Oxford, CAMS,Oxford Inst, Nuffield Dept Med, Lab Pleural & Lung Canc Translat Res, Oxford, England
[14] Oxford Univ Hosp NHS Fdn Trust, Churchill Hosp, Oxford Ctr Resp Med, Oxford, England
[15] Univ Oxford, Natl Inst Hlth Res Oxford Biomed Res Ctr, Oxford, England
[16] Chinese Acad Med Sci & Peking Union Med Coll, Plast Surg Hosp, Beijing 100144, Peoples R China
[17] State Key Lab Common Mech Res Major Dis, Beijing 100005, Peoples R China
关键词
Large language model; Medical image diagnosis; Foundational model;
D O I
10.1016/j.media.2024.103248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The conventional pretraining-and-finetuning paradigm, while effective for common diseases with ample data, faces challenges in diagnosing data-scarce occupational diseases like pneumoconiosis. Recently, large language models (LLMs) have exhibits unprecedented ability when conducting multiple tasks in dialogue, bringing opportunities to diagnosis. A common strategy might involve using adapter layers for vision- language alignment and diagnosis in a dialogic manner. Yet, this approach often requires optimization of extensive learnable parameters in the text branch and the dialogue head, potentially diminishing the LLMs' efficacy, especially with limited training data. In our work, we innovate by eliminating the text branch and substituting the dialogue head with a classification head. This approach presents a more effective method for harnessing LLMs in diagnosis with fewer learnable parameters. Furthermore, to balance the retention of detailed image information with progression towards accurate diagnosis, we introduce the contextual multitoken engine. This engine is specialized in adaptively generating diagnostic tokens. Additionally, we propose the information emitter module, which unidirectionally emits information from image tokens to diagnosis tokens. Comprehensive experiments validate the superiority of our methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Harnessing the Power of Large Language Models
    Hofmann, Meike
    Burch, Gerald F.
    Burch, Jana J.
    ISACA Journal, 2024, 1 : 32 - 39
  • [2] Editorial: Harnessing the Power of Large Language Model-Based Chatbots for Scientific Discovery
    Merz, Kenneth M.
    Wei, Guo-Wei
    Zhu, Feng
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (17) : 5395 - 5395
  • [3] Harnessing the Power of Large Language Models in Agricultural Safety & Health
    Shutske, John M.
    JOURNAL OF AGRICULTURAL SAFETY AND HEALTH, 2023, 29 (04): : 205 - 224
  • [4] Omega - harnessing the power of large language models for bioimage analysis
    Royer, Loic A.
    NATURE METHODS, 2024, 21 (08) : 1371 - 1373
  • [5] Harnessing the Power of Large Language Models for Automated Code Generation and Verification
    Antero, Unai
    Blanco, Francisco
    Onativia, Jon
    Salle, Damien
    Sierra, Basilio
    ROBOTICS, 2024, 13 (09)
  • [6] HARNESSING THE POWER OF INTERPRETIVE LANGUAGE
    SHAWVER, L
    PSYCHOTHERAPY-THEORY RESEARCH AND PRACTICE, 1983, 20 (01): : 3 - 11
  • [7] Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation
    Yang, Yuan
    Xiong, Siheng
    Payani, Ali
    Shareghi, Ehsan
    Fekri, Faramarz
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 6942 - 6959
  • [8] MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling
    Kim, Taewan
    Bae, Seolyeong
    Kim, Hyun Ah
    Lee, Su-woo
    Hong, Hwajung
    Yang, Chanmo
    Kim, Young-Ho
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [9] MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling
    KAIST, Korea, Republic of
    不详
    不详
    不详
    不详
    不详
    Conf Hum Fact Comput Syst Proc,
  • [10] Harnessing the Power of Gaming in Language Education
    Liontas, John I.
    IRANIAN JOURNAL OF LANGUAGE TEACHING RESEARCH, 2022, 10 (02) : 1 - 16