Aligning Medical Images with General Knowledge from Large Language Models

被引:0
|
作者
Fang, Xiao [1 ]
Lin, Yi [1 ]
Zhang, Dong [2 ]
Cheng, Kwang-Ting [2 ]
Chen, Hao [1 ,3 ,4 ]
机构
[1] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[3] HKUST, Dept Chem & Biol Engn, Hong Kong, Peoples R China
[4] HKUST Shenzhen Hong Kong Collaborat Innovat Res I, Shenzhen, Peoples R China
关键词
Prompt Learning; Vision-Language Models; Large Language Model; Medical Image Analysis;
D O I
10.1007/978-3-031-72117-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained large vision-language models (VLMs) like CLIP have revolutionized visual representation learning using natural language as supervisions, and demonstrated promising generalization ability. In this work, we propose ViP, a novel visual symptom-guided prompt learning framework for medical image analysis, which facilitates general knowledge transfer from CLIP. ViP consists of two key components: a visual symptom generator (VSG) and a dual-prompt network. Specifically, VSG aims to extract explicable visual symptoms from pre-trained large language models, while the dual-prompt network utilizes these visual symptoms to guide the training on two learnable prompt modules, i.e., context prompt and merge prompt, which effectively adapts our framework to medical image analysis via large VLMs. Extensive experimental results demonstrate that ViP can outperform state-of-the-art methods on two challenging datasets. The code is available at https://github.com/xiaofang007/ViP.
引用
收藏
页码:57 / 67
页数:11
相关论文
共 50 条
  • [1] Poisoning medical knowledge using large language models
    Yang, Junwei
    Xu, Hanwen
    Mirzoyan, Srbuhi
    Chen, Tong
    Liu, Zixuan
    Liu, Zequn
    Ju, Wei
    Liu, Luchen
    Xiao, Zhiping
    Zhang, Ming
    Wang, Sheng
    NATURE MACHINE INTELLIGENCE, 2024, 6 (10) : 1156 - 1168
  • [2] Aligning Large Language Models for Controllable Recommendations
    Lu, Wensheng
    Lian, Jianxun
    Zhang, Wei
    Li, Guanghua
    Zhou, Mingyang
    Liao, Hao
    Xie, Xing
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 8159 - 8172
  • [3] Vision-Language Models in medical image analysis: From simple fusion to general large models
    Li, Xiang
    Li, Like
    Jiang, Yuchen
    Wang, Hao
    Qiao, Xinyu
    Feng, Ting
    Luo, Hao
    Zhao, Yong
    INFORMATION FUSION, 2025, 118
  • [4] Aligning Large Language Models through Synthetic Feedback
    Kim, Sungdong
    Bae, Sanghwan
    Shin, Jamin
    Kang, Soyoung
    Kwak, Donghyun
    Yoo, Kang Min
    Seo, Minjoon
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13677 - 13700
  • [5] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
    Venkatasubramanian, Venkat
    Chakraborty, Arijit
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
  • [6] Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
    West, Peter
    Bhagavatula, Chandra
    Hessel, Jack
    Hwang, Jena D.
    Jiang, Liwei
    Le Bras, Ronan
    Lu, Ximing
    Welleck, Sean
    Choi, Yejin
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4602 - 4625
  • [7] From general to specific: Tailoring large language models for real-world medical communications
    Sun, Xinti
    Tang, Wenjun
    Huang, Zigeng
    Long, Erping
    Wan, Peixing
    CLINICAL AND TRANSLATIONAL MEDICINE, 2025, 15 (01):
  • [8] A medical question answering system using large language models and knowledge graphs
    Guo, Quan
    Cao, Shuai
    Yi, Zhang
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8548 - 8564
  • [9] Distilling Script Knowledge from Large Language Models for Constrained Language Planning
    Yuan, Siyu
    Chen, Jiangjie
    Fu, Ziquan
    Ge, Xuyang
    Shah, Soham
    Jankowski, Charles Robert
    Xiao, Yanghua
    Yang, Deqing
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4303 - 4325
  • [10] From Static to Dynamic: Knowledge Metabolism for Large Language Models
    Du, Mingzhe
    Luu, Anh Tuan
    Ji, Bin
    Ng, See-Kiong
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23784 - 23786