Vision-knowledge fusion model for multi-domain medical report generation

被引:12
|
作者
Xu, Dexuan [1 ,2 ]
Zhu, Huashi [1 ,2 ]
Huang, Yu [1 ]
Jin, Zhi [3 ]
Ding, Weiping [4 ]
Li, Hang [5 ,6 ]
Ran, Menglong [5 ,6 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[3] Peking Univ, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[5] Peking Univ, Dept Dermatol, Hosp 1, Beijing 100034, Peoples R China
[6] Natl Clin Res Ctr Skin & Immune Dis, Beijing 100034, Peoples R China
关键词
Medical report generation; Knowledge graph; Multi-modal fusion; Graph neural network;
D O I
10.1016/j.inffus.2023.101817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical report generation with knowledge graph is an essential task in the medical field. Although the existing knowledge graphs have many entities, their semantics are not sufficient due to the challenge of uniformly extracting and fusing the expert knowledge from different diseases. Therefore, it is necessary to automatically construct specific knowledge graph. In this paper, we propose a vision-knowledge fusion model based on medical images and knowledge graphs to fully utilize high-quality data from different diseases and languages. Firstly, we give a general method to automatically construct every domain knowledge graph based on medical standards. Secondly, we design a knowledge-based attention mechanism to effectively fuse image and knowledge. Then, we build a triples restoration module to obtain fine-grained knowledge, and the knowledge-based evaluation metrics are first proposed which are more reasonable and measurable from different dimensions. Finally, we conduct experiments to verify the effectiveness of our model on two different diseases datasets: the IU-Xray chest radiograph public dataset and the NCRC-DS dataset of Chinese dermoscopy reports we compiled. Our model outperforms previous benchmark methods and achieves excellent evaluation scores on both datasets. Additionally, interpretability and clinical usefulness of the model are validated and our method can be generalized to multiple domains and different diseases.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Continually Tuning a Large Language Model for Multi-domain Radiology Report Generation
    Sun, Yihua
    Khor, Hee Guan
    Wang, Yuanzheng
    Wang, Zhuhao
    Zhao, Hongliang
    Zhang, Yu
    Ma, Longfei
    Zheng, Zhuozhao
    Liao, Hongen
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 177 - 187
  • [2] A FEATURE DICTIONARY SUPPORTING A MULTI-DOMAIN MEDICAL KNOWLEDGE BASE
    NAEYMIRAD, F
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 1989, 30 (2-3) : 217 - 228
  • [3] A vision-language model with multi-granular knowledge fusion in medical imaging
    Chen, Kai
    Li, Yunxin
    Zhu, Xiwen
    Zhang, Wentai
    Hu, Baotian
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2025, 28 (01):
  • [4] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
    Du, Siyi
    Bayasi, Nourhan
    Hamarneh, Ghassan
    Garbi, Rafeef
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 448 - 458
  • [5] Generation of nearshore bars by multi-domain hybrid numerical model
    Lee, CE
    Kim, MH
    Edge, BL
    JOURNAL OF COASTAL RESEARCH, 1999, 15 (04) : 892 - 901
  • [6] Multi-domain fusion for cargo UAV fault diagnosis knowledge graph construction
    Xiao A.
    Yan W.
    Zhang X.
    Liu Y.
    Zhang H.
    Liu Q.
    Auton. Intell. Syst., 1 (1):
  • [7] A Multi-Domain Platform For Medical Imaging
    Viana-Ferreira, Carlos
    Costa, Carlos
    Oliveira, Jose Luis
    2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2013, : 89 - 94
  • [8] Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer
    Wu, Zhiyuan
    Jiang, Yu
    Zhao, Minghao
    Cui, Chupeng
    Yang, Zongmin
    Xue, Xinhui
    Qi, Hong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 553 - 565
  • [9] Fault diagnosis of complex systems based on multi-sensor and multi-domain knowledge information fusion
    Yang, Yong-Min
    Ge, Zhe-Xue
    Xu, Yong-Cheng
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1065 - 1069
  • [10] Neural Paraphrase Generation with Multi-domain Corpus
    Qiao, Lin
    Li, Yida
    Zhong, ChenLi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 54 - 66