Vision-knowledge fusion model for multi-domain medical report generation

被引:12
|
作者
Xu, Dexuan [1 ,2 ]
Zhu, Huashi [1 ,2 ]
Huang, Yu [1 ]
Jin, Zhi [3 ]
Ding, Weiping [4 ]
Li, Hang [5 ,6 ]
Ran, Menglong [5 ,6 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[3] Peking Univ, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[5] Peking Univ, Dept Dermatol, Hosp 1, Beijing 100034, Peoples R China
[6] Natl Clin Res Ctr Skin & Immune Dis, Beijing 100034, Peoples R China
关键词
Medical report generation; Knowledge graph; Multi-modal fusion; Graph neural network;
D O I
10.1016/j.inffus.2023.101817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical report generation with knowledge graph is an essential task in the medical field. Although the existing knowledge graphs have many entities, their semantics are not sufficient due to the challenge of uniformly extracting and fusing the expert knowledge from different diseases. Therefore, it is necessary to automatically construct specific knowledge graph. In this paper, we propose a vision-knowledge fusion model based on medical images and knowledge graphs to fully utilize high-quality data from different diseases and languages. Firstly, we give a general method to automatically construct every domain knowledge graph based on medical standards. Secondly, we design a knowledge-based attention mechanism to effectively fuse image and knowledge. Then, we build a triples restoration module to obtain fine-grained knowledge, and the knowledge-based evaluation metrics are first proposed which are more reasonable and measurable from different dimensions. Finally, we conduct experiments to verify the effectiveness of our model on two different diseases datasets: the IU-Xray chest radiograph public dataset and the NCRC-DS dataset of Chinese dermoscopy reports we compiled. Our model outperforms previous benchmark methods and achieves excellent evaluation scores on both datasets. Additionally, interpretability and clinical usefulness of the model are validated and our method can be generalized to multiple domains and different diseases.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Linking semantic and knowledge representations in a multi-domain dialogue system
    Dzikovska, Myroslava O.
    Allen, James F.
    Swift, Mary D.
    JOURNAL OF LOGIC AND COMPUTATION, 2008, 18 (03) : 405 - 430
  • [42] Multi-domain Knowledge Sharing and Reuse in Home Intelligent Space
    Zhang Y.
    Tian G.
    Zhang S.
    Li C.
    Jiqiren/Robot, 2019, 41 (04): : 507 - 518
  • [43] A prompt-driven framework for multi-domain knowledge tracing
    Liu, Zitao
    Huang, Shuyan
    Guo, Teng
    Hou, Mingliang
    Liang, Qianru
    MACHINE LEARNING, 2025, 114 (04)
  • [44] Multi-domain medical image translation generation for lung image classification based on generative adversarial networks
    Chen, Yunfeng
    Lin, Yalan
    Xu, Xiaodie
    Ding, Jinzhen
    Li, Chuzhao
    Zeng, Yiming
    Xie, Weifang
    Huang, Jianlong
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 229
  • [45] A multi-perspective multi-domain model of self-concept: structure and sources of self concept knowledge
    Cheung, PC
    Lau, S
    ASIAN JOURNAL OF SOCIAL PSYCHOLOGY, 2001, 4 (01) : 1 - 21
  • [46] Distributed, Multi-Domain Option Generation Across Legacy Planners
    Schneider, M. K.
    Barbulescu, L.
    Baffle-Rafferty, L.
    Smith, L.
    Cook, M.
    Kapler, T.
    Loppie, M.
    Pelletier, E.
    Rubinstein, Z.
    Smith, S.
    Miller, M.
    HandUbera, J.
    Richard, C.
    Kuperman, G.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [47] Parallel Interactive Networks for Multi-Domain Dialogue State Generation
    Chen, Junfan
    Zhang, Richong
    Mao, Yongyi
    Xu, Jie
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1921 - 1931
  • [48] Extensible Multi-domain Generation of Virtual Worlds using Blackboards
    Deglorie, Gaetan
    Goossens, Rian
    Van Hoecke, Sofie
    Lambert, Peter
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 1, 2017, : 82 - 92
  • [49] Federated Multi-domain GNN Network for Brain Multigraph Generation
    Xu, Chun
    Rekik, Islem
    PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2023, 2023, 14277 : 194 - 205
  • [50] Development and application of a multi-domain dynamic model for direct steam generation solar power plant
    Rousset, A.
    Baviere, R.
    Vuillerme, V.
    IFAC PAPERSONLINE, 2018, 51 (02): : 777 - 782