Vision-knowledge fusion model for multi-domain medical report generation

Cited by: 12
Authors
Xu, Dexuan [1,2]
Zhu, Huashi [1,2]
Huang, Yu [1]
Jin, Zhi [3]
Ding, Weiping [4]
Li, Hang [5,6]
Ran, Menglong [5,6]
Affiliations
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[3] Peking Univ, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[5] Peking Univ, Dept Dermatol, Hosp 1, Beijing 100034, Peoples R China
[6] Natl Clin Res Ctr Skin & Immune Dis, Beijing 100034, Peoples R China
Keywords
Medical report generation; Knowledge graph; Multi-modal fusion; Graph neural network
DOI
10.1016/j.inffus.2023.101817
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Medical report generation with knowledge graphs is an essential task in the medical field. Although existing knowledge graphs contain many entities, their semantics are insufficient because it is difficult to uniformly extract and fuse expert knowledge from different diseases. It is therefore necessary to construct domain-specific knowledge graphs automatically. In this paper, we propose a vision-knowledge fusion model based on medical images and knowledge graphs that fully utilizes high-quality data from different diseases and languages. First, we give a general method for automatically constructing a knowledge graph for every domain based on medical standards. Second, we design a knowledge-based attention mechanism to effectively fuse image and knowledge. Third, we build a triples restoration module to obtain fine-grained knowledge, and we are the first to propose knowledge-based evaluation metrics, which are more reasonable and measurable across different dimensions. Finally, we conduct experiments to verify the effectiveness of our model on datasets covering two different diseases: the public IU-Xray chest radiograph dataset and NCRC-DS, a dataset of Chinese dermoscopy reports that we compiled. Our model outperforms previous benchmark methods and achieves excellent evaluation scores on both datasets. Additionally, the interpretability and clinical usefulness of the model are validated, and our method can be generalized to multiple domains and different diseases.
Pages: 12
Related papers
50 in total
  • [21] AI and ML in the Multi-Domain Operations Era: Vision and Pitfalls
    Baker, Michael A.
    Al-Khalifa, Khaled A.
    Harlas, Ioannis N.
    King, Marvin L.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS II, 2020, 11413
  • [22] A MULTI-DOMAIN KNOWLEDGE TRANSFER METHOD FOR CONCEPTUAL DESIGN COMBINE WITH FBS AND KNOWLEDGE GRAPH
    Lai, Bing
    Zhao, Wu
    Yu, Zeyuan
    Guo, Xin
    Zhang, Kai
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2, 2022,
  • [23] An Object-oriented Modular Design Model Supported by Integrated Multi-domain Knowledge
    Li, X.
    Huang, Y. Q.
    ADVANCES IN MATERIALS MANUFACTURING SCIENCE AND TECHNOLOGY XIV, 2012, 697-698 : 785 - +
  • [24] Construction of Knowledge Graph of Maintainability Design Based on Multi-domain Fusion of High-speed Trains
    Guo, Heng
    Li, Rong
    Zhang, Haizhu
    Wei, Yongjie
    Dai, Yuebin
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (24): 3015 - 3023
  • [25] MEDEVAL: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
    He, Zexue
    Wang, Yu
    Yan, An
    Liu, Yao
    Chan, Eric Y.
    Gentili, Amilcare
    McAuley, Julian
    Hsu, Chun-Nan
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 8725 - 8744
  • [26] Multi-Domain Simulation Model of a Wheel Loader
    Saha, Rohit
    Hwang, Long-Kung
    Kumar, Mahesh Madurai
    Zhao, Yunfeng
    Yu, Chen
    Ransijn, Bob
    SAE INTERNATIONAL JOURNAL OF COMMERCIAL VEHICLES, 2016, 9 (02) : 252 - 259
  • [27] ReMeDi: Resources for Multi-domain, Multi-service, Medical Dialogues
    Yan, Guojun
    Pei, Jiahuan
    Ren, Pengjie
    Ren, Zhaochun
    Xin, Xin
    Liang, Huasheng
    de Rijke, Maarten
    Chen, Zhumin
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3013 - 3024
  • [28] A TRACED ROLES MODEL FOR MULTI-DOMAIN AUTHORIZATION
    Benjumea, Andres
    Agudo, Isaac
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2009, 1 (04): 55 - 64
  • [29] MULTI-DOMAIN TLM MODEL FOR INTRAVASCULAR ULTRASOUND
    Borji, Rafik
    Franchek, Matthew A.
    PROCEEDINGS OF THE ASME DYNAMIC SYSTEMS AND CONTROL CONFERENCE 2009, PTS A AND B, 2010, : 697 - 704
  • [30] Automatic Support for Multi-Domain Model Management
    Torres, Weslley
    van den Brand, Mark G. J.
    Serebrenik, Alexander
    2020 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2020), 2020, : 830 - 833