Vision-knowledge fusion model for multi-domain medical report generation

被引:12
|
作者
Xu, Dexuan [1 ,2 ]
Zhu, Huashi [1 ,2 ]
Huang, Yu [1 ]
Jin, Zhi [3 ]
Ding, Weiping [4 ]
Li, Hang [5 ,6 ]
Ran, Menglong [5 ,6 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[3] Peking Univ, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[5] Peking Univ, Dept Dermatol, Hosp 1, Beijing 100034, Peoples R China
[6] Natl Clin Res Ctr Skin & Immune Dis, Beijing 100034, Peoples R China
关键词
Medical report generation; Knowledge graph; Multi-modal fusion; Graph neural network;
D O I
10.1016/j.inffus.2023.101817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical report generation with knowledge graph is an essential task in the medical field. Although the existing knowledge graphs have many entities, their semantics are not sufficient due to the challenge of uniformly extracting and fusing the expert knowledge from different diseases. Therefore, it is necessary to automatically construct specific knowledge graph. In this paper, we propose a vision-knowledge fusion model based on medical images and knowledge graphs to fully utilize high-quality data from different diseases and languages. Firstly, we give a general method to automatically construct every domain knowledge graph based on medical standards. Secondly, we design a knowledge-based attention mechanism to effectively fuse image and knowledge. Then, we build a triples restoration module to obtain fine-grained knowledge, and the knowledge-based evaluation metrics are first proposed which are more reasonable and measurable from different dimensions. Finally, we conduct experiments to verify the effectiveness of our model on two different diseases datasets: the IU-Xray chest radiograph public dataset and the NCRC-DS dataset of Chinese dermoscopy reports we compiled. Our model outperforms previous benchmark methods and achieves excellent evaluation scores on both datasets. Additionally, interpretability and clinical usefulness of the model are validated and our method can be generalized to multiple domains and different diseases.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Collaborative Filtering Recommendation Based on Multi-domain Semantic Fusion
    Li, Xiang
    He, Jingsha
    Zhu, Nafei
    Hou, Ziqiang
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 255 - 261
  • [32] Relative rates of gene fusion and fission in multi-domain proteins
    Kummerfeld, SK
    Teichmann, SA
    TRENDS IN GENETICS, 2005, 21 (01) : 25 - 30
  • [33] Multi-domain Neural Network Language Model
    Alumae, Tanel
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2181 - 2185
  • [34] Fusion of Multi-domain EEG Signatures Improves Emotion Recognition
    Wang, Xiaomin
    Pei, Yu
    Luo, Zhiguo
    Zhao, Shaokai
    Xie, Liang
    Yan, Ye
    Yin, Erwei
    Liu, Shuang
    Ming, Dong
    JOURNAL OF INTEGRATIVE NEUROSCIENCE, 2024, 23 (01)
  • [35] Advances in Knowledge Fusion Research in Medical Domain
    Peng, Lin
    Song, Jun
    Xiong, Lingzhu
    Du, Jianqiang
    Ye, Qing
    Liu, Andong
    Computer Engineering and Applications, 60 (09): : 48 - 64
  • [36] A uniform human knowledge interface to the multi-domain knowledge bases in the National Knowledge Infrastructure
    Feng, QG
    Cao, CN
    Si, JX
    Zheng, YF
    APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS X, 2003, : 163 - 176
  • [37] MLNet: A Multi-Domain Lightweight Network for Multi-Focus Image Fusion
    Nie, Xixi
    Hu, Bo
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5565 - 5579
  • [38] Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system
    Wootton, Craig
    McTear, Michael
    Anderson, Terry
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 693 - 696
  • [39] Domain-Aware Contrastive Knowledge Transfer for Multi-domain Imbalanced Data
    Ke, Zixuan
    Kachuee, Mohammad
    Lee, Sungjin
    PROCEEDINGS OF THE 12TH WORKSHOP ON COMPUTATIONAL APPROACHES TO SUBJECTIVITY, SENTIMENT & SOCIAL MEDIA ANALYSIS, 2022, : 25 - 36
  • [40] A Multi-Domain Self-Report Measure of Coparenting
    Feinberg, Mark E.
    Brown, Louis D.
    Kan, Marni L.
    PARENTING-SCIENCE AND PRACTICE, 2012, 12 (01): : 1 - 21