Attribute Prototype-Guided Iterative Scene Graph for Explainable Radiology Report Generation

被引:1
|
作者
Zhang, Ke [1 ]
Yang, Yan [1 ]
Yu, Jun [2 ,3 ]
Fan, Jianping [4 ]
Jiang, Hanliang [5 ]
Huang, Qingming [6 ]
Han, Weidong [5 ,6 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310018, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[4] Lenovo Res, AI Lab, Beijing 100094, Peoples R China
[5] Zhejiang Univ, Sir Run Run Shaw Hosp, Natl Inst Resp Dis, Coll Med,Reg Med Ctr, Hangzhou 310016, Peoples R China
[6] Zhejiang Normal Univ, Coll Math Med, Jinhua 321017, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Prototypes; Biomedical imaging; Lung; Cognition; Iterative methods; Visualization; Radiology report generation; scene graph generation; prototype learning; interpretability;
D O I
10.1109/TMI.2024.3424505
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The potential of automated radiology report generation in alleviating the time-consuming tasks of radiologists is increasingly being recognized in medical practice. Existing report generation methods have evolved from using image-level features to the latest approach of utilizing anatomical regions, significantly enhancing interpretability. However, directly and simplistically using region features for report generation compromises the capability of relation reasoning and overlooks the common attributes potentially shared across regions. To address these limitations, we propose a novel region-based Attribute Prototype-guided Iterative Scene Graph generation framework (AP-ISG) for report generation, utilizing scene graph generation as an auxiliary task to further enhance interpretability and relational reasoning capability. The core components of AP-ISG are the Iterative Scene Graph Generation (ISGG) module and the Attribute Prototype-guided Learning (APL) module. Specifically, ISSG employs an autoregressive scheme for structural edge reasoning and a contextualization mechanism for relational reasoning. APL enhances intra-prototype matching and reduces inter-prototype semantic overlap in the visual space to fully model the potential attribute commonalities among regions. Extensive experiments on the MIMIC-CXR with Chest ImaGenome datasets demonstrate the superiority of AP-ISG across multiple metrics.
引用
收藏
页码:4470 / 4482
页数:13
相关论文
共 44 条
  • [21] Cross-Modal Prototype Driven Network for Radiology Report Generation
    Wang, Jun
    Bhalerao, Abhir
    He, Yulan
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 563 - 579
  • [22] Multi-Prototype Space Learning for Commonsense-Based Scene Graph Generation
    Chen, Lianggangxu
    Song, Youqi
    Cai, Yiqing
    Lu, Jiale
    Li, Yang
    Xie, Yuan
    Wang, Changbo
    He, Gaoqi
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1129 - 1137
  • [23] Intensive vision-guided network for radiology report generation
    Zheng, Fudan
    Li, Mengfei
    Wang, Ying
    Yu, Weijiang
    Wang, Ruixuan
    Chen, Zhiguang
    Xiao, Nong
    Lu, Yutong
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (04):
  • [24] Trust it or not: Confidence-guided automatic radiology report generation
    Wang, Yixin
    Lin, Zihao
    Xu, Zhe
    Dong, Haoyu
    Luo, Jie
    Tian, Jiang
    Shi, Zhongchao
    Huang, Lifu
    Zhang, Yang
    Fan, Jianping
    He, Zhiqiang
    NEUROCOMPUTING, 2024, 578
  • [25] LANDMARK: language-guided representation enhancement framework for scene graph generation
    Chang, Xiaoguang
    Wang, Teng
    Cai, Shaowei
    Sun, Changyin
    APPLIED INTELLIGENCE, 2023, 53 (21) : 26126 - 26138
  • [26] LANDMARK: language-guided representation enhancement framework for scene graph generation
    Xiaoguang Chang
    Teng Wang
    Shaowei Cai
    Changyin Sun
    Applied Intelligence, 2023, 53 : 26126 - 26138
  • [27] Semantic Diversity-Aware Prototype-Based Learning for Unbiased Scene Graph Generation
    Jeon, Jaehyeong
    Kim, Kibum
    Yoon, Kanghoon
    Park, Chanyoung
    COMPUTER VISION - ECCV 2024, PT XXXVII, 2025, 15095 : 379 - 395
  • [28] Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
    Jeon, Jaehyeong
    Kim, Kibum
    Yoon, Kanghoon
    Park, Chanyoung
    arXiv,
  • [29] Lesion Guided Explainable Few Weak-Shot Medical Report Generation
    Sun, Jinghan
    Wei, Dong
    Wang, Liansheng
    Zheng, Yefeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 615 - 625
  • [30] Multi-granularity Semantic Guided Transformer for Radiology Report Generation
    Song, Yu
    Hua, Xiaojin
    Mang, Kunli
    Zan, Hongying
    Lie, Runzhi
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 458 - 471