Attribute Prototype-Guided Iterative Scene Graph for Explainable Radiology Report Generation

被引:1
|
作者
Zhang, Ke [1 ]
Yang, Yan [1 ]
Yu, Jun [2 ,3 ]
Fan, Jianping [4 ]
Jiang, Hanliang [5 ]
Huang, Qingming [6 ]
Han, Weidong [5 ,6 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310018, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[4] Lenovo Res, AI Lab, Beijing 100094, Peoples R China
[5] Zhejiang Univ, Sir Run Run Shaw Hosp, Natl Inst Resp Dis, Coll Med,Reg Med Ctr, Hangzhou 310016, Peoples R China
[6] Zhejiang Normal Univ, Coll Math Med, Jinhua 321017, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Prototypes; Biomedical imaging; Lung; Cognition; Iterative methods; Visualization; Radiology report generation; scene graph generation; prototype learning; interpretability;
D O I
10.1109/TMI.2024.3424505
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The potential of automated radiology report generation in alleviating the time-consuming tasks of radiologists is increasingly being recognized in medical practice. Existing report generation methods have evolved from using image-level features to the latest approach of utilizing anatomical regions, significantly enhancing interpretability. However, directly and simplistically using region features for report generation compromises the capability of relation reasoning and overlooks the common attributes potentially shared across regions. To address these limitations, we propose a novel region-based Attribute Prototype-guided Iterative Scene Graph generation framework (AP-ISG) for report generation, utilizing scene graph generation as an auxiliary task to further enhance interpretability and relational reasoning capability. The core components of AP-ISG are the Iterative Scene Graph Generation (ISGG) module and the Attribute Prototype-guided Learning (APL) module. Specifically, ISSG employs an autoregressive scheme for structural edge reasoning and a contextualization mechanism for relational reasoning. APL enhances intra-prototype matching and reduces inter-prototype semantic overlap in the visual space to fully model the potential attribute commonalities among regions. Extensive experiments on the MIMIC-CXR with Chest ImaGenome datasets demonstrate the superiority of AP-ISG across multiple metrics.
引用
收藏
页码:4470 / 4482
页数:13
相关论文
共 44 条
  • [31] Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation
    Wang, Wenqing
    Gao, Kaifeng
    Luo, Yawei
    Jiang, Tao
    Gao, Fei
    Shao, Jian
    Sun, Jianwen
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5153 - 5163
  • [32] Iterative Learning with Extra and Inner Knowledge for Long-tail Dynamic Scene Graph Generation
    Li, Yiming
    Yang, Xiaoshan
    Xu, Changsheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4707 - 4715
  • [33] ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning
    Hou, Wenjun
    Xu, Kaishuai
    Cheng, Yi
    Li, Wenjie
    Liu, Jiang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 8108 - 8122
  • [34] SGG-MVAR: Cross-Modal Retrieval With Scene Graph Generation and Multiview Attribute Relationship Guidance
    Wang, Suping
    Zhou, Fei
    Yang, Ming
    Shi, Lei
    Tan, Chaohong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
  • [35] Memory-aligned Knowledge Graph for Clinically Accurate Radiology Image Report Generation
    Yan, Sixing
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 116 - 122
  • [36] Eye Gaze Guided Cross-Modal Alignment Network for Radiology Report Generation
    Peng, Peixi
    Fan, Wanshu
    Shen, Yue
    Liu, Wenfei
    Yang, Xin
    Zhang, Qiang
    Wei, Xiaopeng
    Zhou, Dongsheng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (12) : 7406 - 7419
  • [37] KGVL-BART: Knowledge Graph Augmented Visual Language BART for Radiology Report Generation
    Kale, Kaveri
    Bhattacharyya, Pushpak
    Gune, Milind
    Shetty, Aditya
    Lawyer, Rustom
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3401 - 3411
  • [38] Knowledge Graph-Enhanced Vision-to-Language Multimodal Models for Radiology Report Generation
    Mou, Yongli
    SEMANTIC WEB: ESWC 2024 SATELLITE EVENTS, PT II, 2025, 15345 : 115 - 124
  • [39] Dynamic Interactive Relation Capturing via Scene Graph Learning for Robotic Surgical Report Generation
    Wang, Hongqiu
    Jin, Yueming
    Zhu, Lei
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2702 - 2709
  • [40] GMoD: Graph-Driven Momentum Distillation Framework with Active Perception of Disease Severity for Radiology Report Generation
    Xiang, ZhiPeng
    Cui, ShaoGuo
    Shang, CaoZhi
    Jiang, Jingfeng
    Zhang, Liqiang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 295 - 305