Improving radiology report generation with multi-grained abnormality prediction

Cited by: 2
Authors
Jin, Yuda [1]
Chen, Weidong [3]
Tian, Yuanhe [4]
Song, Yan [3]
Yan, Chenggang [2]
Affiliations
[1] Hangzhou Dianzi Univ, HDU ITMO Joint Inst, Hangzhou, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou, Peoples R China
[3] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[4] Univ Washington, Dept Linguist, Seattle, WA USA
Funding
National Natural Science Foundation of China;
Keywords
Radiology report generation; Multi-grained abnormality prediction; Reinforcement learning;
DOI
10.1016/j.neucom.2024.128122
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Conventional data-driven approaches to radiology report generation face the problem that, in real data, descriptions of normal regions far outnumber those of abnormal ones, which can bias models away from abnormalities. Previous work has shown promising results largely because most report content describes findings within the normal range; such output, although fluent, tends to favor the evaluation metrics rather than provide useful hints for human judgment and model learning. To this end, we propose to explicitly predict abnormalities for radiology report generation, following a multitask learning scheme that drives the model to pay more attention to abnormal regions using multi-grained information, i.e., abnormalities at different granularities, so as to tackle the aforementioned limitation. Specifically, we propose a disease detector (DD) to identify coarse-grained abnormalities and a medical concept detector (MCD) to associate the predicted diseases with predefined fine-grained pathological concepts. To integrate the information from the proposed abnormality prediction, we design a dual-stream adaptive decoder that takes this information into account, with a gate unit that controls the integration at each generation step. Extensive experimental results on two widely used benchmark datasets indicate that our method achieves 29.8% and 34.2% performance improvements over the baseline on natural language generation (NLG) metrics and clinical efficacy (CE) metrics, respectively, demonstrating the superiority of the proposed approach.
Pages: 10