Token Imbalance Adaptation for Radiology Report Generation

Authors
Wu, Yuexin [1]
Huang, I-Chan [2]
Huang, Xiaolei [1]
Affiliations
[1] Univ Memphis, Memphis, TN 38152 USA
[2] St Jude Childrens Res Hosp, Memphis, TN USA
Funding
U.S. National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imbalanced token distributions occur naturally in text documents, leading neural language models to overfit on frequent tokens. This token imbalance may weaken the robustness of radiology report generators, because complex medical terms appear less frequently yet carry more medical information. In this study, we demonstrate how current state-of-the-art models fail to generate infrequent tokens on two standard benchmark datasets for radiology report generation (IU X-RAY and MIMIC-CXR). To address this challenge, we propose the Token Imbalance Adapter (TIMER), which aims to improve generation robustness on infrequent tokens. The model automatically leverages token imbalance through an unlikelihood loss and dynamically optimizes the generation process to augment infrequent tokens. We compare our approach with multiple state-of-the-art methods on the two benchmarks. Experiments demonstrate the effectiveness of our approach in enhancing model robustness both overall and on infrequent tokens. Our ablation analysis shows that our reinforcement learning method has a major effect in adapting to token imbalance for radiology report generation.
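The abstract names an unlikelihood loss but does not give its form. As a rough illustration only, the sketch below shows a generic token-level unlikelihood loss for a single decoding step, in which the most frequent tokens of a corpus are treated as negative candidates. The function names, the `top_k` cutoff, the `alpha` weight, and the choice of negative candidates are all assumptions made for this example; they are not taken from TIMER.

```python
import math
from collections import Counter


def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def frequent_token_ids(corpus, top_k):
    """Return the ids of the top_k most frequent tokens in a tokenized corpus.

    Hypothetical helper: using the most frequent tokens as negative
    candidates is an assumption of this sketch.
    """
    counts = Counter(tok for report in corpus for tok in report)
    return {tok for tok, _ in counts.most_common(top_k)}


def unlikelihood_step_loss(logits, target_id, negative_ids, alpha=1.0):
    """Likelihood term plus an unlikelihood penalty for one decoding step.

    loss = -log p(target) - alpha * sum_c log(1 - p(c)),
    where c ranges over negative candidates other than the target.
    """
    probs = softmax(logits)
    eps = 1e-12  # guards against log(0)
    mle = -math.log(probs[target_id] + eps)
    penalty = -sum(
        math.log(1.0 - probs[c] + eps)
        for c in negative_ids
        if c != target_id
    )
    return mle + alpha * penalty


# Toy usage: token 0 is frequent, token 1 is the rarer gold token.
corpus = [[0, 0, 1], [0, 2, 1]]
negatives = frequent_token_ids(corpus, top_k=1)
loss = unlikelihood_step_loss([2.0, 0.0, 0.0], target_id=1, negative_ids=negatives)
```

In this toy setup the penalty term grows as the model places more probability on the frequent token 0 when the gold token is the rarer token 1, which is the general mechanism by which an unlikelihood term can discourage over-generation of frequent tokens.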
Pages: 72-85
Number of pages: 14