Token Imbalance Adaptation for Radiology Report Generation

被引:0
|
作者
Wu, Yuexin [1 ]
Huang, I-Chan [2 ]
Huang, Xiaolei [1 ]
机构
[1] Univ Memphis, Memphis, TN 38152 USA
[2] St Jude Childrens Res Hosp, Memphis, TN USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced token distributions naturally exist in text documents, leading neural language models to overfit on frequent tokens. The token imbalance may dampen the robustness of radiology report generators, as complex medical terms appear less frequently but reflect more medical information. In this study, we demonstrate how current state-of-the-art models fail to generate infrequent tokens on two standard benchmark datasets (IU X-RAY and MIMIC-CXR) of radiology report generation. To solve the challenge, we propose the Token Imbalance Adapter (TIMER), aiming to improve generation robustness on infrequent tokens. The model automatically leverages token imbalance by an unlikelihood loss and dynamically optimizes generation processes to augment infrequent tokens. We compare our approach with multiple state-of-the-art methods on the two benchmarks. Experiments demonstrate the effectiveness of our approach in enhancing model robustness overall and infrequent tokens. Our ablation analysis shows that our reinforcement learning method has a major effect in adapting token imbalance for radiology report generation.
引用
收藏
页码:72 / 85
页数:14
相关论文
共 50 条
  • [31] ADAPTATION TO A THREONINE IMBALANCE IN RATS
    CARROLL, C
    ARATA, D
    FEDERATION PROCEEDINGS, 1959, 18 (01) : 520 - 520
  • [32] MKCL: Medical Knowledge with Contrastive Learning model for radiology report generation
    Hou, Xiaodi
    Liu, Zhi
    Li, Xiaobo
    Li, Xingwang
    Sang, Shengtian
    Zhang, Yijia
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 146
  • [33] FgKF: Fine-Grained Knowledge Fusion for Radiology Report Generation
    Zhang, Kunli
    Huang, Xiyang
    Zan, Hongying
    Ma, Yutuan
    Gao, Qianxiang
    Li, Yaoxu
    HEALTH INFORMATION PROCESSING, CHIP 2023, 2023, 1993 : 238 - 253
  • [34] TRINet: Team Role Interaction Network for automatic radiology report generation
    Zheng, Zhiqiang
    Zhang, Yujie
    Liang, Enhe
    Weng, Zhi
    Chai, Jun
    Li, Jianxiu
    Computers in Biology and Medicine, 2024, 183
  • [35] A Pilot Study in Surveying Clinical Judgments to Evaluate Radiology Report Generation
    Boag, William
    Kane, Hassan
    Rawat, Saumya
    Wei, Jesse
    Goehler, Alexander
    PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 458 - 465
  • [36] GOLDMAN ON LEVEL GENERATION AND ACT TOKEN
    DAVIS, S
    MIND, 1977, 86 (342) : 237 - 241
  • [37] Improving radiology report generation with multi-grained abnormality prediction
    Jin, Yuda
    Chen, Weidong
    Tian, Yuanhe
    Song, Yan
    Yan, Chenggang
    NEUROCOMPUTING, 2024, 600
  • [38] Interactive dual-stream contrastive learning for radiology report generation
    Zhang, Ziqi
    Jiang, Ailian
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 157
  • [39] DACG: Dual Attention and Context Guidance model for radiology report generation
    Lang, Wangyu
    Liu, Zhi
    Zhang, Yijia
    MEDICAL IMAGE ANALYSIS, 2025, 99
  • [40] Attention based automated radiology report generation using CNN and LSTM
    Sirshar, Mehreen
    Paracha, Muhammad Faheem Khalil
    Akram, Muhammad Usman
    Alghamdi, Norah Saleh
    Zaidi, Syeda Zainab Yousuf
    Fatima, Tatheer
    PLOS ONE, 2022, 17 (01):