Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

被引:0
|
作者
Li, Wenyan [1 ]
Li, Jiaang [1 ]
Ramose, Rita [2 ]
Tang, Raphael [3 ]
Elliott, Desmond [1 ]
机构
[1] Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark
[2] Univ Lisbon, Inst Super Tecn, NESC ID, Lisbon, Portugal
[3] Comcast Appl AI, Philadelphia, PA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in retrieval-augmented models for image captioning highlight the benefit of retrieving related captions for efficient, lightweight models with strong domain-transfer capabilities. While these models demonstrate the success of retrieval augmentation, retrieval models are still far from perfect in practice: the retrieved information can sometimes mislead the model, resulting in incorrect generation and worse performance. In this paper, we analyze the robustness of a retrieval-augmented captioning model SMALLCAP. Our analysis shows that the model is sensitive to tokens that appear in the majority of the retrieved captions, and the input attribution shows that those tokens are likely copied into the generated output. Given these findings, we propose to train the model by sampling retrieved captions from more diverse sets. This decreases the chance that the model learns to copy majority tokens, and improves both in-domain and cross-domain performance.
引用
收藏
页码:9285 / 9299
页数:15
相关论文
共 50 条
  • [41] Towards an FA ChatBot with Retrieval-augmented Language Modeling
    Fichtenkamm, Maik
    Kofler, Markus
    Schekotihin, Konstantin
    Burmer, Christian
    2024 IEEE INTERNATIONAL SYMPOSIUM ON THE PHYSICAL AND FAILURE ANALYSIS OF INTEGRATED CIRCUITS, IPFA 2024, 2024,
  • [42] LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
    Yang, Kaiyu
    Swope, Aidan M.
    Gu, Alex
    Chalamala, Rahul
    Song, Peiyang
    Yu, Shixing
    Godil, Saad
    Prenger, Ryan
    Anandkumar, Anima
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] READSUM: Retrieval-Augmented Adaptive Transformer for Source Code Summarization
    Choi, Yunseok
    Na, Cheolwon
    Kim, Hyojun
    Lee, Jee-Hyong
    IEEE ACCESS, 2023, 11 : 51155 - 51165
  • [44] Building a Coding Assistant via the Retrieval-Augmented Language Model
    Li, Xinze
    Wang, Hanbin
    Liu, Zhenghao
    Yu, Shi
    Wang, Shuo
    Yan, Yukun
    Fu, Yukai
    Gu, Yu
    Yu, Ge
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2025, 43 (02)
  • [45] A Retrieval-Augmented Generation Strategy to Enhance Medical Chatbot Reliability
    Haez, Saba Ghanbari
    Segala, Marina
    Bellan, Patrizio
    Magnolini, Simone
    Sanna, Leonardo
    Consolandi, Monica
    Dragoni, Mauro
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 213 - 223
  • [46] Retrieval-Augmented Mining of Temporal Logic Specifications from Data
    Saveri, Gaia
    Bortolussi, Luca
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 315 - 331
  • [47] REALM: Retrieval-Augmented Language Model Pre-Training
    Guu, Kelvin
    Lee, Kenton
    Tung, Zora
    Pasupat, Panupong
    Chang, Ming-Wei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [48] Optimizing Retrieval-augmented Reader Models via Token Elimination
    Berchansky, Moshe
    Izsak, Peter
    Caciularu, Avi
    Dagan, Ido
    Wasserblat, Moshe
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 1506 - 1524
  • [49] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
    Lewis, Patrick
    Perez, Ethan
    Piktus, Aleksandra
    Petroni, Fabio
    Karpukhin, Vladimir
    Goyal, Naman
    Kuttler, Heinrich
    Lewis, Mike
    Yih, Wen-tau
    Rocktaschel, Tim
    Riedel, Sebastian
    Kiela, Douwe
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [50] Retrieval-Augmented Generation: Advancing personalized care and research in oncology
    Zarfati, Mor
    Soffer, Shelly
    Nadkarni, Girish N.
    Klang, Eyal
    EUROPEAN JOURNAL OF CANCER, 2025, 220