Empirical study on using adapters for debiased Visual Question Answering

被引:2
|
作者
Cho, Jae Won [1 ]
Argaw, Dawit Mureja [1 ]
Oh, Youngtaek [1 ]
Kim, Dong-Jin [2 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Guseong Dong 23, Daejeon 34141, South Korea
[2] Hanyang Univ, Wangsimni Ro 222, Seoul 04763, South Korea
关键词
Visual Question Answering; Model Robustness; Biased Data; Adapters;
D O I
10.1016/j.cviu.2023.103842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we empirically study debiased Visual Question Answering (VQA) works with Adapters. Most VQA debiasing works sacrifice in-distribution (ID) performance for the sake of out-of-distribution (OOD) performance. Hence, we explore and experiment with the use of adapters to preserve the ID performance by training only a simple adapter network to debias and recreate performance. We conduct an extensive empirical study on recent well-established VQA debiasing works and show that the entirety of the debiasing information from the proposed debiasing methods can be captured and modelled using a single fully connected layer while preserving original network performance by skipping the adapters. Through our exploration, we find that different placements of adapters are required for different debiasing techniques and show the different possibilities of using adapters for debiasing through our experiments. We believe our findings in this work open up more questions to be asked and explored for the VQA community.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Debiased Visual Question Answering via the perspective of question types
    Huai, Tianyu
    Yang, Shuwen
    Zhang, Junhang
    Zhao, Jiabao
    He, Liang
    PATTERN RECOGNITION LETTERS, 2024, 178 : 181 - 187
  • [2] SQT: Debiased Visual Question Answering via Shuffling Question Types
    Huai, Tianyu
    Yang, Shuwen
    Zhang, Junhang
    Wang, Guoan
    Yu, Xinru
    Ma, Tianlong
    He, Liang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 600 - 605
  • [3] Debiased Visual Question Answering from Feature and Sample Perspectives
    Wen, Zhiquan
    Xu, Guanghui
    Tan, Mingkui
    Wu, Qingyao
    Wu, Qi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Balancing and Contrasting Biased Samples for Debiased Visual Question Answering
    Cao, Runlin
    Li, Zhixin
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 962 - 967
  • [5] An Empirical Study on the Language Modal in Visual Question Answering
    Peng, Daowan
    Wei, Wei
    Mao, Xian-Ling
    Fu, Yuanyuan
    Chen, Dangyang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4109 - 4117
  • [6] Robust data augmentation and contrast learning for debiased visual question answering
    Ning, Ke
    Li, Zhixin
    NEUROCOMPUTING, 2025, 626
  • [7] An Empirical Study of Multilingual Scene-Text Visual Question Answering
    Li, Lin
    Zhang, Haohan
    Fang, Zeqing
    PROCEEDINGS OF THE 2ND WORKSHOP ON USER-CENTRIC NARRATIVE SUMMARIZATION OF LONG VIDEOS, NARSUM 2023, 2023, : 3 - 8
  • [8] VQA-PDF: Purifying Debiased Features for Robust Visual Question Answering Task
    Bi, Yandong
    Jiang, Huajie
    Liu, Jing
    Liu, Mengting
    Hu, Yongli
    Yin, Baocai
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 264 - 277
  • [9] An Empirical Evaluation of Visual Question Answering for Novel Objects
    Ramakrishnan, Santhosh K.
    Pal, Ambar
    Sharma, Gaurav
    Mittal, Anurag
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7312 - 7321
  • [10] Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions
    Wen, Zhiquan
    Niu, Shuaicheng
    Li, Ge
    Wu, Qingyao
    Tan, Mingkui
    Wu, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2137 - 2147