Dual-Key Multimodal Backdoors for Visual Question Answering

被引:0
|
作者
Walmer, Matthew [1 ]
Sikka, Karan [2 ]
Sur, Indranil [2 ]
Shrivastava, Abhinav [1 ]
Jha, Susmit [2 ]
机构
[1] University of Maryland, College Park, United States
[2] SRI International, United States
来源
arXiv | 2021年
关键词
Backdoors - Multi-modal - Multimodal models - Multiple inputs - Non-trivial - Object detectors - Question Answering - Security vulnerabilities - Trojans - Visual feature;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [21] Improving Visual Question Answering by Multimodal Gate Fusion Network
    Xiang, Shenxiang
    Chen, Qiaohong
    Fang, Xian
    Guo, Menghao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [22] Multimodal Dual Attention Memory for Video Story Question Answering
    Kim, Kyung-Min
    Choi, Seong-Ho
    Kim, Jin-Hwa
    Zhang, Byoung-Tak
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 698 - 713
  • [23] DSAF: A Dual-Stage Attention Based Multimodal Fusion Framework for Medical Visual Question Answering
    K. Mukesh
    S. L. Jayaprakash
    R. Prasanna Kumar
    SN Computer Science, 6 (4)
  • [24] Dual Attention and Question Categorization-Based Visual Question Answering
    Mishra A.
    Anand A.
    Guha P.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (01): : 81 - 91
  • [25] Visual-Semantic Dual Channel Network for Visual Question Answering
    Wang, Xin
    Chen, Qiaohong
    Hu, Ting
    Sun, Qi
    Jia, Yubo
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [26] Membrane trafficking - Dual-key strategy
    Itoh, T
    De Camilli, P
    NATURE, 2004, 429 (6988) : 141 - 143
  • [27] Visual Experience-Based Question Answering with Complex Multimodal Environments
    Kim, Incheol
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [28] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
    Chen, Chongqing
    Han, Dezhi
    Wang, Jun
    IEEE ACCESS, 2020, 8 : 35662 - 35671
  • [29] HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language
    Parida, Shantipriya
    Abdulmumin, Idris
    Muhammad, Shamsuddeen Hassan
    Bose, Aneesh
    Kohli, Guneet Singh
    Ahmad, Ibrahim Said
    Kotwal, Ketan
    Sarkar, Sayan Deb
    Bojar, Ondrej
    Kakudi, Habeebah Adamu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10162 - 10183
  • [30] Multimodal feature fusion by relational reasoning and attention for visual question answering
    Zhang, Weifeng
    Yu, Jing
    Hu, Hua
    Hu, Haiyang
    Qin, Zengchang
    INFORMATION FUSION, 2020, 55 (55) : 116 - 126