SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering

被引:0
|
作者
Xiong, Peixi [1 ]
You, Quanzeng [2 ]
Yu, Pei [2 ]
Liu, Zicheng [2 ]
Wu, Ying [1 ]
机构
[1] Northwestern University, United States
[2] Microsoft Research
来源
arXiv | 2022年
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning
引用
收藏
相关论文
共 50 条
  • [31] Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering
    Naik, Nandita
    Potts, Christopher
    Kreiss, Elisa
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2813 - 2817
  • [32] Event-Oriented Visual Question Answering: The E-VQA Dataset and Benchmark
    Yang, Zhenguo
    Xiang, Jiale
    You, Jiuxiang
    Li, Qing
    Liu, Wenyin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10210 - 10223
  • [33] Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
    Yash Goyal
    Tejas Khot
    Aishwarya Agrawal
    Douglas Summers-Stay
    Dhruv Batra
    Devi Parikh
    International Journal of Computer Vision, 2019, 127 : 398 - 414
  • [34] Graph-enhanced visual representations and question-guided dual attention for visual question answering
    Yusuf, Abdulganiyu Abdu
    Feng, Chong
    Mao, Xianling
    Haruna, Yunusa
    Li, Xinyan
    Duma, Ramadhani Ally
    NEUROCOMPUTING, 2025, 614
  • [35] Visual Question Answering
    Nada, Ahmed
    Chen, Min
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 6 - 10
  • [36] Spot the Difference: Difference Visual Question Answering with Residual Alignment
    Lu, Zilin
    Xie, Yutong
    Zeng, Qingjie
    Lu, Mengkang
    Wu, Qi
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 649 - 658
  • [37] A Corpus for Visual Question Answering Annotated with Frame Semantic Information
    Alizadeh, Mehrdad
    Di Eugenio, Barbara
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5524 - 5531
  • [38] Semantic Relation Graph Reasoning Network for Visual Question Answering
    Lan, Hong
    Zhang, Pufen
    TWELFTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2021, 11719
  • [39] Learning visual question answering on controlled semantic noisy labels
    Zhang, Haonan
    Zeng, Pengpeng
    Hu, Yuxuan
    Qian, Jin
    Song, Jingkuan
    Gao, Lianli
    PATTERN RECOGNITION, 2023, 138
  • [40] A Language-Guided Progressive Fusion Network with semantic density alignment for Medical Visual Question Answering
    Du, Shuxian
    Liang, Shuang
    Gu, Yu
    JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 165