Learning to Answer Questions from Image Using Convolutional Neural Network

被引:0
|
作者
Ma, Lin [1 ]
Lu, Zhengdong [1 ]
Li, Hang [1 ]
机构
[1] Huawei Technol, Noahs Ark Lab, Shenzhen, Peoples R China
来源
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2016年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to employ the convolutional neural network (CNN) for the image question answering (QA) task. Our proposed CNN provides an end-to-end framework with convolutional architectures for learning not only the image and question representations, but also their inter-modal interactions to produce the answer. More specifically, our model consists of three CNNs: one image CNN to encode the image content, one sentence CNN to compose the words of the question, and one multimodal convolution layer to learn their joint representation for the classification in the space of candidate answer words. We demonstrate the efficacy of our proposed model on the DAQUAR and COCO-QA datasets, which are two benchmark datasets for image QA, with the performances significantly outperforming the state-of-the-art.
引用
收藏
页码:3567 / 3573
页数:7
相关论文
共 50 条
  • [21] Learning Document Image Features With SqueezeNet Convolutional Neural Network
    Hassanpour, M.
    Malek, H.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2020, 33 (07): : 1201 - 1207
  • [22] Hyperspectral Image Classification With Convolutional Neural Network and Active Learning
    Cao, Xiangyong
    Yao, Jing
    Xu, Zongben
    Meng, Deyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07): : 4604 - 4616
  • [23] Learning a Convolutional Neural Network for Image Compact-Resolution
    Li, Yue
    Liu, Dong
    Li, Houqiang
    Li, Li
    Li, Zhu
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1092 - 1107
  • [24] Image Forensics Based on Transfer Learning and Convolutional Neural Network
    Zhan, Yifeng
    Chen, Yifang
    Zhang, Qiong
    Kang, Xiangui
    IH&MMSEC'17: PROCEEDINGS OF THE 2017 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, 2017, : 165 - 170
  • [25] Image Recognition of Coal and Coal Gangue Using a Convolutional Neural Network and Transfer Learning
    Pu, Yuanyuan
    Apel, Derek B.
    Szmigiel, Alicja
    Chen, Jie
    ENERGIES, 2019, 12 (09)
  • [26] Advancements in Image Classification using Convolutional Neural Network
    Sultana, Farhana
    Sufian, Abu
    Dutta, Paramartha
    2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2018, : 122 - 129
  • [27] Multifocus image fusion using convolutional neural network
    Wen, Yu
    Yang, Xiaomin
    Celik, Turgay
    Sushkova, Olga
    Albertini, Marcelo Keese
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34531 - 34543
  • [28] Thermal Image Enhancement using Convolutional Neural Network
    Choi, Yukyung
    Kim, Namil
    Hwang, Soonmin
    Kweon, In So
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 223 - 230
  • [29] Image Compressed Sensing Using Convolutional Neural Network
    Shi, Wuzhen
    Jiang, Feng
    Liu, Shaohui
    Zhao, Debin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 375 - 388
  • [30] Image dehazing using autoencoder convolutional neural network
    Singh, Richa
    Dubey, Ashwani Kumar
    Kapoor, Rajiv
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022, 13 (06) : 3002 - 3016