From Strings to Things: Knowledge-enabled VQA Model that can Read and Reason

被引:28
|
作者
Singh, Ajeet Kumar [1 ]
Mishra, Anand [2 ,3 ]
Shekhar, Shashank [3 ]
Chakraborty, Anirban [3 ]
机构
[1] TCS Res, Pune, Maharashtra, India
[2] IIT Jodhpur, Jodhpur, Rajasthan, India
[3] Indian Inst Sci, Bangalore, Karnataka, India
关键词
SCENE TEXT;
D O I
10.1109/ICCV.2019.00470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text present in images are not merely strings, they provide useful cues about the image. Despite their utility in better image understanding, scene texts are not used in traditional visual question answering (VQA) models. In this work, we present a VQA model which can read scene texts and perform reasoning on a knowledge graph to arrive at an accurate answer. Our proposed model has three mutually interacting modules: (i) proposal module to get word and visual content proposals from the image, (ii) fusion module to fuse these proposals, question and knowledge base to mine relevant facts, and represent these facts as multi-relational graph, (iii) reasoning module to perform a novel gated graph neural network based reasoning on this graph. The performance of our knowledge-enabled VQA model is evaluated on our newly introduced dataset, viz. text-KVQA. To the best of our knowledge, this is the first dataset which identifies the need for bridging text recognition with knowledge graph based reasoning. Through extensive experiments, we show that our proposed method outperforms traditional VQA as well as question-answering over knowledge base-based methods on text-KVQA.
引用
收藏
页码:4601 / 4611
页数:11
相关论文
共 12 条
  • [1] A Survey of Advanced Information Fusion System: from Model-Driven to Knowledge-Enabled
    Zhu, Di
    Yin, Hailian
    Xu, Yidan
    Wu, Jiaqi
    Zhang, Bowen
    Cheng, Yaqi
    Yin, Zhanzuo
    Yu, Ziqiang
    Wen, Hao
    Li, Bohan
    DATA SCIENCE AND ENGINEERING, 2023, 8 (02) : 85 - 97
  • [2] A Survey of Advanced Information Fusion System: from Model-Driven to Knowledge-Enabled
    Di Zhu
    Hailian Yin
    Yidan Xu
    Jiaqi Wu
    Bowen Zhang
    Yaqi Cheng
    Zhanzuo Yin
    Ziqiang Yu
    Hao Wen
    Bohan Li
    Data Science and Engineering, 2023, 8 : 85 - 97
  • [3] An ensemble model for idioms and literal text classification using knowledge-enabled BERT in deep learning
    Abarna S.
    Sheeba J.I.
    Devaneyan S.P.
    Measurement: Sensors, 2022, 24
  • [4] Discovering superhard high-entropy diboride ceramics via a hybrid data-driven and knowledge-enabled model
    Lu, Jiaqi
    Zhang, Fengpei
    Wang, William Yi
    Yao, Gang
    Gao, Xingyu
    Liu, Ya
    Zhang, Zhi
    Wang, Jun
    Wang, Yiguang
    Liang, Xiubing
    Song, Haifeng
    Li, Jinshan
    Zhang, Pingxiang
    JOURNAL OF THE AMERICAN CERAMIC SOCIETY, 2023, 106 (11) : 6923 - 6936
  • [5] Can the Hagedorn phase transition be explained from matrix model for strings?
    Sathiapalan, B.
    Sircar, Nilanjan
    JOURNAL OF HIGH ENERGY PHYSICS, 2008, (08):
  • [6] A User-Centric Knowledge Creation Model in a Web of Object-Enabled Internet of Things Environment
    Kibria, Muhammad Golam
    Fattah, Sheik Mohammad Mostakim
    Jeong, Kwanghyeon
    Chong, Ilyoung
    Jeong, Youn-Kwae
    SENSORS, 2015, 15 (09) : 24054 - 24086
  • [7] How the Internet of Things can help knowledge management: a case study from the automotive domain
    Uden, Lorna
    He, Wu
    JOURNAL OF KNOWLEDGE MANAGEMENT, 2017, 21 (01) : 57 - 70
  • [8] An Explainable Artificial-Intelligence-Based CNN Model for Knowledge Extraction From the Social Internet of Things: Proposing a New Model
    Alkwai, Lulwah M.
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2022, 8 (04): : 48 - 51
  • [9] "You Can Only Choose From the Things You Know": Engaging With Students' Dark and Politicized Funds of Knowledge in the Music Classroom
    Bylica, Kelly
    EDUCATION AND URBAN SOCIETY, 2023, 55 (05) : 617 - 640
  • [10] Customer knowledge management and IT-enabled business model innovation: A conceptual framework and a case study from China
    Wu Jiebing
    Guo Bin
    Shi Yongjiang
    EUROPEAN MANAGEMENT JOURNAL, 2013, 31 (04) : 359 - 372