Video question answering via traffic knowledge database and question classification

被引:0
|
作者
Xiaoyong Sun
Yu Dai
Yuchen Wang
Weifeng Ma
Xuefen Lin
机构
[1] Zhejiang University of Science and Technology,School of Information and Electronic Engineering
来源
Multimedia Systems | 2024年 / 30卷
关键词
Video question answering; Knowledge; Transformer; Question classification;
D O I
暂无
中图分类号
学科分类号
摘要
Video question answering (VideoQA) is a task that involves answering questions related to videos. The main idea is to understand the content of the video and to combine it with the relevant semantic context to answer various types of questions. Existing methods typically analyze the spatiotemporal correlations of the entire video to answer questions. However, for some simple questions, the answer is related to only a specific frame of the video, and analyzing the entire video undoubtedly increases the learning cost. For some complex questions, the information contained in the video is limited, and these methods are not sufficient to fully answer such questions. Therefore, we proposes a VideoQA model based on question classification and a traffic knowledge database. The model starts from the perspective of the question and classifies the questions into general scene questions and causal questions using different methods to process these two types of questions. For general scene questions, we first extract the key frames of the video to convert it into a simpler image question-answering task and then we use top–down and bottom–up attention mechanisms to process it. For causal questions, we design a lightweight traffic knowledge database that provides relevant traffic knowledge not originally present in VideoQA datasets, to help model reasoning. Then, we use a question and knowledge-guided aggregation graph attention network to process causal questions. The experimental results show that while greatly reducing resource costs, our model performs better on the TrafficQA dataset than do models utilizing millions of external data for pretraining.
引用
收藏
相关论文
共 50 条
  • [21] Question Classification for Medical Domain Question Answering System
    Dodiya, Tripti
    Jain, Sonal
    2016 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2016), 2016, : 204 - 207
  • [22] Knowledge Base Question Answering via Structured Query Generation using Question domain
    Li, Jiecheng
    Peng, Zizhen
    Zhu, Xiaoying
    Lu, Keda
    2022 IEEE 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS, IUCC/CIT/DSCI/SMARTCNS, 2022, : 394 - 400
  • [23] Question Classification for Intelligent Question Answering: A Comprehensive Survey
    Sun, Hao
    Wang, Shu
    Zhu, Yunqiang
    Yuan, Wen
    Zou, Zhiqiang
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (10)
  • [24] Cross-domain question classification in community question answering via kernel mapping
    Su, Lei
    Hu, Zuoliang
    Yang, Bin
    Li, Yiyang
    Chen, Jun
    NEW REVIEW OF HYPERMEDIA AND MULTIMEDIA, 2015, 21 (3-4) : 227 - 241
  • [25] Question Identification and Classification on an Academic Question Answering Site
    Ojokoh, Bolanle
    Igbe, Tobore
    Araoye, Ayobami
    Ameh, Friday
    2016 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2016, : 223 - 224
  • [26] QUESTION ANSWERING USING QUESTION CLASSIFICATION AND DOCUMENT TAGGING
    Chali, Yllias
    APPLIED ARTIFICIAL INTELLIGENCE, 2009, 23 (06) : 500 - 521
  • [27] Question Formulation and Question Answering for Knowledge Graph Completion
    Khvalchik, Maria
    Blaschke, Christian
    Revenko, Artem
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2019), 2019, 1062 : 166 - 171
  • [28] Video Graph Transformer for Video Question Answering
    Xiao, Junbin
    Zhou, Pan
    Chua, Tat-Seng
    Yan, Shuicheng
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 39 - 58
  • [29] Video Reference: A Video Question Answering Engine
    Gao, Lei
    Li, Guangda
    Zheng, Yan-Tao
    Hong, Richang
    Chua, Tat-Seng
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 799 - +
  • [30] SQAD: Simple Question Answering Database
    Medved, Marek
    Horak, Ales
    RASLAN 2014: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2014, : 121 - 128