Video question answering via traffic knowledge database and question classification

Authors
Xiaoyong Sun
Yu Dai
Yuchen Wang
Weifeng Ma
Xuefen Lin
Affiliation
[1] Zhejiang University of Science and Technology, School of Information and Electronic Engineering
Source
Multimedia Systems | 2024 / Vol. 30
Keywords
Video question answering; Knowledge; Transformer; Question classification
DOI
Not available
Abstract
Video question answering (VideoQA) is the task of answering questions about videos. The main idea is to understand the content of the video and combine it with the relevant semantic context to answer various types of questions. Existing methods typically analyze the spatiotemporal correlations of the entire video to answer questions. However, for some simple questions, the answer depends on only a specific frame of the video, and analyzing the entire video needlessly increases the learning cost. For some complex questions, the information contained in the video is limited, and these methods are not sufficient to answer them fully. Therefore, we propose a VideoQA model based on question classification and a traffic knowledge database. The model starts from the question: it classifies questions into general scene questions and causal questions and processes each type with a different method. For general scene questions, we first extract the key frames of the video, which converts the problem into a simpler image question-answering task, and then process it with top-down and bottom-up attention mechanisms. For causal questions, we design a lightweight traffic knowledge database that provides relevant traffic knowledge not originally present in VideoQA datasets to support the model's reasoning. We then use a question- and knowledge-guided aggregation graph attention network to process causal questions. The experimental results show that, while greatly reducing resource costs, our model performs better on the TrafficQA dataset than models that use millions of external data samples for pretraining.
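
The routing idea in the abstract (classify the question first, then send it to one of two branches) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how questions are classified, so the keyword list `CAUSAL_CUES`, the function names, and the placeholder branch bodies below are all hypothetical and stand in for the learned components.

```python
from typing import Callable, Dict

# Hypothetical cue phrases: the abstract does not state how questions are classified.
CAUSAL_CUES = ("why", "cause", "reason", "what if")


def classify_question(question: str) -> str:
    """Label a question "causal" if it contains a cue phrase, else "general"."""
    text = question.lower()
    return "causal" if any(cue in text for cue in CAUSAL_CUES) else "general"


def general_branch(question: str) -> str:
    # Placeholder for: key-frame extraction, then image QA with attention.
    return "key frames -> image QA"


def causal_branch(question: str) -> str:
    # Placeholder for: knowledge lookup, then graph attention over the video.
    return "knowledge lookup -> graph attention"


# Map each question class to the branch that handles it.
BRANCHES: Dict[str, Callable[[str], str]] = {
    "general": general_branch,
    "causal": causal_branch,
}


def route(question: str) -> str:
    """Send the question to the branch chosen by the classifier."""
    return BRANCHES[classify_question(question)](question)


if __name__ == "__main__":
    for q in ("What color is the car?", "Why did the truck brake suddenly?"):
        print(q, "=>", route(q))
```

The point of the sketch is only the control flow: a cheap decision on the question determines whether the expensive whole-video reasoning path is needed at all.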