Attention-Based Bidirectional Recurrent Neural Networks for Description Generation of Videos

Authors
Du, Xiaotong [1]
Yuan, Jiabin [1]
Liu, Hu [1]
Affiliation
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
Keywords
Video description; Convolutional Neural Networks; Bidirectional Recurrent Neural Networks; Attention mechanism;
DOI
10.1007/978-3-030-00021-9_40
Chinese Library Classification (CLC) number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
Describing videos in human language is of vital importance in many applications, such as managing massive numbers of online videos and providing descriptive video service (DVS) for blind people. To improve on existing video description frameworks, this paper presents an end-to-end deep learning model that combines Convolutional Neural Networks (CNNs) and Bidirectional Recurrent Neural Networks (BiRNNs) with a multimodal attention mechanism. First, the model produces richer video representations than comparable work, including image, motion, and audio features. Second, the BiRNN encoder encodes these features in both the forward and backward directions. Finally, an attention-based decoder translates the encoder's sequential outputs into a sequence of words. The model is evaluated on the Microsoft Research Video Description Corpus (MSVD) dataset. The results demonstrate the necessity of combining BiRNNs with a multimodal attention mechanism and the superiority of this model over other state-of-the-art methods evaluated on this dataset.
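The encode-then-attend idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dot-product scoring, the function names (`bidirectional_states`, `attend`), and the toy vectors are all assumptions made for illustration, and the paper's multimodal attention over image, motion, and audio features is not reproduced here.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def bidirectional_states(forward, backward):
    """Concatenate per-step forward and backward encoder states.

    `backward` is given in processing order (last step first), so it is
    reversed to align with the forward states. (Hypothetical helper.)
    """
    return [f + b for f, b in zip(forward, reversed(backward))]

def attend(query, states):
    """Dot-product soft attention (an assumed scoring function).

    Returns the attention weights over time steps and the context vector,
    i.e. the weighted sum of the encoder states.
    """
    scores = [sum(q * h for q, h in zip(query, state)) for state in states]
    weights = softmax(scores)
    dim = len(states[0])
    context = [sum(w * state[i] for w, state in zip(weights, states))
               for i in range(dim)]
    return weights, context

# Toy data: 3 time steps, 2-dimensional states per direction.
forward = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
backward = [[0.5, 0.5], [1.0, 0.0], [0.0, 1.0]]  # processing order: t=2, t=1, t=0
states = bidirectional_states(forward, backward)
weights, context = attend([1.0, 0.0, 0.0, 1.0], states)
```

In a decoder of this kind, the query would be the decoder's current hidden state, and the context vector would be recomputed at every output word so that each word can focus on different time steps of the video.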
Pages: 440-451
Number of pages: 12
Related papers
  • [1] Attention-Based Bidirectional Gated Recurrent Unit Neural Networks for Sentiment Analysis
    Yu, Qing
    Zhao, Hui
    Wang, Zuohua
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019, : 116 - 119
  • [2] Explainable Software vulnerability detection based on Attention-based Bidirectional Recurrent Neural Networks
    Mao, Yi
    Li, Yun
    Sun, Jiatai
    Chen, Yixin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4651 - 4656
  • [3] Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks
    Ma, Fenglong
    Chitta, Radha
    Zhou, Jing
    You, Quanzeng
    Sun, Tong
    Gao, Jing
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1903 - 1911
  • [4] Detection of Paroxysmal Atrial Fibrillation using Attention-based Bidirectional Recurrent Neural Networks
    Shashikumar, Supreeth P.
    Shah, Amit J.
    Clifford, Gari D.
    Nemati, Shamim
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 715 - 723
  • [5] Conversational Analysis using Utterance-level Attention-based Bidirectional Recurrent Neural Networks
    Bothe, Chandrakant
    Magg, Sven
    Weber, Cornelius
    Wermter, Stefan
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 996 - 1000
  • [6] Attention-Based Convolution Bidirectional Recurrent Neural Network for Sentiment Analysis
    Sivakumar, Soubraylu
    Haritha, D.
    Ram, Sree N.
    Kumar, Naveen
    Krishna, Rama G.
    Kumar, Dinesh A.
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [7] Attention-based bidirectional gated recurrent unit neural networks for well logs prediction and lithology identification
    Zeng, Lili
    Ren, Weijian
    Shan, Liqun
    NEUROCOMPUTING, 2020, 414 : 153 - 171
  • [8] Text Classification Research with Attention-based Recurrent Neural Networks
    Du, C.
    Huang, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2018, 13 (01) : 50 - 61
  • [9] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    IEEE ACCESS, 2024, 12 : 8123 - 8134
  • [10] Attention-Based Radar PRI Modulation Recognition With Recurrent Neural Networks
    Li, Xueqiong
    Liu, Zhangmeng
    Huang, Zhitao
    IEEE ACCESS, 2020, 8 : 57426 - 57436