Efficient Video Captioning with Frame Similarity-Based Filtering

被引:0
|
作者
Rashno, Elyas [1 ]
Zulkernine, Farhana [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
关键词
Video Caption Generation; Video frame similarity; Sequence to Sequence; Stacked LSTM;
D O I
10.1007/978-3-031-39821-6_7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video captioning combines computer vision and Natural Language Processing (NLP) to perform the challenging task of scene understanding. The rapid advancements in artificial intelligence have led to a growing interest in video captioning, which involves generating natural language descriptions based on the visual content of videos. In this paper, we present a novel approach to video caption generation. The proposed method first extracts frames from the video and reduces the number of frames based on their similarity. The remaining frames are then processed by a Convolution Neural Network (CNN) to extract a feature vector, which is then fed into a Long Short-Term Memory (LSTM) network to generate the captions. The results are compared with the state-of-the-art models which demonstrate that the proposed approach outperforms the existing methods on MSVD, M-VAD, and MPII-MD datasets.
引用
收藏
页码:98 / 112
页数:15
相关论文
共 50 条
  • [21] A neural network filtering approach for similarity-based remaining useful life estimation
    Bektas, Oguz
    Jones, Jeffrey A.
    Sankararaman, Shankar
    Roychoudhury, Indranil
    Goebel, Kai
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2019, 101 (1-4): : 87 - 103
  • [22] An Efficient Framework for Dense Video Captioning
    Suin, Maitreya
    Rajagopalan, A. N.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12039 - 12046
  • [23] A Cognitive Similarity-Based Measure to Enhance the Performance of Collaborative Filtering-Based Recommendation System
    Jain, Gourav
    Mahara, Tripti
    Sharma, S. C.
    Sangaiah, Arun Kumar
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (06) : 1785 - 1793
  • [24] Efficient and robust data augmentation for trajectory analytics: a similarity-based approach
    He, Dan
    Wang, Sibo
    Ruan, Boyu
    Zheng, Bolong
    Zhou, Xiaofang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (01): : 361 - 387
  • [25] Content-based video retrieval based on similarity of frame sequence
    Shan, MK
    Lee, SY
    INTERNATIONAL WORKSHOP ON MULTI-MEDIA DATABASE MANAGEMENT SYSTEMS- PROCEEDINGS, 1998, : 90 - 97
  • [26] Structural similarity-based rate control algorithm for 3D video
    Y. Harshalatha
    Prabir Kumar Biswas
    Multimedia Tools and Applications, 2021, 80 : 25897 - 25908
  • [27] ONTOLOGICAL RERANKING APPROACH FOR HYBRID CONCEPT SIMILARITY-BASED VIDEO SHOTS INDEXATION
    Benmokhtar, Rachid
    Huet, Benoit
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 226 - 229
  • [28] An efficient similarity-based level set model for medical image segmentation
    Yu, Haiping
    He, Fazhi
    Pan, Yiteng
    Chen, Xiao
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2016, 10 (08):
  • [29] StateMiner: An Efficient Similarity-Based Approach for Optimal Mining of Role Hierarchy
    Takabi, Hassan
    Joshi, James B. D.
    SACMAT 2010: PROCEEDINGS OF THE 15TH ACM SYMPOSIUM ON ACCESS CONTROL MODELS AND TECHNOLOGIES, 2010, : 55 - 64
  • [30] Similarity of Query Results in Similarity-Based Databases
    Belohlavek, Radim
    Urbanova, Lucie
    Vychodil, Vilem
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2011, 6954 : 258 - 267