Efficient Video Captioning with Frame Similarity-Based Filtering

Authors
Rashno, Elyas [1]
Zulkernine, Farhana [1]
Affiliations
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
Keywords
Video Caption Generation; Video Frame Similarity; Sequence to Sequence; Stacked LSTM
DOI
10.1007/978-3-031-39821-6_7
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Video captioning combines computer vision and Natural Language Processing (NLP) to perform the challenging task of scene understanding. Rapid advances in artificial intelligence have led to growing interest in video captioning, which involves generating natural language descriptions of the visual content of videos. In this paper, we present a novel approach to video caption generation. The proposed method first extracts frames from the video and then reduces the number of frames based on their similarity. The remaining frames are processed by a Convolutional Neural Network (CNN) to extract feature vectors, which are fed into a Long Short-Term Memory (LSTM) network to generate the captions. A comparison with state-of-the-art models shows that the proposed approach outperforms existing methods on the MSVD, M-VAD, and MPII-MD datasets.
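The frame-reduction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure or threshold the authors use, so the mean absolute pixel difference, the `threshold` value, and the names `frame_difference` and `filter_similar_frames` below are all assumptions chosen for illustration, not the paper's method. Frames are represented as flattened grayscale intensity lists to keep the example dependency-free.

```python
from typing import List, Sequence

# Assumption: each frame is a flattened list of grayscale intensities in [0, 255].
Frame = Sequence[float]


def frame_difference(a: Frame, b: Frame) -> float:
    """Mean absolute pixel difference between two frames, scaled to [0, 1]."""
    if not a or len(a) != len(b):
        raise ValueError("frames must be non-empty and have the same size")
    return sum(abs(x - y) for x, y in zip(a, b)) / (255.0 * len(a))


def filter_similar_frames(frames: Sequence[Frame], threshold: float = 0.1) -> List[int]:
    """Return the indices of the frames to keep.

    The first frame is always kept. Each later frame is kept only if it
    differs from the most recently kept frame by more than `threshold`
    (a hypothetical cut-off, not a value taken from the paper).
    """
    kept: List[int] = []
    for i, frame in enumerate(frames):
        if not kept or frame_difference(frames[kept[-1]], frame) > threshold:
            kept.append(i)
    return kept


# Toy "video": five 2x2 frames, flattened.
frames = [
    [0, 0, 0, 0],
    [2, 1, 0, 3],          # nearly identical to frame 0
    [200, 210, 190, 205],  # large change
    [201, 209, 191, 204],  # nearly identical to frame 2
    [10, 5, 0, 8],         # large change again
]
kept_indices = filter_similar_frames(frames, threshold=0.1)
print(kept_indices)  # [0, 2, 4]
```

In a pipeline like the one the abstract outlines, only the kept frames would be passed on to the CNN feature extractor, which reduces the length of the sequence the LSTM has to process.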
Pages: 98-112
Number of pages: 15
Related papers (entries 31-40 of 50)
  • [31] Efficient similarity-based data clustering by optimal object to cluster reallocation
    Rossignol, Mathias
    Lagrange, Mathieu
    Cont, Arshia
    PLOS ONE, 2018, 13 (06):
  • [32] A similarity-based resolution rule
    Fontana, FA
    Formato, F
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2002, 17 (09) : 853 - 872
  • [33] Similarity-based Product Configuration
    Schuh, Guenther
    Rudolf, Stefan
    Riesener, Michael
    VARIETY MANAGEMENT IN MANUFACTURING: PROCEEDINGS OF THE 47TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2014, 17 : 290 - 295
  • [34] FedGroup: Efficient Federated Learning via Decomposed Similarity-Based Clustering
    Duan, Moming
    Liu, Duo
    Ji, Xinyuan
    Liu, Renping
    Liang, Liang
    Chen, Xianzhang
    Tan, Yujuan
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 228 - 237
  • [35] Structural similarity-based rate control algorithm for 3D video
    Harshalatha, Y.
    Biswas, Prabir Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25897 - 25908
  • [36] Efficient and robust data augmentation for trajectory analytics: a similarity-based approach
    Dan He
    Sibo Wang
    Boyu Ruan
    Bolong Zheng
    Xiaofang Zhou
    World Wide Web, 2020, 23 : 361 - 387
  • [37] Pro-Frame: similarity-based gene recognition in eukaryotic DNA sequences with errors
    Mironov, AA
    Novichkov, PS
    Gelfand, MS
    BIOINFORMATICS, 2001, 17 (01) : 13 - 15
  • [38] Exploring Similarity-Based Graph Compression for Efficient Network Analysis and Embedding
    Akin, Hamdi Selim
    Aktas, Mehmet Emin
    Islam, Muhammed Ifte
    Hossain, Tanvir
    Akbas, Esra
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [39] A similarity-based approach to aggregation
    Jacas, J
    Recasens, J
    FUZZ-IEEE 2005: PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 658 - 662
  • [40] Sparse Similarity-Based Fisherfaces
    Fagertun, Jens
    Gomez, David D.
    Hansen, Mads F.
    Paulsen, Rasmus R.
    IMAGE ANALYSIS: 17TH SCANDINAVIAN CONFERENCE, SCIA 2011, 2011, 6688 : 69 - 78