Video Summarization with Long Short-Term Memory

Cited by: 411
Authors
Zhang, Ke [1]
Chao, Wei-Lun [1]
Sha, Fei [2]
Grauman, Kristen [3]
Affiliations
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[3] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
Source
Keywords
Video summarization; Long short-term memory; Speech recognition
DOI
10.1007/978-3-319-46478-7_47
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the task as a structured prediction problem, our main idea is to use Long Short-Term Memory (LSTM) to model the variable-range temporal dependency among video frames, so as to derive both representative and compact video summaries. The proposed model successfully accounts for the sequential structure crucial to generating meaningful video summaries, leading to state-of-the-art results on two benchmark datasets. In addition to advances in modeling techniques, we introduce a strategy to address the need for a large amount of annotated data for training complex learning approaches to summarization. There, our main idea is to exploit auxiliary annotated video summarization datasets, in spite of their heterogeneity in visual styles and contents. Specifically, we show that domain adaptation techniques can improve learning by reducing the discrepancies in the original datasets' statistical properties.
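As a rough illustration of the idea in the abstract, the sketch below runs a single-layer LSTM over a sequence of per-frame feature vectors, turns each hidden state into an importance score, and keeps the top-scoring frames as keyframes. It is not the authors' architecture or training procedure: the class `TinyLSTM`, the helper `select_keyframes`, the random untrained weights, the unidirectional pass, and the top-k selection rule are all assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def init_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTM:
    """Hypothetical single-layer LSTM with random, untrained weights."""

    GATES = ("i", "f", "o", "c")  # input, forget, output, candidate

    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # One weight matrix per gate, applied to the concatenation [x_t ; h_{t-1}].
        self.W = {g: init_matrix(hidden_dim, input_dim + hidden_dim, rng)
                  for g in self.GATES}
        self.b = {g: [0.0] * hidden_dim for g in self.GATES}
        # Linear read-out from the hidden state to a scalar importance score.
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_dim)]

    def step(self, x, h, c):
        z = x + h  # list concatenation of input and previous hidden state
        pre = {g: [a + b for a, b in zip(matvec(self.W[g], z), self.b[g])]
               for g in self.GATES}
        i = [sigmoid(v) for v in pre["i"]]
        f = [sigmoid(v) for v in pre["f"]]
        o = [sigmoid(v) for v in pre["o"]]
        cand = [math.tanh(v) for v in pre["c"]]
        # The cell state carries information across a variable number of frames.
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, cand)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new

    def score_frames(self, frames):
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        scores = []
        for x in frames:
            h, c = self.step(x, h, c)
            scores.append(sigmoid(sum(w * hj for w, hj in zip(self.w_out, h))))
        return scores


def select_keyframes(scores, k):
    # Keep the k highest-scoring frame indices, returned in temporal order.
    top = sorted(range(len(scores)), key=lambda t: scores[t], reverse=True)[:k]
    return sorted(top)


if __name__ == "__main__":
    rng = random.Random(1)
    frames = [[rng.random() for _ in range(4)] for _ in range(10)]  # toy features
    model = TinyLSTM(input_dim=4, hidden_dim=3)
    print(select_keyframes(model.score_frames(frames), k=3))
```

In a supervised setting such as the one the abstract describes, the weights would be fit to human-annotated summaries rather than left random; the sketch only shows how a recurrent state lets each frame's score depend on the frames before it.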
Pages: 766-782
Page count: 17