Using independently recurrent networks for reinforcement learning based unsupervised video summarization

被引:0
|
作者
Gokhan Yaliniz
Nazli Ikizler-Cinbis
机构
[1] Hacettepe University,Department of Computer Engineering
来源
关键词
Video summarization; Recurrent neural networks; Reinforcement learning; Unsupervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Sigmoid and hyperbolic activation functions in long short-term memory (LSTM) and gated recurrent unit (GRU) based models used in recent studies on video summarization, may cause gradient decay over layers. Moreover, interpreting and developing network models are difficult because of entanglement of neurons on recurrent neural network (RNN). To solve these issues, in this study, we propose a method that uses deep reinforcement learning together with independently recurrent neural networks (IndRNN) for unsupervised video summarization. In this method, Leaky Rectified Linear Unit (Leaky ReLU) is used as an activation function to deal with decaying gradient and dying neuron problems. The model, which does not rely on any labels or user interaction, is designed with a reward function that jointly accounts for uniformity, diversity and representativeness of generated summaries. In this way, our model can create summaries as uniform as possible, has more layers and can be trained with more steps without having any problem related to gradients. Based on the experiments conducted on two benchmark datasets, we observe that, compared to the state-of-the-art methods on video summarization task, better summarization performance can be obtained.
引用
收藏
页码:17827 / 17847
页数:20
相关论文
共 50 条
  • [21] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
    He, Xufeng
    Hua, Yang
    Song, Tao
    Zhang, Zongpu
    Xue, Zhengui
    Ma, Ruhui
    Robertson, Neil
    Guan, Haibing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
  • [22] Deep Reinforcement Learning for Video Summarization with Semantic Reward
    Sun, Haoran
    Zhu, Xiaolong
    Zhou, Conghua
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 754 - 755
  • [23] VIDEOWHISPER: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks
    Zhao, Na
    Zhang, Hanwang
    Hong, Richang
    Wang, Meng
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2080 - 2092
  • [24] An Unsupervised Video Summarization Method Based on Multimodal Representation
    Lei, Zhuo
    Yu, Qiang
    Shou, Lidan
    Li, Shengquan
    Mao, Yunqing
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 171 - 180
  • [25] Unsupervised Video Summarization based on Consistent Clip Generation
    Ai, Xin
    Song, Yan
    Li, Zechao
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [26] A working memory model based on recurrent neural networks using reinforcement learning
    Wang, Mengyuan
    Wang, Yihong
    Xu, Xuying
    Pan, Xiaochuan
    COGNITIVE NEURODYNAMICS, 2024, 18 (05) : 3031 - 3058
  • [27] Extractive summarization using supervised and unsupervised learning
    Mao, Xiangke
    Yang, Hui
    Huang, Shaobin
    Liu, Ye
    Li, Rongsheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 : 173 - 181
  • [28] Text summarization using unsupervised deep learning
    Yousefi-Azar, Mahmood
    Hamey, Len
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 68 : 93 - 105
  • [29] Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
    Yuan, Li
    Tay, Francis Eng Hock
    Li, Ping
    Feng, Jiashi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2711 - 2722
  • [30] Escaping local minima in deep reinforcement learning for video summarization
    Alexoudi, Panagiota
    Mademlis, Ioannis
    Pitas, Ioannis
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 530 - 534