Using independently recurrent networks for reinforcement learning based unsupervised video summarization

被引：0

作者：

Gokhan Yaliniz

Nazli Ikizler-Cinbis

机构：

[1] Hacettepe University,Department of Computer Engineering

来源：

Multimedia Tools and Applications | 2021年 / 80卷

关键词：

Video summarization; Recurrent neural networks; Reinforcement learning; Unsupervised learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Sigmoid and hyperbolic activation functions in long short-term memory (LSTM) and gated recurrent unit (GRU) based models used in recent studies on video summarization, may cause gradient decay over layers. Moreover, interpreting and developing network models are difficult because of entanglement of neurons on recurrent neural network (RNN). To solve these issues, in this study, we propose a method that uses deep reinforcement learning together with independently recurrent neural networks (IndRNN) for unsupervised video summarization. In this method, Leaky Rectified Linear Unit (Leaky ReLU) is used as an activation function to deal with decaying gradient and dying neuron problems. The model, which does not rely on any labels or user interaction, is designed with a reward function that jointly accounts for uniformity, diversity and representativeness of generated summaries. In this way, our model can create summaries as uniform as possible, has more layers and can be trained with more steps without having any problem related to gradients. Based on the experiments conducted on two benchmark datasets, we observe that, compared to the state-of-the-art methods on video summarization task, better summarization performance can be obtained.

引用

页码：17827 / 17847

页数：20

共 50 条

[21] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
He, Xufeng
Hua, Yang
Song, Tao
Zhang, Zongpu
Xue, Zhengui
Ma, Ruhui
Robertson, Neil
Guan, Haibing
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
[22] Deep Reinforcement Learning for Video Summarization with Semantic Reward
Sun, Haoran
Zhu, Xiaolong
Zhou, Conghua
2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 754 - 755
[23] VIDEOWHISPER: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks
Zhao, Na
Zhang, Hanwang
Hong, Richang
Wang, Meng
Chua, Tat-Seng
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2080 - 2092
[24] An Unsupervised Video Summarization Method Based on Multimodal Representation
Lei, Zhuo
Yu, Qiang
Shou, Lidan
Li, Shengquan
Mao, Yunqing
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 171 - 180
[25] Unsupervised Video Summarization based on Consistent Clip Generation
Ai, Xin
Song, Yan
Li, Zechao
2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
[26] A working memory model based on recurrent neural networks using reinforcement learning
Wang, Mengyuan
Wang, Yihong
Xu, Xuying
Pan, Xiaochuan
COGNITIVE NEURODYNAMICS, 2024, 18 (05) : 3031 - 3058
[27] Extractive summarization using supervised and unsupervised learning
Mao, Xiangke
Yang, Hui
Huang, Shaobin
Liu, Ye
Li, Rongsheng
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 : 173 - 181
[28] Text summarization using unsupervised deep learning
Yousefi-Azar, Mahmood
Hamey, Len
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 68 : 93 - 105
[29] Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
Yuan, Li
Tay, Francis Eng Hock
Li, Ping
Feng, Jiashi
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2711 - 2722
[30] Escaping local minima in deep reinforcement learning for video summarization
Alexoudi, Panagiota
Mademlis, Ioannis
Pitas, Ioannis
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 530 - 534

← 1 2 3 4 5 →