Unsupervised Video Summarization via Relation-Aware Assignment Learning

被引:23
|
作者
Gao, Junyu [1 ,2 ,3 ]
Yang, Xiaoshan [1 ,2 ,3 ]
Zhang, Yingying [1 ,2 ,3 ]
Xu, Changsheng [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] PengCheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Training; Optimization; Semantics; Recurrent neural networks; Task analysis; Graph neural network; unsupervised learning; video summarization; ACTION RECOGNITION; DEEP;
D O I
10.1109/TMM.2020.3021980
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of unsupervised video summarization that automatically selects key video clips. Most state-of-the-art approaches suffer from two issues: (1) they model video clips without explicitly exploiting their relations, and (2) they learn soft importance scores over all the video clips to generate the summary representation. However, a meaningful video summary should be inferred by taking the relation-aware context of the original video into consideration, and directly selecting a subset of clips with a hard assignment. In this paper, we propose to exploit clip-clip relations to learn relation-aware hard assignments for selecting key clips in an unsupervised manner. First, we consider the clips as graph nodes to construct an assignment-learning graph. Then, we utilize the magnitude of the node features to generate hard assignments as the summary selection. Finally, we optimize the whole framework via a proposed multi-task loss including a reconstruction constraint, and a contrastive constraint. Extensive experimental results on three popular benchmarks demonstrate the favourable performance of our approach.
引用
收藏
页码:3203 / 3214
页数:12
相关论文
共 50 条
  • [21] Unsupervised Reinforcement Learning For Video Summarization Reward Function
    Wang, Lei
    Zhu, Yaping
    Pan, Hong
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 40 - 44
  • [22] ReGR: Relation-aware graph reasoning framework for video question answering
    Wang, Zheng
    Li, Fangtao
    Ota, Kaoru
    Dong, Mianxiong
    Wu, Bin
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [23] Semantic Relation-aware Difference Representation Learning for Change Captioning
    Tu, Yunbin
    Yao, Tingting
    Li, Liang
    Lou, Jiedong
    Gao, Shengxiang
    Yu, Zhengtao
    Yan, Chenggang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 63 - 73
  • [24] TransFER: Learning Relation-aware Facial Expression Representations with Transformers
    Xue, Fanglei
    Wang, Qiangchang
    Guo, Guodong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3581 - 3590
  • [25] Gaze Aware Deep Learning Model for Video Summarization
    Wu, Jiaxin
    Zhong, Sheng-hua
    Ma, Zheng
    Heinen, Stephen J.
    Jiang, Jianmin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 285 - 295
  • [26] Unsupervised Video Summarization via Multi-source Features
    Kanafani, Hussain
    Ghauri, Junaid Ahmed
    Hakimov, Sherzod
    Ewerth, Ralph
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 466 - 470
  • [27] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    SENSORS, 2023, 23 (07)
  • [28] Endoscopy Video Summarization based on Unsupervised Learning and Feature Discrimination
    Ben Ismail, M. Maher
    Bchir, Ouiem
    Emam, Ahmed Z.
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [29] Dynamic Relation-Aware Multiple Instance Learning for Few-shot Learning
    Zheng, Kaipeng
    Cheng, Liu
    Shen, Jie
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [30] RAN: A Relation-aware Network for Relation Extraction
    Li, Yile
    Gu, Xiaoyan
    Yue, Yinliang
    Wang, Zhuo
    Li, Bo
    Wang, Weiping
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,