Unsupervised Video Summarization via Relation-Aware Assignment Learning

Cited by: 23
Authors
Gao, Junyu [1 ,2 ,3 ]
Yang, Xiaoshan [1 ,2 ,3 ]
Zhang, Yingying [1 ,2 ,3 ]
Xu, Changsheng [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] PengCheng Lab, Shenzhen 518066, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Training; Optimization; Semantics; Recurrent neural networks; Task analysis; Graph neural network; unsupervised learning; video summarization; ACTION RECOGNITION; DEEP;
DOI
10.1109/TMM.2020.3021980
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
We address the problem of unsupervised video summarization that automatically selects key video clips. Most state-of-the-art approaches suffer from two issues: (1) they model video clips without explicitly exploiting their relations, and (2) they learn soft importance scores over all the video clips to generate the summary representation. However, a meaningful video summary should be inferred by taking the relation-aware context of the original video into consideration and by directly selecting a subset of clips with a hard assignment. In this paper, we propose to exploit clip-clip relations to learn relation-aware hard assignments for selecting key clips in an unsupervised manner. First, we consider the clips as graph nodes to construct an assignment-learning graph. Then, we utilize the magnitude of the node features to generate hard assignments as the summary selection. Finally, we optimize the whole framework via a proposed multi-task loss that includes a reconstruction constraint and a contrastive constraint. Extensive experimental results on three popular benchmarks demonstrate the favourable performance of our approach.
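The selection idea in the abstract (clips as graph nodes, hard assignment from node-feature magnitude) can be illustrated with a toy sketch. This is not the authors' model: the function name `select_key_clips`, the cosine-similarity graph, the single softmax-weighted aggregation step, and the top-k rule are all assumptions made for illustration, and the training losses (reconstruction and contrastive constraints) are omitted entirely.

```python
import numpy as np


def select_key_clips(clip_features, num_selected):
    """Toy relation-aware hard assignment (illustrative assumption only)."""
    x = np.asarray(clip_features, dtype=float)

    # Clip-clip relations: cosine similarity between every pair of clips.
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    unit = x / np.maximum(norms, 1e-12)
    sim = unit @ unit.T

    # Row-wise softmax turns similarities into edge weights of a dense graph.
    weights = np.exp(sim - sim.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)

    # One round of message passing: each node aggregates its neighbours.
    nodes = weights @ x

    # Hard assignment: keep the clips whose node features have the
    # largest magnitude (L2 norm); all other clips get 0.
    magnitude = np.linalg.norm(nodes, axis=1)
    keep = np.argsort(-magnitude)[:num_selected]
    assignment = np.zeros(len(x), dtype=int)
    assignment[keep] = 1
    return assignment


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.normal(size=(8, 16))  # 8 hypothetical clips, 16-d features
    print(select_key_clips(features, num_selected=3))
```

The output is a binary vector with exactly `num_selected` ones, which is the sense in which the assignment is "hard" rather than a soft importance score over all clips.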
Pages: 3203-3214
Number of pages: 12