Play and rewind: Context-aware video temporal action proposals

被引:17
|
作者
Gao, Lianli [1 ]
Li, Tao [1 ]
Song, Jingkuan [1 ]
Zhao, Zhou [2 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal action proposal generation and detection; Deep learning; Untrimmed video analysis; ACTION RECOGNITION;
D O I
10.1016/j.patcog.2020.107477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the problem of Temporal Action Proposal (TAP) generation, which plays a fundamental role in large-scale untrimmed video analysis but remains largely unsolved. Most of the prior works proposed the temporal actions by predicting the temporal boundaries or actionness scores of video units. Nevertheless, context information among surrounding video units has not been adequately explored, which may result in severe loss of information. In this work, we propose a context-aware temporal action proposal network which makes full use of the contextual information in two aspects: 1) To generate initial proposals, we design a Bi-directional Parallel LSTMs to extract the visual features of a video unit by considering its contextual information. Therefore, the prediction of temporal boundaries and actionness scores will be more accurate because it knows what happened in the past and what will happen in the future; and 2) To refine the initial proposals, we design an action-attention based reranking network which considers both surrounding proposal and initial actionness scores to assign true action proposals with high confidence scores. Extensive experiments are conducted on two challenging datasets for both temporal action proposal generation and detection tasks, demonstrating the effectiveness of the proposed approach. In particular, on THUMOS'14 dataset, our method significantly surpasses state-of-the-art methods by 7.73% on AR@50. Our code is released at: https://github.com/Rheelt/TAPG. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Context-aware HDR video distribution for mobile devices
    Miguel Melo
    Luís Barbosa
    Maximino Bessa
    Kurt Debattista
    Alan Chalmers
    Multimedia Tools and Applications, 2017, 76 : 16605 - 16623
  • [32] Context-aware adaptation of mobile video decoding resolution
    Octavian Machidon
    Jani Asprov
    Tine Fajfar
    Veljko Pejović
    Multimedia Tools and Applications, 2023, 82 : 17599 - 17630
  • [33] A context-aware video display scheme for mobile devices
    Seo, K
    Kim, C
    MULTIMEDIA ON MOBILE DEVICES II, 2006, 6074
  • [34] Context-aware Deformable Alignment for Video Object Segmentation
    Yang, Jie
    Xia, Mingfu
    Zhou, Xue
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 303 - 309
  • [35] A Novel Spatial and Temporal Context-Aware Approach for Drone-Based Video Object Detection
    Pi, Zhaoliang
    Lian, Yanchao
    Chen, Xier
    Wu, Yinan
    Li, Yingping
    Jiao, Licheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 179 - 188
  • [36] MCDubber: Multimodal Context-Aware Expressive Video Dubbing
    Zhao, Yuan
    Jia, Zhenqi
    Liu, Rui
    Hu, De
    Bao, Feilong
    Gao, Guanglai
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 168 - 182
  • [37] Video Search with Context-Aware Ranker and Relevance Feedback
    Lokoc, Jakub
    Mejzlik, Frantisek
    Soucek, Tomas
    Dokoupil, Patrik
    Peska, Ladislav
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 505 - 510
  • [38] Context-Aware Activity Recognition and Anomaly Detection in Video
    Zhu, Yingying
    Nayak, Nandita M.
    Roy-Chowdhury, Amit K.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (01) : 91 - 101
  • [39] Context-aware HDR video distribution for mobile devices
    Melo, Miguel
    Barbosa, Luis
    Bessa, Maximino
    Debattista, Kurt
    Chalmers, Alan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (15) : 16605 - 16623
  • [40] Towards efficient video-based action recognition: context-aware memory attention network
    Koh, Thean Chun
    Yeo, Chai Kiat
    Jing, Xuan
    Sivadas, Sunil
    SN APPLIED SCIENCES, 2023, 5 (12):