Play and rewind: Context-aware video temporal action proposals

被引:17
|
作者
Gao, Lianli [1 ]
Li, Tao [1 ]
Song, Jingkuan [1 ]
Zhao, Zhou [2 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal action proposal generation and detection; Deep learning; Untrimmed video analysis; ACTION RECOGNITION;
D O I
10.1016/j.patcog.2020.107477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the problem of Temporal Action Proposal (TAP) generation, which plays a fundamental role in large-scale untrimmed video analysis but remains largely unsolved. Most of the prior works proposed the temporal actions by predicting the temporal boundaries or actionness scores of video units. Nevertheless, context information among surrounding video units has not been adequately explored, which may result in severe loss of information. In this work, we propose a context-aware temporal action proposal network which makes full use of the contextual information in two aspects: 1) To generate initial proposals, we design a Bi-directional Parallel LSTMs to extract the visual features of a video unit by considering its contextual information. Therefore, the prediction of temporal boundaries and actionness scores will be more accurate because it knows what happened in the past and what will happen in the future; and 2) To refine the initial proposals, we design an action-attention based reranking network which considers both surrounding proposal and initial actionness scores to assign true action proposals with high confidence scores. Extensive experiments are conducted on two challenging datasets for both temporal action proposal generation and detection tasks, demonstrating the effectiveness of the proposed approach. In particular, on THUMOS'14 dataset, our method significantly surpasses state-of-the-art methods by 7.73% on AR@50. Our code is released at: https://github.com/Rheelt/TAPG. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Context-Aware Temporal Logic for Probabilistic Systems
    Elfar, Mahmoud
    Wang, Yu
    Pajic, Miroslav
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2020), 2020, 12302 : 215 - 232
  • [22] Spatial-Temporal Context-Aware Tracking
    Han, Yuqi
    Deng, Chenwei
    Zhao, Boya
    Zhao, Baojun
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) : 500 - 504
  • [23] Temporal knowledge completion with context-aware embeddings
    Liu, Yu
    Hua, Wen
    Qu, Jianfeng
    Xin, Kexuan
    Zhou, Xiaofang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (02): : 675 - 695
  • [24] Temporal knowledge completion with context-aware embeddings
    Yu Liu
    Wen Hua
    Jianfeng Qu
    Kexuan Xin
    Xiaofang Zhou
    World Wide Web, 2021, 24 : 675 - 695
  • [25] Context-Aware Action with a Small Mobile Robot
    Withey, Daniel
    Mogokonyane, Katlego
    Tikam, Mayur
    Holder, Ross
    Veeraragoo, Mahalingam
    Gambushe, Mxolisi
    2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 410 - 415
  • [26] Context-aware adaptation of mobile video decoding resolution
    Machidon, Octavian
    Asprov, Jani
    Fajfar, Tine
    Pejovic, Veljko
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 17599 - 17630
  • [27] Context-Aware Video Reconstruction for Rolling Shutter Cameras
    Fan, Bin
    Dai, Yuchao
    Zhang, Zhiyuan
    Liu, Qi
    He, Mingyi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17551 - 17561
  • [28] Context-Aware Talking-Head Video Editing
    Yang, Songlin
    Wang, Wei
    Ling, Jun
    Peng, Bo
    Tan, Xu
    Dong, Jing
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7718 - 7727
  • [29] Context-Aware Adaptive Video Streaming for Mobile Users
    Mekki, Sami
    Karagkioules, Theodoros
    Valentin, Stefan
    2017 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2017, : 988 - 989
  • [30] Context-Aware Video Retargeting via Graph Model
    Qu, Zhan
    Wang, Jinqiao
    Xu, Min
    Lu, Hanqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1677 - 1687