Play and rewind: Context-aware video temporal action proposals

被引:17
|
作者
Gao, Lianli [1 ]
Li, Tao [1 ]
Song, Jingkuan [1 ]
Zhao, Zhou [2 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal action proposal generation and detection; Deep learning; Untrimmed video analysis; ACTION RECOGNITION;
D O I
10.1016/j.patcog.2020.107477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the problem of Temporal Action Proposal (TAP) generation, which plays a fundamental role in large-scale untrimmed video analysis but remains largely unsolved. Most of the prior works proposed the temporal actions by predicting the temporal boundaries or actionness scores of video units. Nevertheless, context information among surrounding video units has not been adequately explored, which may result in severe loss of information. In this work, we propose a context-aware temporal action proposal network which makes full use of the contextual information in two aspects: 1) To generate initial proposals, we design a Bi-directional Parallel LSTMs to extract the visual features of a video unit by considering its contextual information. Therefore, the prediction of temporal boundaries and actionness scores will be more accurate because it knows what happened in the past and what will happen in the future; and 2) To refine the initial proposals, we design an action-attention based reranking network which considers both surrounding proposal and initial actionness scores to assign true action proposals with high confidence scores. Extensive experiments are conducted on two challenging datasets for both temporal action proposal generation and detection tasks, demonstrating the effectiveness of the proposed approach. In particular, on THUMOS'14 dataset, our method significantly surpasses state-of-the-art methods by 7.73% on AR@50. Our code is released at: https://github.com/Rheelt/TAPG. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Towards efficient video-based action recognition: context-aware memory attention network
    Thean Chun Koh
    Chai Kiat Yeo
    Xuan Jing
    Sunil Sivadas
    SN Applied Sciences, 2023, 5
  • [42] Temporal Context-Aware Representation Learning for Question Routing
    Zhang, Xuchao
    Cheng, Wei
    Zong, Bo
    Chen, Yuncong
    Xu, Jianwu
    Li, Ding
    Chen, Haifeng
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 753 - 761
  • [43] Temporal context-aware task recommendation in crowdsourcing systems
    Yuen, Man-Ching
    King, Irwin
    Leung, Kwong-Sak
    KNOWLEDGE-BASED SYSTEMS, 2021, 219
  • [44] Context-Aware Neural Model for Temporal Information Extraction
    Meng, Yuanliang
    Rumshisky, Anna
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 527 - 536
  • [45] Temporal context-aware motion-saliency detection
    Xu, Mengxi
    Wu, Xiaobin
    Ma, Zhizhong
    Wang, Ruili
    Lu, Huimin
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (01)
  • [46] Research on temporal representation and reasoning for context-aware computing
    Liu, Dong
    Meng, Xiangwu
    Chen, Junliang
    Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): : 342 - 347
  • [47] Designing context-aware interaction: An action research study
    Olsson, CM
    Henfridsson, O
    DESIGNING UBIQUITOUS INFORMATION ENVIRONMENTS: SOCIO-TECHNICAL ISSUES AND CHALLENGES, 2005, 185 : 233 - 247
  • [48] Improve Temporal Action Proposals using Hierarchical Context
    Liu, Qinying
    Wang, Zilei
    Rong, Shenghai
    PATTERN RECOGNITION, 2023, 140
  • [49] Context-aware joint Video Summarization and Streaming (CVSS) Approach
    Farouk, Hesham
    ElDahshan, Kamal A.
    Abozeid, Amr
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 597 - 602
  • [50] Context-Aware Handover for Voice and Video Applications in WiMax/WiFi
    Akkari, Nadine
    Al Hazmi, Hanan
    WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,