Play and rewind: Context-aware video temporal action proposals

被引：17

作者：

Gao, Lianli ^{[1
]}

Li, Tao ^{[1
]}

Song, Jingkuan ^{[1
]}

Zhao, Zhou ^{[2
]}

Shen, Heng Tao ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[2] Zhejiang Univ, Hangzhou, Peoples R China

来源：

PATTERN RECOGNITION | 2020年 / 107卷 / 107期

基金：

中国国家自然科学基金;

关键词：

Temporal action proposal generation and detection; Deep learning; Untrimmed video analysis; ACTION RECOGNITION;

D O I：

10.1016/j.patcog.2020.107477

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigate the problem of Temporal Action Proposal (TAP) generation, which plays a fundamental role in large-scale untrimmed video analysis but remains largely unsolved. Most of the prior works proposed the temporal actions by predicting the temporal boundaries or actionness scores of video units. Nevertheless, context information among surrounding video units has not been adequately explored, which may result in severe loss of information. In this work, we propose a context-aware temporal action proposal network which makes full use of the contextual information in two aspects: 1) To generate initial proposals, we design a Bi-directional Parallel LSTMs to extract the visual features of a video unit by considering its contextual information. Therefore, the prediction of temporal boundaries and actionness scores will be more accurate because it knows what happened in the past and what will happen in the future; and 2) To refine the initial proposals, we design an action-attention based reranking network which considers both surrounding proposal and initial actionness scores to assign true action proposals with high confidence scores. Extensive experiments are conducted on two challenging datasets for both temporal action proposal generation and detection tasks, demonstrating the effectiveness of the proposed approach. In particular, on THUMOS'14 dataset, our method significantly surpasses state-of-the-art methods by 7.73% on AR@50. Our code is released at: https://github.com/Rheelt/TAPG. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：9

共 50 条

[41] Towards efficient video-based action recognition: context-aware memory attention network
Thean Chun Koh
Chai Kiat Yeo
Xuan Jing
Sunil Sivadas
SN Applied Sciences, 2023, 5
[42] Temporal Context-Aware Representation Learning for Question Routing
Zhang, Xuchao
Cheng, Wei
Zong, Bo
Chen, Yuncong
Xu, Jianwu
Li, Ding
Chen, Haifeng
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 753 - 761
[43] Temporal context-aware task recommendation in crowdsourcing systems
Yuen, Man-Ching
King, Irwin
Leung, Kwong-Sak
KNOWLEDGE-BASED SYSTEMS, 2021, 219
[44] Context-Aware Neural Model for Temporal Information Extraction
Meng, Yuanliang
Rumshisky, Anna
PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 527 - 536
[45] Temporal context-aware motion-saliency detection
Xu, Mengxi
Wu, Xiaobin
Ma, Zhizhong
Wang, Ruili
Lu, Huimin
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (01)
[46] Research on temporal representation and reasoning for context-aware computing
Liu, Dong
Meng, Xiangwu
Chen, Junliang
Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): : 342 - 347
[47] Designing context-aware interaction: An action research study
Olsson, CM
Henfridsson, O
DESIGNING UBIQUITOUS INFORMATION ENVIRONMENTS: SOCIO-TECHNICAL ISSUES AND CHALLENGES, 2005, 185 : 233 - 247
[48] Improve Temporal Action Proposals using Hierarchical Context
Liu, Qinying
Wang, Zilei
Rong, Shenghai
PATTERN RECOGNITION, 2023, 140
[49] Context-aware joint Video Summarization and Streaming (CVSS) Approach
Farouk, Hesham
ElDahshan, Kamal A.
Abozeid, Amr
PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 597 - 602
[50] Context-Aware Handover for Voice and Video Applications in WiMax/WiFi
Akkari, Nadine
Al Hazmi, Hanan
WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,

← 1 2 3 4 5 →