Actor-Centric Tubelets for Real-Time Activity Detection in Extended Videos

被引:1
|
作者
Mavroudi, Effrosyni [1 ]
Bindal, Prashast [1 ]
Vidal, Rene [1 ]
机构
[1] Johns Hopkins Univ, Math Inst Data Sci, Baltimore, MD 21218 USA
关键词
RECOGNITION;
D O I
10.1109/WACVW54805.2022.00023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of detecting human and vehicle activities in long, untrimmed surveillance videos that capture a large field of view. Most existing activity detection approaches are designed for recognizing atomic human actions performed in the foreground. Therefore, they are not suitable for detecting activities in extended videos, which contain multiple actors performing co-occurring, complex activities with extreme spatio-temporal scale variations. In this paper, we propose a modular, actor-centric framework for real-time activity detection in extended videos. In particular, we decompose an extended video into a collection of smaller actor-centric tubelets of interest. Each tubelet is a video sub-volume associated with an actor and includes adaptive visual context for recognizing the actor's activities. Once these tubelets are extracted via an object-detection-based approach, we are able to detect activities in each tubelet by focusing on the actor situated in its foreground. To accurately detect the activities of a tubelet's actor we take into account the interactions with other detected actors and objects within the tubelet. We encode such interactions with a dynamic visual spatio-temporal graph and process it with a Graph Neural Network that yields context-aware actor representations. We validate our activity detection framework on the MEVA (Multiview Extended Video with Activities) dataset and the ActEV 2021 Sequestered Data Leaderboard and demonstrate its effectiveness in terms of speed and performance.
引用
收藏
页码:172 / 181
页数:10
相关论文
共 50 条
  • [1] Effective actor-centric human-object interaction detection
    Xu, Kunlun
    Li, Zhimin
    Zhang, Zhijun
    Dong, Leizhen
    Xu, Wenhui
    Yan, Luxin
    Zhong, Sheng
    Zou, Xu
    IMAGE AND VISION COMPUTING, 2022, 121
  • [2] An Actor-centric Causality Graph for Asynchronous Temporal Inference in Group Activity
    Xie, Zhao
    Gao, Tian
    Wu, Kewei
    Chang, Jiao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6652 - 6661
  • [3] Real-time Weapon Detection in Videos
    Nazeem, Ahmed
    Bei, Xinzhu
    Chen, Ruobing
    Shrivastava, Shreyas
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 497 - 504
  • [4] Real-time Detection of Activities in Untrimmed Videos
    Gleason, Joshua
    Castillo, Carlos D.
    Chellappa, Rama
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2020, : 117 - 125
  • [5] REAL-TIME DOCUMENT DETECTION IN SMARTPHONE VIDEOS
    Puybareau, Elodie
    Geraud, Thierry
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1498 - 1502
  • [6] Real-time Detection of Human Body in Videos
    Smirg, Ondrej
    Smekal, Zdenek
    2012 35TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2012, : 784 - 788
  • [7] Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos
    Rizve, Mamshad Nayeem
    Demir, Ugur
    Tirupattur, Praveen
    Rana, Aayush Jung
    Duarte, Kevin
    Dave, Ishan
    Rawat, Yogesh Singh
    Shah, Mubarak
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4237 - 4244
  • [8] Real-time and accurate abnormal behavior detection in videos
    Zheyi Fan
    Jianyuan Yin
    Yu Song
    Zhiwen Liu
    Machine Vision and Applications, 2020, 31
  • [9] Robust real-time pedestrian detection in surveillance videos
    Domonkos Varga
    Tamás Szirányi
    Journal of Ambient Intelligence and Humanized Computing, 2017, 8 : 79 - 85
  • [10] Robust real-time pedestrian detection in surveillance videos
    Varga, Domonkos
    Sziranyi, Tamas
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (01) : 79 - 85