Actor-Centric Tubelets for Real-Time Activity Detection in Extended Videos

Cited by: 1
Authors
Mavroudi, Effrosyni [1]
Bindal, Prashast [1]
Vidal, Rene [1]
Affiliations
[1] Johns Hopkins Univ, Math Inst Data Sci, Baltimore, MD 21218 USA
Keywords
RECOGNITION;
DOI
10.1109/WACVW54805.2022.00023
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We address the problem of detecting human and vehicle activities in long, untrimmed surveillance videos that capture a large field of view. Most existing activity detection approaches are designed for recognizing atomic human actions performed in the foreground. Therefore, they are not suitable for detecting activities in extended videos, which contain multiple actors performing co-occurring, complex activities with extreme spatio-temporal scale variations. In this paper, we propose a modular, actor-centric framework for real-time activity detection in extended videos. In particular, we decompose an extended video into a collection of smaller actor-centric tubelets of interest. Each tubelet is a video sub-volume associated with an actor and includes adaptive visual context for recognizing the actor's activities. Once these tubelets are extracted via an object-detection-based approach, we are able to detect activities in each tubelet by focusing on the actor situated in its foreground. To accurately detect the activities of a tubelet's actor we take into account the interactions with other detected actors and objects within the tubelet. We encode such interactions with a dynamic visual spatio-temporal graph and process it with a Graph Neural Network that yields context-aware actor representations. We validate our activity detection framework on the MEVA (Multiview Extended Video with Activities) dataset and the ActEV 2021 Sequestered Data Leaderboard and demonstrate its effectiveness in terms of speed and performance.
Pages: 172-181
Page count: 10