Actor-Centric Tubelets for Real-Time Activity Detection in Extended Videos

被引:1
|
作者
Mavroudi, Effrosyni [1 ]
Bindal, Prashast [1 ]
Vidal, Rene [1 ]
机构
[1] Johns Hopkins Univ, Math Inst Data Sci, Baltimore, MD 21218 USA
关键词
RECOGNITION;
D O I
10.1109/WACVW54805.2022.00023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of detecting human and vehicle activities in long, untrimmed surveillance videos that capture a large field of view. Most existing activity detection approaches are designed for recognizing atomic human actions performed in the foreground. Therefore, they are not suitable for detecting activities in extended videos, which contain multiple actors performing co-occurring, complex activities with extreme spatio-temporal scale variations. In this paper, we propose a modular, actor-centric framework for real-time activity detection in extended videos. In particular, we decompose an extended video into a collection of smaller actor-centric tubelets of interest. Each tubelet is a video sub-volume associated with an actor and includes adaptive visual context for recognizing the actor's activities. Once these tubelets are extracted via an object-detection-based approach, we are able to detect activities in each tubelet by focusing on the actor situated in its foreground. To accurately detect the activities of a tubelet's actor we take into account the interactions with other detected actors and objects within the tubelet. We encode such interactions with a dynamic visual spatio-temporal graph and process it with a Graph Neural Network that yields context-aware actor representations. We validate our activity detection framework on the MEVA (Multiview Extended Video with Activities) dataset and the ActEV 2021 Sequestered Data Leaderboard and demonstrate its effectiveness in terms of speed and performance.
引用
收藏
页码:172 / 181
页数:10
相关论文
共 50 条
  • [21] REAL-TIME DETECTION OF THE ACTIVITY OF A DOG
    Lemasson, Germain
    Lucidarme, Philippe
    Duhaut, Dominique
    NATURE INSPIRED MOBILE ROBOTICS, 2013, : 815 - 821
  • [22] Multi-actor activity detection by modeling object relationships in extended videos based on deep learning
    Zhang, Binyu
    Wan, Junfeng
    Zhao, Yanyun
    Tong, Zhihang
    Du, Yunhao
    Engineering Applications of Artificial Intelligence, 2022, 114
  • [23] Multi-actor activity detection by modeling object relationships in extended videos based on deep learning
    Zhang, Binyu
    Wan, Junfeng
    Zhao, Yanyun
    Tong, Zhihang
    Du, Yunhao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [24] A Real-Time Network for Fast Breast Lesion Detection in Ultrasound Videos
    Dai, Qian
    Lin, Junhao
    Li, Weibin
    Wang, Liansheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 40 - 50
  • [25] Weapon Detection in Real-Time CCTV Videos Using Deep Learning
    Bhatti, Muhammad Tahir
    Khan, Muhammad Gufran
    Aslam, Masood
    Fiaz, Muhammad Junaid
    IEEE ACCESS, 2021, 9 : 34366 - 34382
  • [26] Real-time Salient Object Detection Engine for High Definition Videos
    Fu, Yu-Jie
    Wu, Guan-Lin
    Chien, Shao-Yi
    2013 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION, AND TEST (VLSI-DAT), 2013,
  • [27] Real-Time Image-based Smoke Detection in Endoscopic Videos
    Leibetseder, Andreas
    Primus, Manfred Jurgen
    Petscharnig, Stefan
    Schoeffmann, Klaus
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 296 - 304
  • [28] Real-time Salient Object Detection Engine for High Definition Videos
    Fu, Yu-Jie
    Wu, Guan-Lin
    Chien, Shao-Yi
    2013 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION, AND TEST (VLSI-DAT), 2013,
  • [29] Real-Time Traffic Congestion Detection for Driver-Centric Applications
    Kisters, Philipp
    Bauer, Tim
    Posdorfer, Wolf
    Edinger, Janick
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS, ICDCSW, 2023, : 163 - 168
  • [30] Real-Time Activity Detection of Human Movement in Videos via Smartphone Based on Synthetic Training Data
    Thomanek, Rico
    Rolletschke, Tony
    Platte, Benny
    Hoesel, Claudia
    Roschke, Christian
    Manthey, Robert
    Heinzig, Manuel
    Vogel, Richard
    Zimmer, Frank
    Vodel, Matthias
    Eibl, Maximilian
    Ritter, Marc
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2020, : 160 - 164