Multigranularity Feature Aggregation and Cross-level Boundary Modeling for Temporal Action Detection

被引:0
|
作者
Li, Qiang [1 ,2 ]
Liu, Di [1 ,3 ]
Zu, Guang [4 ]
Li, Sen [1 ]
Sun, Hui [2 ]
Wang, Jianzhong [1 ]
机构
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun, Peoples R China
[2] Changchun Humanities & Sci Coll, Changchun, Peoples R China
[3] Northeast Elect Power Univ, Jilin, Peoples R China
[4] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal action detection; action recognition; vision transformers; TRANSFORMER;
D O I
10.1145/3712598
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents a Temporal Action Detection (TAD) method with Multigranularity (MG) feature aggregation and Cross-level Boundary Modeling (CBM). Compared with other methods, our proposed approach has the following advantages. First, different from most existing works which only consider the local temporal context, a simple and computationally efficient MG module is proposed to comprehensively extract video features in instant, local, and global temporal granularities. Second, unlike the methods that only employ the information from single feature pyramid level for action boundary regression, a CBM strategy that integrates the relative information from both the same and higher level features is designed to improve the accuracy of boundary prediction. At lastfere, benefiting from the MG module and CBM strategy, our method outperforms other state-of-the-art approaches on five challenging TAD datasets: THUMOS14, MultiTHUMOS, EPIC-KITCHENS-100, ActivityNet-1.3, and HACS. We make our code and pre-trained model publicly available CCS Concepts: center dot Computing methodologies -> Artificial intelligence; Computer vision tasks; Activity recognition and understanding
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Cross-level feature adaptive fusion network for low-light image enhancement
    Liang, Liming
    Zhu, Chenkun
    Yang, Yuan
    Li, Renjie
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (06) : 856 - 866
  • [32] Temporal Context Enhanced Feature Aggregation for Video Object Detection
    He, Fei
    Gao, Naiyu
    Li, Qiaozhe
    Du, Senyao
    Zhao, Xin
    Huang, Kaiqi
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10941 - 10948
  • [33] Cross-boundary and cross-level dynamics increase vulnerability to severe winter disasters (dzud) in Mongolia
    Fernandez-Gimenez, Maria E.
    Batkhishig, B.
    Batbuyan, B.
    GLOBAL ENVIRONMENTAL CHANGE-HUMAN AND POLICY DIMENSIONS, 2012, 22 (04): : 836 - 851
  • [34] BOUNDARY INFORMATION MATTERS MORE: ACCURATE TEMPORAL ACTION DETECTION WITH TEMPORAL BOUNDARY NETWORK
    Zhang, Tao
    Liu, Shan
    Li, Thomas
    Li, Ge
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1642 - 1646
  • [35] Temporal adaptive feature pyramid network for action detection
    Xiang, Xuezhi
    Yin, Hang
    Qiao, Yulong
    El Saddik, Abdulmotaleb
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [36] MC-Net: Multi-Scale Feature Fusion and Cross-Level Information Interaction Network for Traffic Sign Detection
    Yu, Zhongyi
    Cheng, Debo
    Zhang, Wenzhen
    Chen, Jing
    Zhang, Shichao
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 841 - 848
  • [37] Examining cross-level effects in dyadic analysis: A structural equation modeling perspective
    Wickham, Robert E.
    Macia, Kathryn S.
    BEHAVIOR RESEARCH METHODS, 2019, 51 (06) : 2629 - 2645
  • [38] Examining cross-level effects in dyadic analysis: A structural equation modeling perspective
    Robert E. Wickham
    Kathryn S. Macia
    Behavior Research Methods, 2019, 51 : 2629 - 2645
  • [39] Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation
    Su, Taiyi
    Wang, Hanli
    Wang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6090 - 6101
  • [40] Interteam Cooperation and Competition and Boundary Activities: The Cross-Level Mediation of Team Goal Orientations
    Shin, Yuhyung
    Kim, Mihee
    Hur, Won-Moo
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2019, 16 (15)