Multi-grained clip focus for skeleton-based action recognition

Cited by: 10
Authors
Qiu, Helei [1]
Hou, Biao [1]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action recognition; Skeleton; Multi-grain; Self-attention;
DOI
10.1016/j.patcog.2023.110188
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Joint-level and part-level information are both crucial for modeling actions at different granularities. In addition, the correlations between different joints across consecutive frames are very useful for skeleton-based action recognition. To capture this action information effectively, a new multi-grained clip focus network (MGCF-Net) is proposed. First, the skeleton sequence is divided into multiple clips, each containing several consecutive frames. According to the structure of the human body, each clip is then divided into several tuples. An intra-clip attention module is proposed to capture intra-clip action information. Specifically, multi-head self-attention is split into two parts that obtain relevant information at the joint level and at the part level, and the information captured by the two parts is integrated to obtain multi-grained contextual features. In addition, an inter-clip focus module captures the key information of several consecutive sub-actions, which helps to distinguish similar actions. On two large-scale benchmarks for skeleton-based action recognition, the method achieves state-of-the-art performance, which verifies its effectiveness.
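The first two steps described in the abstract (splitting the sequence into clips, then grouping each clip by body part) can be illustrated with a minimal sketch. It is not the authors' implementation: the toy 5-joint skeleton, the `PARTS` grouping, the function names, and the choice to drop trailing frames that do not fill a clip are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Dict, List, Tuple

Joint = Tuple[float, float, float]   # (x, y, z) coordinates of one joint
Frame = List[Joint]                  # one skeleton: a list of joints
Clip = List[Frame]                   # several consecutive frames

# Hypothetical body-part grouping for a toy 5-joint skeleton
# (an assumption for illustration, not taken from the paper).
PARTS: Dict[str, List[int]] = {
    "torso": [0, 1],
    "left_arm": [2],
    "right_arm": [3],
    "legs": [4],
}


def split_into_clips(sequence: List[Frame], clip_len: int) -> List[Clip]:
    """Split a skeleton sequence into non-overlapping clips of clip_len frames.

    Trailing frames that do not fill a whole clip are dropped (an assumption).
    """
    if clip_len <= 0:
        raise ValueError("clip_len must be positive")
    n_clips = len(sequence) // clip_len
    return [sequence[i * clip_len:(i + 1) * clip_len] for i in range(n_clips)]


def group_clip_into_parts(
    clip: Clip, parts: Dict[str, List[int]]
) -> Dict[str, List[Joint]]:
    """For each body part, collect its joints across all frames of the clip."""
    return {
        name: [frame[j] for frame in clip for j in idxs]
        for name, idxs in parts.items()
    }


# Toy sequence: 7 frames, 5 joints, 3-D coordinates.
sequence = [[(float(t), float(j), 0.0) for j in range(5)] for t in range(7)]
clips = split_into_clips(sequence, clip_len=3)
tuples = group_clip_into_parts(clips[0], PARTS)
```

In this sketch each part-level tuple gathers the same joints over every frame of a clip, so an attention module applied afterwards could relate joints both within a frame and between consecutive frames; how MGCF-Net actually forms and attends over its tuples is described in the paper itself.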
Pages: 9
Related papers
50 results in total
  • [41] Skeleton-based Action Recognition for Industrial Packing Process
    Chen, Zhenhui
    Hu, Haiyang
    Li, Zhongjin
    Qi, Xingchen
    Zhang, Haiping
    Hu, Hua
    Chang, Victor
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 36 - 45
  • [42] Multi-Grained Named Entity Recognition
    Xia, Congying
    Zhang, Chenwei
    Yang, Tao
    Li, Yaliang
    Du, Nan
    Wu, Xian
    Fan, Wei
    Ma, Fenglong
    Yu, Philip
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1430 - 1440
  • [43] Multi-scale skeleton adaptive weighted GCN for skeleton-based human action recognition in IoT
    Xu Weiyao
    Wu Muqing
    Zhu Jie
    Zhao Min
    APPLIED SOFT COMPUTING, 2021, 104
  • [44] Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition
    Liu, Zhendong
    Xia, Haifeng
    Guo, Tong
    Sun, Libo
    Shao, Ming
    Xia, Siyu
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [45] WAVELET-DECOUPLING CONTRASTIVE ENHANCEMENT NETWORK FOR FINE-GRAINED SKELETON-BASED ACTION RECOGNITION
    Chang, Haochen
    Chen, Jing
    Li, Yilin
    Chen, Jixiang
    Zhang, Xiaofeng
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4060 - 4064
  • [46] Multi-scale spatiotemporal topology unveiled: enhancing skeleton-based action recognition
    Chen, Hongwei
    Wang, Jianpeng
    Chen, Zexi
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [47] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
    Sun, Ning
    Leng, Ling
    Liu, Jixin
    Han, Guang
    IMAGE AND VISION COMPUTING, 2021, 109
  • [48] A HYBRID MULTI-PERSPECTIVE COMPLEMENTARY MODEL FOR HUMAN SKELETON-BASED ACTION RECOGNITION
    Li, Linze
    Zhou, Youwei
    Hu, Jiannan
    Wu, Cong
    Xu, Tianyang
    Wu, Xiao-Jun
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [49] Multi-Stream Fusion Network for Skeleton-Based Construction Worker Action Recognition
    Tian, Yuanyuan
    Liang, Yan
    Yang, Haibin
    Chen, Jiayu
    SENSORS, 2023, 23 (23)
  • [50] Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9532 - 9545