Integrating Human Parsing and Pose Network for Human Action Recognition

被引:0
|
作者
Ding, Runwei [1 ]
Wen, Yuhang [2 ]
Liu, Jinfu [2 ]
Dai, Nan [3 ]
Meng, Fanyang [4 ]
Liu, Mengyuan [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Sun Yat Sen Univ, Shenzhen, Peoples R China
[3] Changchun Univ Sci & Technol, Changchun, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Action recognition; Human parsing; Human skeletons;
D O I
10.1007/978-981-99-8850-1_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human skeletons and RGB sequences are both widelyadopted input modalities for human action recognition. However, skeletons lack appearance features and color data suffer large amount of irrelevant depiction. To address this, we introduce human parsing feature map as a novel modality, since it can selectively retain spatiotemporal features of the body parts, while filtering out noises regarding outfits, backgrounds, etc. We propose an Integrating Human Parsing and Pose Network (IPP-Net) for action recognition, which is the first to leverage both skeletons and human parsing feature maps in dual-branch approach. The human pose branch feeds compact skeletal representations of different modalities in graph convolutional network to model pose features. In human parsing branch, multi-frame body-part parsing features are extracted with human detector and parser, which is later learnt using a convolutional backbone. A late ensemble of two branches is adopted to get final predictions, considering both robust keypoints and rich semantic body-part features. Extensive experiments on NTU RGB+D and NTU RGB+D 120 benchmarks consistently verify the effectiveness of the proposed IPP-Net, which outperforms the existing action recognition methods. Our code is publicly available at https://github.com/liujf69/IPPNet-Parsing.
引用
收藏
页码:182 / 194
页数:13
相关论文
共 50 条
  • [21] Does Human Action Recognition Benefit from Pose Estimation?
    Yao, Angela
    Gall, Juergen
    Fanelli, Gabriele
    Van Gool, Luc
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [22] Multi-human Parsing with Pose and Boundary Guidance
    Du, Shuncheng
    Wang, Yigang
    Wu, Zizhao
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2020, 2020, 12305 : 481 - 492
  • [23] Refined Spatial Network for Human Action Recognition
    Wu, Chunlei
    Cao, Haiwen
    Zhang, Weishan
    Wang, Leiquan
    Wei, Yiwei
    Peng, Zexin
    IEEE ACCESS, 2019, 7 : 111043 - 111052
  • [24] Histogram of oriented rectangles: A new pose descriptor for human action recognition
    Ikizler, Nazli
    Duygulu, Pinar
    IMAGE AND VISION COMPUTING, 2009, 27 (10) : 1515 - 1526
  • [25] Interactive human pose and action recognition using dynamical motion primitives
    Jenkins, Odest Chadwicke
    Serrano, German Gonzalez
    Loper, Matthew M.
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2007, 4 (02) : 365 - 385
  • [26] Pose primitive based human action recognition in videos or still images
    Thurau, Christian
    Hlavac, Vaclav
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2955 - +
  • [27] Pose-based Human Action Recognition with Extreme Gradient Boosting
    Ayumi, Vina
    PROCEEDINGS OF THE 14TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2016,
  • [28] ENHANCED TRAJECTORY-BASED ACTION RECOGNITION USING HUMAN POSE
    Papadopoulos, Konstantinos
    Antunes, Michel
    Aouada, Djamila
    Ottersten, Bjorn
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1807 - 1811
  • [29] On the Benefits of 3D Pose and Tracking for Human Action Recognition
    Rajasegaran, Jathushan
    Pavlakos, Georgios
    Kanazawa, Angjoo
    Feichtenhofer, Christoph
    Malik, Jitendra
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 640 - 649
  • [30] Human action recognition in videos with articulated pose information by deep networks
    M. Farrajota
    João M. F. Rodrigues
    J. M. H. du Buf
    Pattern Analysis and Applications, 2019, 22 : 1307 - 1318