Crowded pose-guided multi-task learning for instance-level human parsing

Cited by: 1
Authors
Wei, Yong [1]
Liu, Li [1, 2]
Fu, Xiaodong [1, 2]
Liu, LiJun [1, 2]
Peng, Wei [1, 2]
Affiliations
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Comp Technol Applicat Key Lab Yunnan Prov, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Instance-level human parsing; Multi-task learning; Pose estimation; Semantic features; Hierarchical association;
DOI
10.1007/s00138-023-01392-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Instance-level human parsing remains challenging because human instances can resemble the background, interact in complex ways, and appear in varied poses. To assign a semantic label to each human-related pixel and simultaneously associate each label with the corresponding instance, we propose a new top-down method based on multi-task learning guided by crowded pose estimation, which learns instance-level human semantic part information. First, to handle complex backgrounds, we introduce a path attention feature pyramid that learns more robust multi-scale shared semantic features by changing the feature propagation to concatenation and adding channel attention at each layer. Second, we design an instance-agnostic human parsing module that refines the learned shared features via spatial attention and RC-ASPP to learn body part segmentation and edge information. In addition, we design a Mask-RCNN-based crowded pose estimation module that uses D-SPPE and hierarchical association rules to obtain pose information. Finally, we define a fusion strategy and a multi-task learning loss to fuse the different semantic features and instance features, so that the final instance-level human parsing results are learned in an end-to-end manner. Extensive experiments on the PASCAL-Person-Part and MHPv2.0 datasets verify the effectiveness of the proposed method, which outperforms most state-of-the-art methods.
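The abstract mentions a multi-task learning loss but does not give its form. As a rough illustration only, a common formulation is a weighted sum of per-task losses; the sketch below assumes that form. The task names (`parsing`, `edge`, `pose`), the weights, and the function `multi_task_loss` are hypothetical and are not taken from the paper.

```python
from typing import Dict


def multi_task_loss(losses: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of per-task losses (an assumed, generic formulation)."""
    missing = set(losses) - set(weights)
    if missing:
        raise KeyError(f"no weight given for tasks: {sorted(missing)}")
    return sum(weights[name] * value for name, value in losses.items())


# Hypothetical per-task loss values for a single training step.
step_losses = {"parsing": 0.8, "edge": 0.4, "pose": 1.2}
# Hypothetical weights; the abstract does not state any.
task_weights = {"parsing": 1.0, "edge": 0.5, "pose": 0.25}

total = multi_task_loss(step_losses, task_weights)
print(f"total loss: {total:.3f}")
```

In an end-to-end setup, a single scalar of this kind would be backpropagated through the shared features so that all task branches are trained jointly.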
Pages: 15