Crowded pose-guided multi-task learning for instance-level human parsing

被引:1
|
作者
Wei, Yong [1 ]
Liu, Li [1 ,2 ]
Fu, Xiaodong [1 ,2 ]
Liu, LiJun [1 ,2 ]
Peng, Wei [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Comp Technol Applicat Key Lab Yunnan Prov, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Instance-level human parsing; Multi-task learning; Pose estimation; Semantic features; Hierarchical association;
D O I
10.1007/s00138-023-01392-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic label and associate each label with the corresponding instance simultaneously, a new top-down method based on multi-task learning guided by crowded pose estimation is proposed to learn instance-level human semantic part information. Firstly, we introduce a path attention feature pyramid to learn more robust multi-scale shared semantic features by changing the feature propagation to concatenation and increasing channel attention at each layer in order to solve the problem of complex background. Secondly, by improving the learned shared features via spatial attention and RC-ASPP, we design an instance-agnostic human parsing module to learn body part segmentation and edge information. In addition, we design a Mask-RCNN-based crowded pose estimation module that uses D-SPPE and hierarchical association rules to obtain pose information. Finally, we define fusion strategy and multi-task learning loss to fuse different semantic features and instance features, which can learn the final instance-level human parsing results in an end-to-end manner. Extensive experimental results on PASCAL-Person-Part and MHPv2.0 dataset verify the effectiveness of our proposed method that outperforms most of state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Crowded pose-guided multi-task learning for instance-level human parsing
    Yong Wei
    Li Liu
    Xiaodong Fu
    LiJun Liu
    Wei Peng
    Machine Vision and Applications, 2023, 34
  • [2] Instance-level 6D pose estimation based on multi-task parameter sharing for robotic grasping
    Zhang, Liming
    Zhou, Xin
    Liu, Jiaqing
    Wang, Can
    Wu, Xinyu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Instance-level 6D pose estimation based on multi-task parameter sharing for robotic grasping
    Zhang L.
    Zhou X.
    Liu J.
    Wang C.
    Wu X.
    Scientific Reports, 14 (1)
  • [4] DEEP HASHING WITH MULTI-TASK LEARNING FOR LARGE-SCALE INSTANCE-LEVEL VEHICLE SEARCH
    Liang, Dawei
    Yan, Ke
    Wang, Yaowei
    Zeng, Wei
    Yuan, Qingsheng
    Bao, Xiuguo
    Tian, Yonghong
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [5] Knowledge enhanced multi-task learning for simultaneous optimization of human parsing and pose estimation
    Zhou, Yanghong
    Mok, P. Y.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [6] Pose-Guided Hierarchical Semantic Decomposition and Composition for Human Parsing
    Yang, Beibei
    Yu, Changqian
    Yu, Jin-Gang
    Gao, Changxin
    Sang, Nong
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1641 - 1652
  • [7] Pose-Guided Human Parsing by an AND/OR Graph Using Pose-Context Features
    Xia, Fangting
    Zhu, Jun
    Wang, Peng
    Yuille, Alan L.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3632 - 3640
  • [8] Parsing R-CNN for Instance-Level Human Analysis
    Yang, Lu
    Song, Qing
    Wang, Zhihui
    Jiang, Ming
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 364 - 373
  • [9] AIParsing: Anchor-Free Instance-Level Human Parsing
    Zhang, Sanyi
    Cao, Xiaochun
    Qi, Guo-Jun
    Song, Zhanjie
    Zhou, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5599 - 5612
  • [10] Instance-Level Human Parsing via Part Grouping Network
    Gong, Ke
    Liang, Xiaodan
    Li, Yicheng
    Chen, Yimin
    Yang, Ming
    Lin, Liang
    COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 805 - 822