Crowded pose-guided multi-task learning for instance-level human parsing

被引：1

作者：

Wei, Yong ^{[1
]}

Liu, Li ^{[1
,2
]}

Fu, Xiaodong ^{[1
,2
]}

Liu, LiJun ^{[1
,2
]}

Peng, Wei ^{[1
,2
]}

机构：

[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China

[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Comp Technol Applicat Key Lab Yunnan Prov, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China

来源：

MACHINE VISION AND APPLICATIONS | 2023年 / 34卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Instance-level human parsing; Multi-task learning; Pose estimation; Semantic features; Hierarchical association;

D O I：

10.1007/s00138-023-01392-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic label and associate each label with the corresponding instance simultaneously, a new top-down method based on multi-task learning guided by crowded pose estimation is proposed to learn instance-level human semantic part information. Firstly, we introduce a path attention feature pyramid to learn more robust multi-scale shared semantic features by changing the feature propagation to concatenation and increasing channel attention at each layer in order to solve the problem of complex background. Secondly, by improving the learned shared features via spatial attention and RC-ASPP, we design an instance-agnostic human parsing module to learn body part segmentation and edge information. In addition, we design a Mask-RCNN-based crowded pose estimation module that uses D-SPPE and hierarchical association rules to obtain pose information. Finally, we define fusion strategy and multi-task learning loss to fuse different semantic features and instance features, which can learn the final instance-level human parsing results in an end-to-end manner. Extensive experimental results on PASCAL-Person-Part and MHPv2.0 dataset verify the effectiveness of our proposed method that outperforms most of state-of-the-art methods.

引用

页数：15

共 50 条

[31] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network
Li, Sijin
Liu, Zhi-Qiang
Chan, Antoni B.
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 488 - +
[32] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network
Li, Sijin
Liu, Zhi-Qiang
Chan, Antoni B.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 19 - 36
[33] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network
Sijin Li
Zhi-Qiang Liu
Antoni B. Chan
International Journal of Computer Vision, 2015, 113 : 19 - 36
[34] HUMAN PARSING BASED ALIGNMENT WITH MULTI-TASK LEARNING FOR OCCLUDED PERSON RE-IDENTIFICATION
Huang, Houjing
Chen, Xiaotang
Huang, Kaiqi
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[35] Deep Convolutional Neural Networks for Multi-Instance Multi-Task Learning
Zeng, Tao
Ji, Shuiwang
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 579 - 588
[36] Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning
Fang, Yuchun
Cai, Sirui
Cao, Yiting
Li, Zhengchen
Zhang, Zhaoxiang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6946 - 6957
[37] Multi-task Forest for Human Pose Estimation in Depth Images
Lallemand, Joe
Pauly, Olivier
Schwarz, Loren
Tan, David
Ilic, Slobodan
2013 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2013), 2013, : 271 - 278
[38] Guided Learning: A New Paradigm for Multi-task Classification
Fu, Jingru
Zhang, Lei
Zhang, Bob
Jia, Wei
BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 239 - 246
[39] Semantic parsing of IT operation and maintenance service requirements based on multi-task learning
Xu M.
Liu Z.
Wang C.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (02): : 673 - 683
[40] Multi-task learning in conditional random fields for chunking in shallow semantic parsing
Bejing University of Posts and Telecommunications, Beijing, 100876, China
不详
PACLIC 23 - Proc. 23rd Pacific Asia Conf. Lang. Inf. Comput., 2009, (180-189):

← 1 2 3 4 5 →