Crowded pose-guided multi-task learning for instance-level human parsing

被引:1
|
作者
Wei, Yong [1 ]
Liu, Li [1 ,2 ]
Fu, Xiaodong [1 ,2 ]
Liu, LiJun [1 ,2 ]
Peng, Wei [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Comp Technol Applicat Key Lab Yunnan Prov, 727,Jingming South Rd, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Instance-level human parsing; Multi-task learning; Pose estimation; Semantic features; Hierarchical association;
D O I
10.1007/s00138-023-01392-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance-level human parsing remains challenging due to the similarity between human instances and background, complex interactions, and various poses. Aiming at assigning each human-related pixel a semantic label and associate each label with the corresponding instance simultaneously, a new top-down method based on multi-task learning guided by crowded pose estimation is proposed to learn instance-level human semantic part information. Firstly, we introduce a path attention feature pyramid to learn more robust multi-scale shared semantic features by changing the feature propagation to concatenation and increasing channel attention at each layer in order to solve the problem of complex background. Secondly, by improving the learned shared features via spatial attention and RC-ASPP, we design an instance-agnostic human parsing module to learn body part segmentation and edge information. In addition, we design a Mask-RCNN-based crowded pose estimation module that uses D-SPPE and hierarchical association rules to obtain pose information. Finally, we define fusion strategy and multi-task learning loss to fuse different semantic features and instance features, which can learn the final instance-level human parsing results in an end-to-end manner. Extensive experimental results on PASCAL-Person-Part and MHPv2.0 dataset verify the effectiveness of our proposed method that outperforms most of state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A Novel Multi-Task Learning Framework for Semi-Supervised Semantic Parsing
    Qi, Qi
    Wang, Xiaolu
    Sun, Haifeng
    Wang, Jingyu
    Liang, Xiao
    Liaoz, Jianxin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2552 - 2560
  • [42] Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
    Zheng, Qingping
    Deng, Jiankang
    Zhu, Zheng
    Li, Ying
    Zafeiriou, Stefanos
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4146 - 4155
  • [43] Pallet Recognition with Multi-Task Learning for Automated Guided Vehicles
    Mok, Chunghyup
    Baek, Insung
    Cho, Yoon Sang
    Kim, Younghoon
    Kim, Seoung Bum
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [44] Toxicity Prediction in Cancer Using Multiple Instance Learning in a Multi-task Framework
    Li, Cheng
    Gupta, Sunil
    Rana, Santu
    Luo, Wei
    Venkatesh, Svetha
    Ashely, David
    Dinh Phung
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 152 - 164
  • [45] Multi-task, multi-domain learning: Application to semantic segmentation and pose regression
    Fourure, Damien
    Emonet, Remi
    Fromont, Elisa
    Muselet, Damien
    Neverova, Natalia
    Tremeau, Alain
    Wolf, Christian
    NEUROCOMPUTING, 2017, 251 : 68 - 80
  • [46] 3D Instance Segmentation via Multi-Task Metric Learning
    Lahoud, Jean
    Ghanem, Bernard
    Pollefeys, Marc
    Oswald, Martin R.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9255 - 9265
  • [47] Multi-Task Domain Adaptation for Deep Learning of Instance Grasping from Simulation
    Fang, Kuan
    Bai, Yunfei
    Hinterstoisser, Stefan
    Savarese, Silvio
    Kalakrishnan, Mrinal
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 3516 - 3523
  • [48] Multi-Task Learning of Confounding Factors in Pose-Based Gait Recognition
    Cosma, Adrian
    Radoi, Ion Emilian
    2020 19TH ROEDUNET CONFERENCE: NETWORKING IN EDUCATION AND RESEARCH (ROEDUNET), 2020,
  • [49] CNN based Multi-Instance Multi-Task Learning for Syndrome Differentiation of Diabetic Patients
    Wang, Zeyuan
    Poon, Josiah
    Sun, Shiding
    Poon, Simon
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1905 - 1911
  • [50] Multi-level network Lasso for multi-task personalized learning
    Wang, Jiankun
    Fei, Luhuan
    Sun, Lu
    PATTERN RECOGNITION, 2025, 161