PART-PRESERVING POSE MANIPULATION FOR PERSON IMAGE SYNTHESIS

被引:7
|
作者
Dong, Haoye [1 ,2 ]
Liang, Xiaodan [3 ]
Zhou, Chenxing [1 ,2 ]
Lai, Hanjiang [1 ,2 ]
Zhu, Jia [4 ]
Yin, Jian [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[2] Guangdong Key Lab Big Data Anal & Proc, Guangzhou 510006, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou, Guangdong, Peoples R China
[4] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Person Image Synthesis; Generative Adversarial Network; Human Parsing;
D O I
10.1109/ICME.2019.00215
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Manipulating person images under diverse poses, which transfers a person from one pose to another desired pose, is an interesting yet challenging task due to large non-rigid spatial deformation. Most existing works fail to preserve the fine-grained appearance consistency along with the pose changes due to the lack of explicit constraints and spatial modeling, leading to unrealistic results with severe artifacts. In this paper, we propose a novel Part-Preserving Generative Adversarial Network (PP-GAN) to achieve good manipulation quality by explicitly enforcing rich structure constraints over generative modeling. PP-GAN is proposed to decompose the challenging spatial transformation of the whole body into fine-grained part-level transformations, which are then integrated via human joint structure constraint. Given arbitrary poses, PP-GAN integrates human joint structure and region-level part cues as inputs to perform explicit generative modeling. Besides, we introduce a parsing-consistent loss to enforce semantic consistency among images with diverse poses, which guides the image synthesis from a semantic perspective. Extensive qualitative and quantitative evaluations on two benchmarks show that our PP-GAN significantly outperforms the state-of-the-art baselines in generating more realistic and plausible image synthesis results. PP-GAN successfully preserves part-level characteristics even for most challenging pose changes while prior works are easy to fail.
引用
收藏
页码:1234 / 1239
页数:6
相关论文
共 50 条
  • [31] Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation
    Ren, Yurui
    Li, Ge
    Liu, Shan
    Li, Thomas H.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8622 - 8635
  • [32] Object Recognition and Full Pose Registration from a Single Image for Robotic Manipulation
    Collet, Alvaro
    Berenson, Dmitry
    Srinivasa, Siddhartha S.
    Ferguson, Dave
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3534 - +
  • [33] UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer
    Cheong, Soon Yau
    Mustafa, Armin
    Gilbert, Andrew
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4175 - 4184
  • [34] StylePart: image-based shape part manipulation
    Shen, I. -Chao
    Su, Li-Wen
    Wu, Yu-Ting
    Chen, Bing-Yu
    VISUAL COMPUTER, 2025, 41 (01): : 67 - 78
  • [35] Inter-image Contrastive Consistency for Multi-Person Pose Estimation
    Xu, Xixia
    Gao, Yingguo
    Pan, Xingjia
    Yan, Ke
    Chen, Xiaoyu
    Zou, Qi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3063 - 3071
  • [36] Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation
    Tang, Jilin
    Yuan, Yi
    Shao, Tianjia
    Liu, Yong
    Wang, Mengmeng
    Zhou, Kun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2656 - 2664
  • [37] Recognizing Context for Privacy Preserving of First Person Vision Image Sequences
    Battiato, Sebastiano
    Farinella, Giovanni Maria
    Napoli, Christian
    Nicotra, Gabriele
    Riccobene, Salvatore
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2017), PT II, 2017, 10485 : 580 - 590
  • [38] Exploring Dual-task Correlation for Pose Guided Person Image Generation
    Zhang, Pengze
    Yang, Lingxiao
    Lai, Jianhuang
    Xie, Xiaohua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7703 - 7712
  • [39] POSE GUIDED PERSON IMAGE GENERATION WITH HIDDEN P-NORM REGRESSION
    Hu, Ting-Yao
    Hauptmann, Alexander G.
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2423 - 2427
  • [40] Human Pose Manipulation and Novel View Synthesis using Differentiable Rendering
    Rochette, Guillaume
    Russell, Chris
    Bowden, Richard
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,