PART-PRESERVING POSE MANIPULATION FOR PERSON IMAGE SYNTHESIS

被引：7

作者：

Dong, Haoye ^{[1
,2
]}

Liang, Xiaodan ^{[3
]}

Zhou, Chenxing ^{[1
,2
]}

Lai, Hanjiang ^{[1
,2
]}

Zhu, Jia ^{[4
]}

Yin, Jian ^{[1
,2
]}

机构：

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China

[2] Guangdong Key Lab Big Data Anal & Proc, Guangzhou 510006, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou, Guangdong, Peoples R China

[4] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Person Image Synthesis; Generative Adversarial Network; Human Parsing;

D O I：

10.1109/ICME.2019.00215

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Manipulating person images under diverse poses, which transfers a person from one pose to another desired pose, is an interesting yet challenging task due to large non-rigid spatial deformation. Most existing works fail to preserve the fine-grained appearance consistency along with the pose changes due to the lack of explicit constraints and spatial modeling, leading to unrealistic results with severe artifacts. In this paper, we propose a novel Part-Preserving Generative Adversarial Network (PP-GAN) to achieve good manipulation quality by explicitly enforcing rich structure constraints over generative modeling. PP-GAN is proposed to decompose the challenging spatial transformation of the whole body into fine-grained part-level transformations, which are then integrated via human joint structure constraint. Given arbitrary poses, PP-GAN integrates human joint structure and region-level part cues as inputs to perform explicit generative modeling. Besides, we introduce a parsing-consistent loss to enforce semantic consistency among images with diverse poses, which guides the image synthesis from a semantic perspective. Extensive qualitative and quantitative evaluations on two benchmarks show that our PP-GAN significantly outperforms the state-of-the-art baselines in generating more realistic and plausible image synthesis results. PP-GAN successfully preserves part-level characteristics even for most challenging pose changes while prior works are easy to fail.

引用

页码：1234 / 1239

页数：6

共 50 条

[31] Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation
Ren, Yurui
Li, Ge
Liu, Shan
Li, Thomas H.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8622 - 8635
[32] Object Recognition and Full Pose Registration from a Single Image for Robotic Manipulation
Collet, Alvaro
Berenson, Dmitry
Srinivasa, Siddhartha S.
Ferguson, Dave
ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3534 - +
[33] UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer
Cheong, Soon Yau
Mustafa, Armin
Gilbert, Andrew
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4175 - 4184
[34] StylePart: image-based shape part manipulation
Shen, I. -Chao
Su, Li-Wen
Wu, Yu-Ting
Chen, Bing-Yu
VISUAL COMPUTER, 2025, 41 (01): : 67 - 78
[35] Inter-image Contrastive Consistency for Multi-Person Pose Estimation
Xu, Xixia
Gao, Yingguo
Pan, Xingjia
Yan, Ke
Chen, Xiaoyu
Zou, Qi
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3063 - 3071
[36] Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation
Tang, Jilin
Yuan, Yi
Shao, Tianjia
Liu, Yong
Wang, Mengmeng
Zhou, Kun
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2656 - 2664
[37] Recognizing Context for Privacy Preserving of First Person Vision Image Sequences
Battiato, Sebastiano
Farinella, Giovanni Maria
Napoli, Christian
Nicotra, Gabriele
Riccobene, Salvatore
IMAGE ANALYSIS AND PROCESSING (ICIAP 2017), PT II, 2017, 10485 : 580 - 590
[38] Exploring Dual-task Correlation for Pose Guided Person Image Generation
Zhang, Pengze
Yang, Lingxiao
Lai, Jianhuang
Xie, Xiaohua
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7703 - 7712
[39] POSE GUIDED PERSON IMAGE GENERATION WITH HIDDEN P-NORM REGRESSION
Hu, Ting-Yao
Hauptmann, Alexander G.
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2423 - 2427
[40] Human Pose Manipulation and Novel View Synthesis using Differentiable Rendering
Rochette, Guillaume
Russell, Chris
Bowden, Richard
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,

← 1 2 3 4 5 →