Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis

被引：20

作者：

Zhang, Xuanmeng ^{[1
,2
,4
]}

Zheng, Zhedong ^{[1
]}

Gao, Daiheng ^{[2
]}

Zhang, Bang ^{[2
]}

Pan, Pan ^{[2
]}

Yang, Yi ^{[3
]}

机构：

[1] Univ Technol Sydney, AAII, ReLER, Sydney, NSW, Australia

[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China

[3] Zhejiang Univ, Hangzhou, Peoples R China

[4] Alibaba, Hangzhou, Peoples R China

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.01790

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D-aware image synthesis aims to generate images of objects from multiple views by learning a 3D representation. However, one key challenge remains: existing approaches lack geometry constraints, hence usually fail to generate multi -view consistent images. To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D aware image synthesis with geometry constraints. By leveraging the underlying 3D geometry information ofgenerated images, i.e., depth and camera transformation matrix, we explicitly establish stereo correspondence between views to perform multi-view joint optimization. In particular, we enforce the photometric consistency between pairs of views and integrate a stereo mixup mechanism into the training process, encouraging the model to reason about the correct 3D shape. Besides, we design a two -stage training strategy with feature -level multi-view joint optimization to improve the image quality. Extensive experiments on three datasets demonstrate that MVCGAN achieves the state-ofthe-art performance for 3D -aware image synthesis.

引用

页码：18429 / 18438

页数：10

共 50 条

[21] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
Shu, Dong Wook
Jang, Wonbeom
Yoo, Heebin
Shin, Hong-Chang
Kwon, Junseok
MACHINE VISION AND APPLICATIONS, 2022, 33 (01)
[22] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
Dong Wook Shu
Wonbeom Jang
Heebin Yoo
Hong-Chang Shin
Junseok Kwon
Machine Vision and Applications, 2022, 33
[23] 3D-Aware Semantic-Guided Generative Model for Human Synthesis
Zhang, Jichao
Sangineto, Enver
Tang, Hao
Siarohin, Aliaksandr
Zhong, Zhun
Sebe, Nicu
Wang, Wei
COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 339 - 356
[24] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
Shi, Zifan
Xu, Yinghao
Shen, Yujun
Zhao, Deli
Chen, Qifeng
Yeung, Dit-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[25] Learning 3D-aware Image Synthesis with Unknown Pose Distribution
Shi, Zifan
Shen, Yujun
Xu, Yinghao
Peng, Sida
Liao, Yiyi
Guo, Sheng
Chen, Qifeng
Yeung, Dit-Yan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13062 - 13071
[26] A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
Pan, Xingang
Xu, Xudong
Loy, Chen Change
Theobalt, Christian
Dai, Bo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[27] A Co-Attention Method Based on Generative Adversarial Networks for Multi-view Images
Huang, Qi-Xian
Shi, Shu-Pei
Lin, Guo-Shiang
Shen, Day-Fann
Sun, Hung-Min
22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 171 - 173
[28] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
Do, Hoseok
Yoo, EunKyung
Kim, Taehyeong
Lee, Chul
Choi, Tin Young
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
[29] 3D-Aware Multi-Class Image-to-Image Translation with NeRFs
Li, Senmao
van de Weijer, Joost
Wang, Yaxing
Khan, Fahad Shahbaz
Liu, Meiqin
Yang, Jian
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12652 - 12662
[30] Adversarial Multi-view Networks for Activity Recognition
Bai, Lei
Yao, Lina
Wang, Xianzhi
Kanhere, Salil S.
Bin Guo
Yu, Zhiwen
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02):

← 1 2 3 4 5 →