Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis

Cited by: 20
Authors
Zhang, Xuanmeng [1 ,2 ,4 ]
Zheng, Zhedong [1 ]
Gao, Daiheng [2 ]
Zhang, Bang [2 ]
Pan, Pan [2 ]
Yang, Yi [3 ]
Affiliations
[1] Univ Technol Sydney, AAII, ReLER, Sydney, NSW, Australia
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
[4] Alibaba, Hangzhou, Peoples R China
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022
DOI
10.1109/CVPR52688.2022.01790
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
3D-aware image synthesis aims to generate images of objects from multiple views by learning a 3D representation. However, one key challenge remains: existing approaches lack geometry constraints, hence usually fail to generate multi-view consistent images. To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints. By leveraging the underlying 3D geometry information of generated images, i.e., depth and camera transformation matrix, we explicitly establish stereo correspondence between views to perform multi-view joint optimization. In particular, we enforce the photometric consistency between pairs of views and integrate a stereo mixup mechanism into the training process, encouraging the model to reason about the correct 3D shape. Besides, we design a two-stage training strategy with feature-level multi-view joint optimization to improve the image quality. Extensive experiments on three datasets demonstrate that MVCGAN achieves the state-of-the-art performance for 3D-aware image synthesis.
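Illustrative note: the abstract describes using depth and a camera transformation matrix to establish stereo correspondence between two views, a photometric consistency term, and a stereo mixup step. The sketch below is not the authors' implementation. It is a minimal pure-Python reading of those ideas under stated assumptions: a pinhole camera with shared intrinsics `K = (fx, fy, cx, cy)`, a 3x4 rigid transform `T` from the primary to the auxiliary camera frame, nearest-neighbour sampling, grayscale images as nested lists, a plain L1 photometric error, and a linear blend for the mixup. All function names are hypothetical.

```python
def reproject(u, v, depth, K, T):
    """Map primary-view pixel (u, v) with known depth to auxiliary-view pixel coords.

    K: (fx, fy, cx, cy) pinhole intrinsics, assumed shared by both views.
    T: 3x4 nested list [R | t], primary-camera frame -> auxiliary-camera frame.
    Returns None when the point falls behind the auxiliary camera.
    """
    fx, fy, cx, cy = K
    # Back-project the pixel to a 3D point in the primary camera frame.
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    z = depth
    # Apply the rigid transform into the auxiliary camera frame.
    xa, ya, za = (r[0] * x + r[1] * y + r[2] * z + r[3] for r in T)
    if za <= 0:
        return None
    # Perspective projection into the auxiliary image plane.
    return fx * xa / za + cx, fy * ya / za + cy


def warp_auxiliary(aux, depth, K, T):
    """Resample the auxiliary image onto the primary view's pixel grid."""
    h, w = len(aux), len(aux[0])
    warped = [[0.0] * w for _ in range(h)]
    valid = [[False] * w for _ in range(h)]
    for v in range(h):
        for u in range(w):
            hit = reproject(u, v, depth[v][u], K, T)
            if hit is None:
                continue
            ui, vi = round(hit[0]), round(hit[1])  # nearest-neighbour sampling
            if 0 <= ui < w and 0 <= vi < h:
                warped[v][u] = aux[vi][ui]
                valid[v][u] = True
    return warped, valid


def photometric_l1(primary, warped, valid):
    """Mean absolute difference over pixels that have a valid correspondence."""
    diffs = [abs(p - q)
             for prow, wrow, mrow in zip(primary, warped, valid)
             for p, q, m in zip(prow, wrow, mrow) if m]
    return sum(diffs) / len(diffs) if diffs else 0.0


def stereo_mixup(primary, warped, valid, alpha):
    """Assumed mixup: blend primary and warped images where a correspondence exists."""
    return [[alpha * p + (1.0 - alpha) * q if m else p
             for p, q, m in zip(prow, wrow, mrow)]
            for prow, wrow, mrow in zip(primary, warped, valid)]
```

In a training loop, the photometric term would be added to the generator loss and the mixed image would be passed to the discriminator; the exact loss form and weighting used by MVCGAN are not given in this record.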
Pages: 18429-18438
Page count: 10
Related papers
50 in total
  • [21] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
    Shu, Dong Wook
    Jang, Wonbeom
    Yoo, Heebin
    Shin, Hong-Chang
    Kwon, Junseok
    MACHINE VISION AND APPLICATIONS, 2022, 33 (01)
  • [22] Deep-plane sweep generative adversarial network for consistent multi-view depth estimation
    Dong Wook Shu
    Wonbeom Jang
    Heebin Yoo
    Hong-Chang Shin
    Junseok Kwon
    Machine Vision and Applications, 2022, 33
  • [23] 3D-Aware Semantic-Guided Generative Model for Human Synthesis
    Zhang, Jichao
    Sangineto, Enver
    Tang, Hao
    Siarohin, Aliaksandr
    Zhong, Zhun
    Sebe, Nicu
    Wang, Wei
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 339 - 356
  • [24] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
    Shi, Zifan
    Xu, Yinghao
    Shen, Yujun
    Zhao, Deli
    Chen, Qifeng
    Yeung, Dit-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [25] Learning 3D-aware Image Synthesis with Unknown Pose Distribution
    Shi, Zifan
    Shen, Yujun
    Xu, Yinghao
    Peng, Sida
    Liao, Yiyi
    Guo, Sheng
    Chen, Qifeng
    Yeung, Dit-Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13062 - 13071
  • [26] A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
    Pan, Xingang
    Xu, Xudong
    Loy, Chen Change
    Theobalt, Christian
    Dai, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] A Co-Attention Method Based on Generative Adversarial Networks for Multi-view Images
    Huang, Qi-Xian
    Shi, Shu-Pei
    Lin, Guo-Shiang
    Shen, Day-Fann
    Sun, Hung-Min
    22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 171 - 173
  • [28] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
    Do, Hoseok
    Yoo, EunKyung
    Kim, Taehyeong
    Lee, Chul
    Choi, Tin Young
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
  • [29] 3D-Aware Multi-Class Image-to-Image Translation with NeRFs
    Li, Senmao
    van de Weijer, Joost
    Wang, Yaxing
    Khan, Fahad Shahbaz
    Liu, Meiqin
    Yang, Jian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12652 - 12662
  • [30] Adversarial Multi-view Networks for Activity Recognition
    Bai, Lei
    Yao, Lina
    Wang, Xianzhi
    Kanhere, Salil S.
    Guo, Bin
    Yu, Zhiwen
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02):