Generative Multiplane Images: Making a 2D GAN 3D-Aware

Cited by: 19
Authors
Zhao, Xiaoming [1, 2]
Ma, Fangchang [1]
Guera, David [1]
Ren, Zhile [1]
Schwing, Alexander G. [2]
Colburn, Alex [1]
Affiliations
[1] Apple, Cupertino, CA 95014 USA
[2] Univ Illinois, Chicago, IL 60680 USA
Source
COMPUTER VISION - ECCV 2022, PT V | 2022 / Vol. 13665
Funding
National Science Foundation (USA);
Keywords
GANs; 3D-aware generation; Multiplane images;
DOI
10.1007/978-3-031-20065-6_2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
What is really needed to make an existing 2D GAN 3D-aware? To answer this question, we modify a classical GAN, i.e., StyleGANv2, as little as possible. We find that only two modifications are absolutely necessary: 1) a multiplane image style generator branch which produces a set of alpha maps conditioned on their depth; 2) a pose-conditioned discriminator. We refer to the generated output as a 'generative multiplane image' (GMPI) and emphasize that its renderings are not only high-quality but also guaranteed to be view-consistent, which makes GMPIs different from many prior works. Importantly, the number of alpha maps can be dynamically adjusted and can differ between training and inference, alleviating memory concerns and enabling fast training of GMPIs in less than half a day at a resolution of $1024^2$. Our findings are consistent across three challenging and common high-resolution datasets, including FFHQ, AFHQv2 and MetFaces.
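The abstract does not spell out how a set of alpha maps is turned into an image. As a rough illustration only, the sketch below assumes the standard multiplane-image "over" compositing for a fixed frontal view, with no per-plane warping to a new camera pose. The function name `composite_mpi`, the array layout, and the use of per-plane colors are assumptions made for this example and are not taken from the paper or its code.

```python
import numpy as np

def composite_mpi(colors, alphas):
    """Front-to-back over-compositing of fronto-parallel planes.

    colors: (L, H, W, 3) per-plane RGB, plane 0 nearest the camera (assumed layout).
    alphas: (L, H, W) per-plane opacity in [0, 1].
    Returns the (H, W, 3) composited image and the (H, W) accumulated opacity.
    """
    colors = np.asarray(colors, dtype=np.float64)
    alphas = np.asarray(alphas, dtype=np.float64)
    # Transmittance reaching plane i is the product of (1 - alpha) over all nearer planes.
    through = np.cumprod(1.0 - alphas, axis=0)
    transmittance = np.concatenate([np.ones_like(alphas[:1]), through[:-1]], axis=0)
    # Each plane contributes its color weighted by its opacity and the light that reaches it.
    weights = alphas * transmittance
    image = (weights[..., None] * colors).sum(axis=0)
    return image, weights.sum(axis=0)

# Two 1x1 planes: a half-transparent red plane in front of an opaque blue plane.
colors = np.array([[[[1.0, 0.0, 0.0]]], [[[0.0, 0.0, 1.0]]]])
alphas = np.array([[[0.5]], [[1.0]]])
image, opacity = composite_mpi(colors, alphas)
```

Because the function only reduces over the leading plane axis, it accepts any number of planes, which loosely mirrors the abstract's point that the number of alpha maps may differ between training and inference.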
Pages: 18-35
Page count: 18
Related papers
50 in total
  • [1] Generative Multiplane Neural Radiance for 3D-Aware Image Generation
    Kumar, Amandeep
    Bhunia, Ankan Kumar
    Narayan, Sanath
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Khan, Salman
    Yang, Ming-Hsuan
    Khan, Fahad Shahbaz
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 7354-7364
  • [2] 3D-aware Blending with Generative NeRFs
    Kim, Hyunsu
    Lee, Gayoung
    Choi, Yunjey
    Kim, Jin-Hwa
    Zhu, Jun-Yan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 22849-22861
  • [3] Lifting 2D StyleGAN for 3D-Aware Face Generation
    Shi, Yichun
    Aggarwal, Divyansh
    Jain, Anil K.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 6254-6262
  • [4] 3D-aware Image Generation using 2D Diffusion Models
    Xiang, Jianfeng
    Yang, Jiaolong
    Huang, Binbin
    Tong, Xin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 2383-2393
  • [5] VQ3D: Learning a 3D-Aware Generative Model on ImageNet
    Sargent, Kyle
    Koh, Jing Yu
    Zhang, Han
    Chang, Huiwen
    Herrmann, Charles
    Srinivasan, Pratul
    Wu, Jiajun
    Sun, Deqing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 4217-4227
  • [6] A Survey on Deep Generative 3D-aware Image Synthesis
    Xia, Weihao
    Xue, Jing-Hao
    ACM COMPUTING SURVEYS, 2024, 56 (04)
  • [7] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
    Yue, Yuanwen
    Das, Anurag
    Engelmann, Francis
    Tang, Siyu
    Lenssen, Jan Eric
    COMPUTER VISION - ECCV 2024, PT II, 2025, 15060: 57-74
  • [8] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
    Chan, Eric R.
    Monteiro, Marco
    Kellnhofer, Petr
    Wu, Jiajun
    Wetzstein, Gordon
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 5795-5805
  • [9] Generative Novel View Synthesis with 3D-Aware Diffusion Models
    Chan, Eric R.
    Nagano, Koki
    Chan, Matthew A.
    Bergman, Alexander W.
    Park, Jeong Joon
    Levy, Axel
    Aittala, Miika
    De Mello, Shalini
    Karras, Tero
    Wetzstein, Gordon
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 4194-4206
  • [10] GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
    Deng, Yu
    Yang, Jiaolong
    Xiang, Jianfeng
    Tong, Xin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022: 10663-10673