Generative Multiplane Images: Making a 2D GAN 3D-Aware

Cited by: 19

Authors:

Zhao, Xiaoming [1,2]

Ma, Fangchang [1]

Guera, David [1]

Ren, Zhile [1]

Schwing, Alexander G. [2]

Colburn, Alex [1]

Affiliations:

[1] Apple, Cupertino, CA 95014 USA

[2] Univ Illinois, Chicago, IL 60680 USA

Source:

COMPUTER VISION - ECCV 2022, PT V | 2022 / Vol. 13665

Funding:

U.S. National Science Foundation;

Keywords:

GANs; 3D-aware generation; Multiplane images;

DOI:

10.1007/978-3-031-20065-6_2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

What is really needed to make an existing 2D GAN 3D-aware? To answer this question, we modify a classical GAN, i.e., StyleGANv2, as little as possible. We find that only two modifications are absolutely necessary: 1) a multiplane image style generator branch which produces a set of alpha maps conditioned on their depth; 2) a pose-conditioned discriminator. We refer to the generated output as a 'generative multiplane image' (GMPI) and emphasize that its renderings are not only high-quality but also guaranteed to be view-consistent, which makes GMPIs different from many prior works. Importantly, the number of alpha maps can be dynamically adjusted and can differ between training and inference, alleviating memory concerns and enabling fast training of GMPIs in less than half a day at a resolution of $1024^2$. Our findings are consistent across three challenging and common high-resolution datasets, including FFHQ, AFHQv2 and MetFaces.
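
The abstract does not spell out how a set of alpha maps turns into a rendered image. As a generic illustration only (not the authors' renderer), the sketch below applies standard front-to-back "over" compositing to a single pixel of a multiplane image. The function name `composite_mpi_pixel`, the per-plane colors, and the sample values are hypothetical and chosen purely for the example.

```python
def composite_mpi_pixel(colors, alphas):
    """Front-to-back 'over' compositing of one pixel across MPI planes.

    colors: list of (r, g, b) tuples, ordered nearest plane first.
    alphas: list of opacities in [0, 1], in the same order.
    Returns the composited (r, g, b) tuple and the accumulated opacity.
    """
    if len(colors) != len(alphas):
        raise ValueError("colors and alphas must have the same length")
    out = [0.0, 0.0, 0.0]
    transmittance = 1.0  # fraction of light not yet blocked by nearer planes
    for color, alpha in zip(colors, alphas):
        weight = transmittance * alpha  # contribution of this plane
        for c in range(3):
            out[c] += weight * color[c]
        transmittance *= 1.0 - alpha
    return tuple(out), 1.0 - transmittance


# Hypothetical example: a half-transparent red plane in front of an
# opaque green plane; the blue plane behind them is fully hidden.
rgb, opacity = composite_mpi_pixel(
    colors=[(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)],
    alphas=[0.5, 1.0, 0.7],
)
print(rgb, opacity)  # (0.5, 0.5, 0.0) 1.0
```

Because the loop accepts any number of planes, the same routine works whether a pixel is described by a few alpha values or many, which is one way to picture the abstract's remark that the number of alpha maps may differ between training and inference.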


Pages: 18-35

Page count: 18

50 items in total

[1] Generative Multiplane Neural Radiance for 3D-Aware Image Generation
Kumar, Amandeep
Bhunia, Ankan Kumar
Narayan, Sanath
Cholakkal, Hisham
Anwer, Rao Muhammad
Khan, Salman
Yang, Ming-Hsuan
Khan, Fahad Shahbaz
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7354 - 7364
[2] 3D-aware Blending with Generative NeRFs
Kim, Hyunsu
Lee, Gayoung
Choi, Yunjey
Kim, Jin-Hwa
Zhu, Jun-Yan
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22849 - 22861
[3] Lifting 2D StyleGAN for 3D-Aware Face Generation
Shi, Yichun
Aggarwal, Divyansh
Jain, Anil K.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6254 - 6262
[4] 3D-aware Image Generation using 2D Diffusion Models
Xiang, Jianfeng
Yang, Jiaolong
Huang, Binbin
Tong, Xin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2383 - 2393
[5] VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Sargent, Kyle
Koh, Jing Yu
Zhang, Han
Chang, Huiwen
Herrmann, Charles
Srinivasan, Pratul
Wu, Jiajun
Sun, Deqing
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4217 - 4227
[6] A Survey on Deep Generative 3D-aware Image Synthesis
Xia, Weihao
Xue, Jing-Hao
ACM COMPUTING SURVEYS, 2024, 56 (04)
[7] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yue, Yuanwen
Das, Anurag
Engelmann, Francis
Tang, Siyu
Lenssen, Jan Eric
COMPUTER VISION - ECCV 2024, PT II, 2025, 15060 : 57 - 74
[8] pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
Chan, Eric R.
Monteiro, Marco
Kellnhofer, Petr
Wu, Jiajun
Wetzstein, Gordon
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5795 - 5805
[9] Generative Novel View Synthesis with 3D-Aware Diffusion Models
Chan, Eric R.
Nagano, Koki
Chan, Matthew A.
Bergman, Alexander W.
Park, Jeong Joon
Levy, Axel
Aittala, Miika
De Mello, Shalini
Karras, Tero
Wetzstein, Gordon
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4194 - 4206
[10] GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
Deng, Yu
Yang, Jiaolong
Xiang, Jianfeng
Tong, Xin
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10663 - 10673
