Generative Multiplane Neural Radiance for 3D-Aware Image Generation

Authors
Kumar, Amandeep [1]
Bhunia, Ankan Kumar [1]
Narayan, Sanath [2]
Cholakkal, Hisham [1]
Anwer, Rao Muhammad [1, 3]
Khan, Salman [1]
Yang, Ming-Hsuan [4, 5, 6]
Khan, Fahad Shahbaz [1, 7]
Affiliations
[1] Mohamed bin Zayed Univ AI, Abu Dhabi, U Arab Emirates
[2] Technol Innovat Inst, Abu Dhabi, U Arab Emirates
[3] Aalto Univ, Espoo, Finland
[4] Univ Calif Merced, Merced, CA USA
[5] Yonsei Univ, Seoul, South Korea
[6] Google Res, Mountain View, CA USA
[7] Linkoping Univ, Linkoping, Sweden
Source
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023
DOI
10.1109/ICCV51070.2023.00679
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a method to efficiently generate 3D-aware high-resolution images that are view-consistent across multiple target views. The proposed multiplane neural radiance model, named GMNR, consists of a novel α-guided view-dependent representation (α-VdR) module for learning view-dependent information. The α-VdR module, facilitated by an α-guided pixel sampling technique, computes the view-dependent representation efficiently by learning viewing direction and position coefficients. Moreover, we propose a view-consistency loss to enforce photometric similarity across multiple views. The GMNR model can generate 3D-aware high-resolution images that are view-consistent across multiple camera poses while maintaining computational efficiency in terms of both training and inference time. Experiments on three datasets demonstrate the effectiveness of the proposed modules, leading to favorable results in terms of both generation quality and inference time compared to existing approaches. Our GMNR model generates 3D-aware images of 1024 × 1024 pixels at 17.6 FPS on a single V100. Code: https://github.com/VIROBO-15/GMNR
Pages: 7354-7364
Number of pages: 11