3D-Aware Multi-Class Image-to-Image Translation with NeRFs

被引:2
|
作者
Li, Senmao [1 ]
van de Weijer, Joost [2 ]
Wang, Yaxing [1 ]
Khan, Fahad Shahbaz [3 ,4 ]
Liu, Meiqin [5 ]
Yang, Jian [1 ]
机构
[1] Nankai Univ, CS, VCIP, Tianjin, Peoples R China
[2] Univ Autonoma Barcelona, Barcelona, Spain
[3] Mohamed bin Zayed Univ AI, Abu Dhabi, U Arab Emirates
[4] Linkoping Univ, Linkoping, Sweden
[5] Beijing Jiaotong Univ, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in 3D-aware generative models (3D-aware GANs) combined with Neural Radiance Fields (NeRF) have achieved impressive results. However no prior works investigate 3D-aware GANs for 3D consistent multi-class image-to-image (3D-aware I2I) translation. Naively using 2D-I2I translation methods suffers from unrealistic shape/identity change. To perform 3D-aware multi-class I2I translation, we decouple this learning process into a multi-class 3D-aware GAN step and a 3D-aware I2I translation step. In the first step, we propose two novel techniques: a new conditional architecture and an effective training strategy. In the second step, based on the well-trained multi-class 3D-aware GAN architecture, that preserves view-consistency, we construct a 3D-aware I2I translation system. To further reduce the view-consistency problems, we propose several new techniques, including a U-net-like adaptor network design, a hierarchical representation constrain and a relative regularization loss. In extensive experiments on two datasets, quantitative and qualitative results demonstrate that we successfully perform 3D-aware I2I translation with multi-view consistency. Code is available in 3DI2I.
引用
收藏
页码:12652 / 12662
页数:11
相关论文
共 50 条
  • [21] InstaFormer plus plus : Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
    Kim, Soohyun
    Baek, Jongbeom
    Park, Jihye
    Ha, Eunjae
    Jung, Homin
    Lee, Taeyoung
    Kim, Seungryong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (04) : 1167 - 1186
  • [22] Image-to-Image Translation with Multi-Path Consistency Regularization
    Lin, Jianxin
    Xia, Yingce
    Wang, Yijun
    Qin, Tao
    Chen, Zhibo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2980 - 2986
  • [23] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
    Do, Hoseok
    Yoo, EunKyung
    Kim, Taehyeong
    Lee, Chul
    Choi, Tin Young
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
  • [24] DMDIT: Diverse multi-domain image-to-image translation
    Shao, Mingwen
    Zhang, Youcai
    Liu, Huan
    Wang, Chao
    Li, Le
    Shao, Xun
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [25] SMIT: Stochastic Multi-Label Image-to-Image Translation
    Romero, Andres
    Arbelaez, Pablo
    Van Gool, Luc
    Timofte, Radu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3285 - 3294
  • [26] Generative Multiplane Neural Radiance for 3D-Aware Image Generation
    Kumar, Amandeep
    Bhunia, Ankan Kumar
    Narayan, Sanath
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Khan, Salman
    Yang, Ming-Hsuan
    Khan, Fahad Shahbaz
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7354 - 7364
  • [27] SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2150 - 2154
  • [28] Feature-attention module for context-aware image-to-image translation
    Bai, Jing
    Chen, Ran
    Liu, Min
    VISUAL COMPUTER, 2020, 36 (10-12): : 2145 - 2159
  • [29] GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis
    Schwarz, Katja
    Liao, Yiyi
    Niemeyer, Michael
    Geiger, Andreas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
    Deng, Yu
    Yang, Jiaolong
    Xiang, Jianfeng
    Tong, Xin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10663 - 10673