3D-Aware Multi-Class Image-to-Image Translation with NeRFs

Cited by: 2

Authors:

Li, Senmao [1]

van de Weijer, Joost [2]

Wang, Yaxing [1]

Khan, Fahad Shahbaz [3,4]

Liu, Meiqin [5]

Yang, Jian [1]

Affiliations:

[1] Nankai Univ, CS, VCIP, Tianjin, Peoples R China

[2] Univ Autonoma Barcelona, Barcelona, Spain

[3] Mohamed bin Zayed Univ AI, Abu Dhabi, U Arab Emirates

[4] Linkoping Univ, Linkoping, Sweden

[5] Beijing Jiaotong Univ, Beijing, Peoples R China

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023

DOI:

10.1109/CVPR52729.2023.01217

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent advances in 3D-aware generative models (3D-aware GANs) combined with Neural Radiance Fields (NeRF) have achieved impressive results. However, no prior work has investigated 3D-aware GANs for 3D-consistent multi-class image-to-image (3D-aware I2I) translation. Naively using 2D-I2I translation methods suffers from unrealistic shape/identity change. To perform 3D-aware multi-class I2I translation, we decouple this learning process into a multi-class 3D-aware GAN step and a 3D-aware I2I translation step. In the first step, we propose two novel techniques: a new conditional architecture and an effective training strategy. In the second step, based on the well-trained multi-class 3D-aware GAN architecture, which preserves view-consistency, we construct a 3D-aware I2I translation system. To further reduce the view-consistency problems, we propose several new techniques, including a U-net-like adaptor network design, a hierarchical representation constraint, and a relative regularization loss. In extensive experiments on two datasets, quantitative and qualitative results demonstrate that we successfully perform 3D-aware I2I translation with multi-view consistency. Code is available at 3DI2I.

Pages: 12652-12662

Page count: 11
