SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

Cited by: 31

Authors:

Bao, Chong [1]

Zhang, Yinda [2]

Yang, Bangbang [1]

Fan, Tianxing [1]

Yang, Zesong [1]

Bao, Hujun [1]

Zhang, Guofeng [1]

Cui, Zhaopeng [1]

Affiliations:

[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Zhejiang, Peoples R China

[2] Google, Mountain View, CA USA

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023

Keywords:

DOI:

10.1109/CVPR52729.2023.02004

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories. In this paper, we present a novel semantic-driven NeRF editing approach, which enables users to edit a neural radiance field with a single image, and faithfully delivers edited novel views with high fidelity and multi-view consistency. To achieve this goal, we propose a prior-guided editing field to encode fine-grained geometric and texture editing in 3D space, and develop a series of techniques to aid the editing process, including cyclic constraints with a proxy mesh to facilitate geometric supervision, a color compositing mechanism to stabilize semantic-driven texture editing, and a feature-cluster-based regularization to preserve the irrelevant content unchanged. Extensive experiments and editing examples on both real-world and synthetic data demonstrate that our method achieves photo-realistic 3D editing using only a single edited image, pushing the bound of semantic-driven editing in 3D real-world scenes.

Pages: 20919-20929

Page count: 11
