SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

被引:31
|
作者
Bao, Chong [1 ]
Zhang, Yinda [2 ]
Yang, Bangbang [1 ]
Fan, Tianxing [1 ]
Yang, Zesong [1 ]
Bao, Hujun [1 ]
Zhang, Guofeng [1 ]
Cui, Zhaopeng [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Zhejiang, Peoples R China
[2] Google, Mountain View, CA USA
关键词
D O I
10.1109/CVPR52729.2023.02004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories. In this paper, we present a novel semantic-driven NeRF editing approach, which enables users to edit a neural radiance field with a single image, and faithfully delivers edited novel views with high fidelity and multi-view consistency. To achieve this goal, we propose a prior-guided editing field to encode fine-grained geometric and texture editing in 3D space, and develop a series of techniques to aid the editing process, including cyclic constraints with a proxy mesh to facilitate geometric supervision, a color compositing mechanism to stabilize semantic-driven texture editing, and a feature-cluster-based regularization to preserve the irrelevant content unchanged. Extensive experiments and editing examples on both real-world and synthetic data demonstrate that our method achieves photo-realistic 3D editing using only a single edited image, pushing the bound of semantic-driven editing in 3D real-world scenes.
引用
收藏
页码:20919 / 20929
页数:11
相关论文
共 40 条
  • [21] Prior-guided GAN-based interactive airplane engine damage image augmentation method
    Huang, Rui
    Duan, Bokun
    Zhang, Yuxiang
    Fan, Wei
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (10) : 222 - 232
  • [22] Interactive Scene Flow Editing for Improved Image-based Rendering and Virtual Spacetime Navigation
    Ruhl, Kai
    Eisemann, Martin
    Hilsmann, Anna
    Eisert, Peter
    Magnor, Marcus
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 631 - 640
  • [23] AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
    Ma, Zhiyuan
    Jia, Guoli
    Zhou, Bowen
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4154 - 4161
  • [24] Spcolor: Semantic prior guided exemplar-based image colorization
    Chen, Siqi
    Zhang, Xianlin
    Wang, Mingdao
    Li, Xueming
    Zhang, Yu
    Zhang, Yue
    PATTERN RECOGNITION, 2025, 159
  • [25] PatchNet: A Patch-based Image Representation for Interactive Library-driven Image Editing
    Hu, Shi-Min
    Zhang, Fang-Lue
    Wang, Miao
    Martin, Ralph R.
    Wang, Jue
    ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (06):
  • [26] WAVELET-GUIDED ACCELERATION OF TEXT INVERSION IN DIFFUSION-BASED IMAGE EDITING
    Koo, Gwanhyeong
    Yoon, Sunjae
    Yoo, Chang D.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4380 - 4384
  • [27] Single image-based 3D scene estimation from semantic prior
    Hwang, Hyeong Jae
    Yoon, Sang Min
    ELECTRONICS LETTERS, 2015, 51 (22) : 1788 - +
  • [28] Text-Guided Multi-region Scene Image Editing Based on Diffusion Model
    Li, Ruichen
    Wu, Lei
    Wang, Changshuo
    Dong, Pei
    Li, Xin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 229 - 240
  • [29] Text-Guided Image Editing Based on Post Score for Gaining Attention on Social Media
    Watanabe, Yuto
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2024, 24 (03)
  • [30] Optimal transport-based unsupervised semantic disentanglement: A novel approach for efficient image editing in GANs
    Liu, Yunqi
    Ouyang, Xue
    Jiang, Tian
    Ding, Hongwei
    Cui, Xiaohui
    DISPLAYS, 2023, 80