DreamMotion: Space-Time Self-similar Score Distillation for Zero-Shot Video Editing

被引:0
|
作者
Jeong, Hyeonho [1 ]
Chang, Jinho [1 ]
Park, Geon Yeong [1 ]
Ye, Jong Chul [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Kim Jaechul Grad Sch AI, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea
来源
基金
新加坡国家研究基金会;
关键词
Video Editing; Diffusion Models; Score Distillation;
D O I
10.1007/978-3-031-73404-5_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the standard reverse diffusion process and initiate optimization from videos that already exhibit natural motion. Our analysis reveals that while video score distillation can effectively introduce new content indicated by target text, it can also cause significant structure and motion deviation. To counteract this, we propose to match the space-time self-similarities of the original video and the edited video during the score distillation. Thanks to the use of score distillation, our approach is model-agnostic, which can be applied for both cascaded and non-cascaded video diffusion frameworks. Through extensive comparisons with leading methods, our approach demonstrates its superiority in altering appearances while accurately preserving the original structure and motion.
引用
收藏
页码:358 / 376
页数:19
相关论文
共 38 条
  • [1] A NEW SELF-SIMILAR SPACE-TIME
    CHI, LK
    JOURNAL OF MATHEMATICAL PHYSICS, 1987, 28 (07) : 1539 - 1540
  • [2] VidToMe: Video Token Merging for Zero-Shot Video Editing
    Li, Xirui
    Ma, Chao
    Yang, Xiaokang
    Yang, Ming-Hsuan
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 7486 - 7495
  • [3] Space-time analogy of self-similar intense vortices
    Vatistas, GH
    Aboelkassem, Y
    AIAA JOURNAL, 2006, 44 (04) : 912 - 917
  • [4] Even perturbations of the self-similar Vaidya space-time
    Nolan, BC
    Waters, TJ
    PHYSICAL REVIEW D, 2005, 71 (10):
  • [5] Space-time analogy of self-similar intense vortices
    Vatistas, Georgios H.
    Aboelkassem, Yasser
    AIAA Journal, 2006, 44 (04): : 912 - 917
  • [6] Self-similar space-time evolution of an initial density discontinuity
    Rekaa, V. L.
    Pecseli, H. L.
    Trulsen, J. K.
    PHYSICS OF PLASMAS, 2013, 20 (07)
  • [7] Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer
    Yatim, Danah
    Fridman, Rafail
    Bar-Tal, Omer
    Kasten, Yoni
    Dekel, Tali
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8466 - 8476
  • [8] FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
    Qi, Chenyang
    Cun, Xiaodong
    Zhang, Yong
    Lei, Chenyang
    Wang, Xintao
    Shan, Ying
    Chen, Qifeng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15886 - 15896
  • [9] A Latent Space of Stochastic Diffusion Models for Zero-Shot Image Editing and Guidance
    Wu, Chen Henry
    De la Torre, Fernando
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7344 - 7353
  • [10] WAVE: Warping DDIM Inversion Features for Zero-Shot Text-to-Video Editing
    Feng, Yutang
    Gao, Sicheng
    Bao, Yuxiang
    Wang, Xiaodi
    Han, Shumin
    Zhang, Juan
    Zhang, Baochang
    Yao, Angela
    COMPUTER VISION - ECCV 2024, PT LXXVI, 2025, 15134 : 38 - 55