DreamMotion: Space-Time Self-similar Score Distillation for Zero-Shot Video Editing

被引:0
|
作者
Jeong, Hyeonho [1 ]
Chang, Jinho [1 ]
Park, Geon Yeong [1 ]
Ye, Jong Chul [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Kim Jaechul Grad Sch AI, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea
来源
基金
新加坡国家研究基金会;
关键词
Video Editing; Diffusion Models; Score Distillation;
D O I
10.1007/978-3-031-73404-5_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the standard reverse diffusion process and initiate optimization from videos that already exhibit natural motion. Our analysis reveals that while video score distillation can effectively introduce new content indicated by target text, it can also cause significant structure and motion deviation. To counteract this, we propose to match the space-time self-similarities of the original video and the edited video during the score distillation. Thanks to the use of score distillation, our approach is model-agnostic, which can be applied for both cascaded and non-cascaded video diffusion frameworks. Through extensive comparisons with leading methods, our approach demonstrates its superiority in altering appearances while accurately preserving the original structure and motion.
引用
收藏
页码:358 / 376
页数:19
相关论文
共 38 条
  • [21] Cross-modal Self-distillation for Zero-shot Sketch-based Image Retrieval
    Tian J.-L.
    Xu X.
    Shen F.-M.
    Shen H.-T.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
  • [22] En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning
    Kong, Xia
    Gao, Zuodong
    Li, Xiaofan
    Hong, Ming
    Liu, Jun
    Wang, Chengjie
    Xie, Yuan
    Qu, Yanyun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9296 - 9305
  • [23] TEST-TIME ADAPTATION TOWARD PERSONALIZED SPEECH ENHANCEMENT: ZERO-SHOT LEARNING WITH KNOWLEDGE DISTILLATION
    Kim, Sunwoo
    Kim, Minje
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 176 - 180
  • [24] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
    Li, Weihuang
    Shen, Xi
    Li, Haolun
    Bi, Xiuli
    Liu, Bo
    Pun, Chi-Man
    Cun, Xiaodong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19218 - 19227
  • [25] On self-similar solutions of time and space fractional sub-diffusion equations
    Al-Musalhi, Fatma
    Karimov, Erkinjon
    INTERNATIONAL JOURNAL OF OPTIMIZATION AND CONTROL-THEORIES & APPLICATIONS-IJOCTA, 2021, 11 (03): : 16 - 27
  • [26] Self-similar Rayleigh-Taylor mixing with accelerations varying in time and space
    Abarzhi, Snezhana I.
    Sreenivasan, Katepalli R.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (47)
  • [27] Leveraging Self-Distillation and Disentanglement Network to Enhance Visual-Semantic Feature Consistency in Generalized Zero-Shot Learning
    Liu, Xiaoming
    Wang, Chen
    Yang, Guan
    Wang, Chunhua
    Long, Yang
    Liu, Jie
    Zhang, Zhiyuan
    ELECTRONICS, 2024, 13 (10)
  • [28] Discrete-time self-similar systems and stable distributions: Applications to VBR video modeling
    Narasimha, R
    Rao, RM
    IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (03) : 65 - 68
  • [29] Zero-Shot Self-Supervised Joint Temporal Image and Sensitivity Map Reconstruction via Linear Latent Space
    Zhang, Molin
    Xu, Junshen
    Arefeen, Yamin
    Adalsteinsson, Elfar
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1713 - 1725
  • [30] Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles
    Kim, Dahun
    Cho, Donghyeon
    Kweon, In So
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8545 - 8552