DreamMotion: Space-Time Self-similar Score Distillation for Zero-Shot Video Editing

被引：0

作者：

Jeong, Hyeonho ^{[1
]}

Chang, Jinho ^{[1
]}

Park, Geon Yeong ^{[1
]}

Ye, Jong Chul ^{[1
,2
]}

机构：

[1] Korea Adv Inst Sci & Technol, Kim Jaechul Grad Sch AI, Daejeon, South Korea

[2] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea

来源：

COMPUTER VISION - ECCV 2024, PT XXX | 2025年 / 15088卷

基金：

新加坡国家研究基金会;

关键词：

Video Editing; Diffusion Models; Score Distillation;

D O I：

10.1007/978-3-031-73404-5_21

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the standard reverse diffusion process and initiate optimization from videos that already exhibit natural motion. Our analysis reveals that while video score distillation can effectively introduce new content indicated by target text, it can also cause significant structure and motion deviation. To counteract this, we propose to match the space-time self-similarities of the original video and the edited video during the score distillation. Thanks to the use of score distillation, our approach is model-agnostic, which can be applied for both cascaded and non-cascaded video diffusion frameworks. Through extensive comparisons with leading methods, our approach demonstrates its superiority in altering appearances while accurately preserving the original structure and motion.

引用

页码：358 / 376

页数：19

共 38 条

[11] Collapsing perfect fluid in self-similar five dimensional space-time and cosmic censorship
Ghosh, SG
Sarwe, SB
Saraykar, RV
PHYSICAL REVIEW D, 2002, 66 (08):
[12] Self-similar stochastic models with stationary increments for symmetric space-time fractional diffusion
Pagnini, Gianni
2014 IEEE/ASME 10TH INTERNATIONAL CONFERENCE ON MECHATRONIC AND EMBEDDED SYSTEMS AND APPLICATIONS (MESA 2014), 2014,
[13] Space-Time Distillation for Video Super-Resolution
Xiao, Zeyu
Fu, Xueyang
Huang, Jie
Cheng, Zhen
Xiong, Zhiwei
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2113 - 2122
[14] A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Li, Maomao
Li, Yu
Yang, Tianyu
Liu, Yunfei
Yue, Dongxu
Lin, Zhihui
Xu, Dong
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 7528 - 7537
[15] INFUSION: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing
Khandelwal, Anant
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3009 - 3018
[16] Data-Efficient Language-Supervised Zero-Shot Learning with Self-Distillation
Cheng, Ruizhe
Wu, Bichen
Zhang, Peizhao
Vajda, Peter
Gonzalez, Joseph E.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3113 - 3118
[17] Zero-shot test time adaptation via knowledge distillation for personalized speech denoising and dereverberation
Kim, Sunwoo
Athi, Mrudula
Shi, Guangji
Kim, Minje
Kristjansson, Trausti
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (02): : 1353 - 1367
[18] Dynamics of nested, self-similar winnerless competition in time and space
Voit, Maximilian
Meyer-Ortmanns, Hildegard
PHYSICAL REVIEW RESEARCH, 2019, 1 (02):
[19] SKZC: self-distillation and k-nearest neighbor-based zero-shot classification
Sun, Muyang
Jia, Haitao
Journal of Engineering and Applied Science, 2024, 71 (01):
[20] Self-similar and oscillating solutions of Einstein's equation and other relevant consequences of a stochastic self-similar and fractal Universe via El Naschie's ε(∞) Cantorian space-time
Iovane, G
CHAOS SOLITONS & FRACTALS, 2005, 23 (02) : 351 - 360

← 1 2 3 4 →