Text-Guided Synthesis of Eulerian Cinemagraphs

被引:0
|
作者
Mahapatra, Aniruddha [1 ]
Siarohin, Aliaksandr [2 ]
Lee, Hsin-Ying [2 ]
Tulyakov, Sergey [2 ]
Zhu, Jun-Yan [1 ]
机构
[1] Carnegie Mellon Univ, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Snap Inc, 2850 Ocean Pk Blvd, Santa Monica, CA 90405 USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2023年 / 42卷 / 06期
关键词
Cinemagraphs; Diffusion Models; Generative Adversarial Networks; IMAGE;
D O I
10.1145/3618326
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We introduce Text2Cinemagraph, a fully automated method for creating cinemagraphs from text descriptions - an especially challenging task when prompts feature imaginary elements and artistic styles, given the complexity of interpreting the semantics and motions of these images. We focus on cinemagraphs of fluid elements, such as flowing rivers, and drifting clouds, which exhibit continuous motion and repetitive textures. Existing singleimage animation methods fall short on artistic inputs, and recent text-based video methods frequently introduce temporal inconsistencies, struggling to keep certain regions static. To address these challenges, we propose an idea of synthesizing image twins from a single text prompt - a pair of an artistic image and its pixel-aligned corresponding natural-looking twin. While the artistic image depicts the style and appearance detailed in our text prompt, the realistic counterpart greatly simplifies layout and motion analysis. Leveraging existing natural image and video datasets, we can accurately segment the realistic image and predict plausible motion given the semantic information. The predicted motion can then be transferred to the artistic image to create the final cinemagraph. Our method outperforms existing approaches in creating cinemagraphs for natural landscapes as well as artistic and other-worldly scenes, as validated by automated metrics and user studies. Finally, we demonstrate two extensions: animating existing paintings and controlling motion directions using text.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Text-Guided Synthesis of Crowd Animation
    Ji, Xuebo
    Pan, Zherong
    Gao, Xifeng
    Pan, Jia
    PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
  • [2] Text-Guided Customizable Image Synthesis and Manipulation
    Zhang, Zhiqiang
    Fu, Chen
    Weng, Wei
    Zhou, Jinjia
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] Text-Guided Synthesis of Masked Face Images
    Anjali, T.
    Masilamani, V.
    ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 21 (01)
  • [4] Text-Guided Sketch-to-Photo Image Synthesis
    Osahor, Uche
    Nasrabadi, Nasser M.
    IEEE ACCESS, 2022, 10 : 98278 - 98289
  • [5] Text-Guided Image Inpainting
    Zhang, Zijian
    Zhao, Zhou
    Zhang, Zhu
    Huai, Baoxing
    Yuan, Jing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4079 - 4087
  • [6] IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
    Wu, Ronghuan
    Su, Wanchao
    Ma, Kede
    Liao, Jing
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (06):
  • [7] Benchmarking Robustness to Text-Guided Corruptions
    Mofayezi, Mohammadreza
    Medghalchi, Yasamin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 779 - 786
  • [8] Text-Guided Vector Graphics Customization
    Zhang, Peiying
    Zhao, Nanxuan
    Liao, Jing
    PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
  • [9] Text-Guided Automated Self Assessment
    Pirnay-Dummer, Pablo
    Ifenthaler, Dirk
    MULTIPLE PERSPECTIVES ON PROBLEM SOLVING AND LEARNING IN THE DIGITAL AGE, 2011, : 217 - 225
  • [10] Topology optimization with text-guided stylization
    Shengze Zhong
    Parinya Punpongsanon
    Daisuke Iwai
    Kosuke Sato
    Structural and Multidisciplinary Optimization, 2023, 66