LIA: Latent Image Animator

被引:0
|
作者
Wang, Yaohui [1 ]
Yang, Di [1 ]
Bremond, Francois [1 ]
Dantcheva, Antitza [1 ]
机构
[1] Univ Cote dAzur, Inria Ctr, 2004 Rte Lucioles, F-06902 Valbonne, France
关键词
Disentanglement; generative adversarial networks; image animation; interpretability; video generation;
D O I
10.1109/TPAMI.2024.3449075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous animation techniques mainly focus on leveraging explicit structure representations (e.g., meshes or keypoints) for transferring motion from driving videos to source images. However, such methods are challenged with large appearance variations between source and driving data, as well as require complex additional modules to respectively model appearance and motion. Towards addressing these issues, we introduce the Latent Image Animator (LIA), streamlined to animate high-resolution images. LIA is designed as a simple autoencoder that does not rely on explicit representations. Motion transfer in the pixel space is modeled as linear navigation of motion codes in the latent space. Specifically such navigation is represented as an orthogonal motion dictionary learned in a self-supervised manner based on proposed Linear Motion Decomposition (LMD). Extensive experimental results demonstrate that LIA outperforms state-of-the-art on VoxCeleb, TaichiHD, and TED-talk datasets with respect to video quality and spatio-temporal consistency. In addition LIA is well equipped for zero-shot high-resolution image animation. Code, models, and demo video are available at https://github.com/wyhsirius/LIA.
引用
收藏
页码:10829 / 10844
页数:16
相关论文
共 50 条
  • [1] LEO: Generative Latent Image Animator for Human Video Synthesis
    Wang, Yaohui
    Ma, Xin
    Chen, Xinyuan
    Chen, Cunjian
    Dantcheva, Antitza
    Dai, Bo
    Qiao, Yu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (03) : 1277 - 1289
  • [2] THE ANIMATOR AS AN ARTIST, THE ARTIST AS AN ANIMATOR. A RECAP
    Cholodenko, Alan
    CON A DE ANIMACION, 2020, (10): : 10 - +
  • [3] BEHIND THE VISIBLE IMAGE ON THE SCREEN, A INTERVIEW WITH NORSTEIN,YOURI + FILM ANIMATOR
    LAMPOLSKI, M
    NORSTEIN, Y
    POSITIF, 1985, (297): : 48 - 50
  • [4] LATENT IMAGE
    YAMAMOTO, VY
    JOURNAL OF THE BIOLOGICAL PHOTOGRAPHIC ASSOCIATION, 1967, 35 (01): : 1 - &
  • [5] LATENT IMAGE
    YAMAMOTO, VY
    JOURNAL OF THE BIOLOGICAL PHOTOGRAPHIC ASSOCIATION, 1967, 35 (04): : 145 - &
  • [6] LATENT IMAGE
    YAMAMOTO, VY
    JOURNAL OF THE BIOLOGICAL PHOTOGRAPHIC ASSOCIATION, 1967, 35 (03): : 93 - &
  • [7] LATENT IMAGE
    YAMAMOTO, VY
    JOURNAL OF THE BIOLOGICAL PHOTOGRAPHIC ASSOCIATION, 1968, 36 (01): : 1 - &
  • [8] LATENT IMAGE
    YAMAMOTO, VY
    JOURNAL OF THE BIOLOGICAL PHOTOGRAPHIC ASSOCIATION, 1967, 35 (02): : 49 - &
  • [9] Latent Image
    Ladouceur, Ben
    FIDDLEHEAD, 2016, (268): : 70 - 70
  • [10] Latent Image
    Targan, Barry
    NORTH AMERICAN REVIEW, 2011, 296 (04): : 11 - 16