SPEECH-DRIVEN FACIAL ANIMATION USING POLYNOMIAL FUSION OF FEATURES

Cited by: 0
Authors
Kefalas, Triantafyllos [1]
Vougioukas, Konstantinos [1]
Panagakis, Yannis [2]
Petridis, Stavros [1,3]
Kossaifi, Jean [1,3]
Pantic, Maja [1,3]
Affiliations
[1] Imperial Coll London, Dept Comp, London, England
[2] Univ Athens, Dept Informat & Telecommun, Athens, Greece
[3] Samsung AI Ctr, Cambridge, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
multiview learning; tensor factorization; deep learning; GAN; audiovisual learning;
DOI
10.1109/icassp40776.2020.9054469
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces. Recent deep learning approaches to facial synthesis rely on extracting low-dimensional representations and concatenating them, followed by a decoding step of the concatenated vector. This accounts for only first-order interactions of the features and ignores higher-order interactions. In this paper we propose a polynomial fusion layer that models the joint representation of the encodings by a higher-order polynomial, with the parameters modelled by a tensor decomposition. We demonstrate the suitability of this approach through experiments on generated videos evaluated on a range of metrics on video quality, audiovisual synchronisation and generation of blinks.
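The contrast the abstract draws, between decoding a concatenated vector (first-order terms only) and a polynomial whose higher-order weights are given by a tensor decomposition, can be illustrated with a minimal second-order sketch. This is an assumption-laden illustration, not the paper's layer: the function name `poly_fusion`, the parameter names `b`, `W1`, `C`, `U`, `V`, the restriction to order two, and the choice of a rank-R CP decomposition are all hypothetical. With x = [a; v], the second-order weight tensor is taken as W2[k][i][j] = sum_r C[k][r] U[i][r] V[j][r], so the quadratic term reduces to C ((U^T x) * (V^T x)) and the full tensor is never built.

```python
def matvec(M, x):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


def transpose(M):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*M)]


def poly_fusion(a, v, b, W1, C, U, V):
    """Hypothetical second-order polynomial fusion of two encodings.

    a, v : the two feature encodings (e.g. audio and identity), concatenated to x
    b    : bias, length o
    W1   : first-order weights, shape o x d (this alone is plain concatenation)
    C, U, V : CP factors of the second-order tensor, shapes o x R, d x R, d x R
    Returns z = b + W1 x + C ((U^T x) * (V^T x)).
    """
    x = list(a) + list(v)
    first = matvec(W1, x)                     # first-order interactions
    pu = matvec(transpose(U), x)              # R projections of x
    pv = matvec(transpose(V), x)              # R projections of x
    second = matvec(C, [p * q for p, q in zip(pu, pv)])  # pairwise interactions
    return [bk + f + s for bk, f, s in zip(b, first, second)]


def poly_fusion_dense(a, v, b, W1, C, U, V):
    """Reference version that expands the full o x d x d tensor explicitly."""
    x = list(a) + list(v)
    d, R = len(x), len(C[0])
    out = []
    for k in range(len(b)):
        z = b[k] + sum(W1[k][i] * x[i] for i in range(d))
        for i in range(d):
            for j in range(d):
                w = sum(C[k][r] * U[i][r] * V[j][r] for r in range(R))
                z += w * x[i] * x[j]
        out.append(z)
    return out


# Toy example: one output, rank 1, the quadratic term couples x[0] with x[2].
z = poly_fusion([1, 2], [3], [0.0], [[1, 1, 1]], [[1]],
                [[1], [0], [0]], [[0], [0], [1]])
print(z)  # prints [9.0]: 6 from the linear term plus 1 * 3 from the product
```

In this sketch the factorised form costs on the order of dR multiplications per output instead of d^2, which is the usual motivation for parameterising higher-order terms through a decomposition.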
Pages: 3487-3491
Number of pages: 5
Related papers
50 in total
  • [1] Expressive speech-driven facial animation
    Cao, Y
    Tien, WC
    Faloutsos, P
    Pighin, F
    ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (04): 1283-1302
  • [2] Speech-driven facial animation using a hierarchical model
    Cosker, DP
    Marshall, AD
    Rosin, PL
    Hicks, YA
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (04): 314-321
  • [3] Speech-driven facial animation with realistic dynamics
    Gutierrez-Osuna, R
    Kakumanu, PK
    Esposito, A
    Garcia, ON
    Bojorquez, A
    Castillo, JL
    Rudomin, I
    IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (01): 33-42
  • [4] Realistic Speech-Driven Facial Animation with GANs
    Konstantinos Vougioukas
    Stavros Petridis
    Maja Pantic
    International Journal of Computer Vision, 2020, 128: 1398-1413
  • [5] Speech-Driven Facial Animation Using Manifold Relevance Determination
    Dawood, Samia
    Hicks, Yulia
    Marshall, David
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914: 869-882
  • [6] Realistic Speech-Driven Facial Animation with GANs
    Vougioukas, Konstantinos
    Petridis, Stavros
    Pantic, Maja
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05): 1398-1413
  • [7] REALTIME SPEECH-DRIVEN FACIAL ANIMATION USING GAUSSIAN MIXTURE MODELS
    Luo, Changwei
    Yu, Jun
    Li, Xian
    Wang, Zengfu
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014
  • [8] A comparison of acoustic coding models for speech-driven facial animation
    Kakumanu, Praveen
    Esposito, Anna
    Garcia, Oscar N.
    Gutierrez-Osuna, Ricardo
    SPEECH COMMUNICATION, 2006, 48 (06): 598-615
  • [9] Towards Realistic Real Time Speech-Driven Facial Animation
    Cerekovic, Aleksandra
    Zoric, Goranka
    Smid, Karlo
    Pandzic, Igor S.
    INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2008, 5208: 476-478
  • [10] Speech-driven facial animation with spectral gathering and temporal attention
    Yujin Chai
    Yanlin Weng
    Lvdi Wang
    Kun Zhou
    Frontiers of Computer Science, 2022, 16