SPEECH-DRIVEN FACIAL ANIMATION USING POLYNOMIAL FUSION OF FEATURES

被引：0

作者：

Kefalas, Triantafyllos ^{[1
]}

Vougioukas, Konstantinos ^{[1
]}

Panagakis, Yannis ^{[2
]}

Petridis, Stavros ^{[1
,3
]}

Kossaifi, Jean ^{[1
,3
]}

Pantic, Maja ^{[1
,3
]}

机构：

[1] Imperial Coll London, Dept Comp, London, England

[2] Univ Athens, Dept Informat & Telecommun, Athens, Greece

[3] Samsung AI Ctr, Cambridge, England

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

基金：

英国工程与自然科学研究理事会;

关键词：

multiview learning; tensor factorization; deep learning; GAN; audiovisual learning;

D O I：

10.1109/icassp40776.2020.9054469

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces. Recent deep learning approaches to facial synthesis rely on extracting low-dimensional representations and concatenating them, followed by a decoding step of the concatenated vector. This accounts for only first-order interactions of the features and ignores higher-order interactions. In this paper we propose a polynomial fusion layer that models the joint representation of the encodings by a higher-order polynomial, with the parameters modelled by a tensor decomposition. We demonstrate the suitability of this approach through experiments on generated videos evaluated on a range of metrics on video quality, audiovisual synchronisation and generation of blinks.

引用

页码：3487 / 3491

页数：5

共 50 条

[1] Expressive speech-driven facial animation
Cao, Y
Tien, WC
Faloutsos, P
Pighin, F
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (04): : 1283 - 1302
[2] Speech-driven facial animation using a hierarchical model
Cosker, DP
Marshall, AD
Rosin, PL
Hicks, YA
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (04): : 314 - 321
[3] Speech-driven facial animation with realistic dynamics
Gutierrez-Osuna, R
Kakumanu, PK
Esposito, A
Garcia, ON
Bojorquez, A
Castillo, JL
Rudomin, I
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (01) : 33 - 42
[4] Realistic Speech-Driven Facial Animation with GANs
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
International Journal of Computer Vision, 2020, 128 : 1398 - 1413
[5] Speech-Driven Facial Animation Using Manifold Relevance Determination
Dawood, Samia
Hicks, Yulia
Marshall, David
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 869 - 882
[6] Realistic Speech-Driven Facial Animation with GANs
Vougioukas, Konstantinos
Petridis, Stavros
Pantic, Maja
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) : 1398 - 1413
[7] REALTIME SPEECH-DRIVEN FACIAL ANIMATION USING GAUSSIAN MIXTURE MODELS
Luo, Changwei
Yu, Jun
Li, Xian
Wang, Zengfu
2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,
[8] A comparison of acoustic coding models for speech-driven facial animation
Kakumanu, Praveen
Esposito, Anna
Garcia, Oscar N.
Gutierrez-Osuna, Ricardo
SPEECH COMMUNICATION, 2006, 48 (06) : 598 - 615
[9] Towards Realistic Real Time Speech-Driven Facial Animation
Cerekovic, Aleksandra
Zoric, Goranka
Smid, Karlo
Pandzic, Igor S.
INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2008, 5208 : 476 - 478
[10] Speech-driven facial animation with spectral gathering and temporal attention
Yujin Chai
Yanlin Weng
Lvdi Wang
Kun Zhou
Frontiers of Computer Science, 2022, 16

← 1 2 3 4 5 →