Speech driven MPEG-4 based face animation via neural network

被引：0

作者：

Chen, YQ ^{[1
]}

Gao, W

Wang, ZQ

Zuo, L

机构：

[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China

[2] Harbin Inst Technol, Dept Comp Sci, Harbin 150001, Peoples R China

来源：

ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS | 2001年 / 2195卷

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, some clustering and machine teaming methods are combined together to learn the correspondence between speech acoustic and MPEG-4 based face animation parameters. The features of audio and image sequences can be extracted from the large recorded audio-visual database. The face animation parameter (FAP) sequences can be computed and then clustered to FAP patterns. An artificial neural network (ANN) was trained to map the linear predictive coefficients (LPC) and some prosodic features of an individual's natural speech to FAP patterns. The performance of our system shows that the proposed teaming algorithm is suitable, which can greatly improve the realism of real time face animation during speech.

引用

页码：1108 / 1113

页数：6

共 50 条

[41] Text2Video: Text-driven facial animation using MPEG-4
Rurainsky, J
Eisert, P
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 : 492 - 500
[42] MPEG-4 facial animation technology: Survey, implementation, and results
Abrantes, GA
Pereira, F
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (02) : 290 - 305
[43] Learning and synthesizing MPEG-4 compatible 3-D face animation from video sequence
Gao, W
Chen, YQ
Wang, R
Shan, SG
Jiang, DL
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (11) : 1119 - 1128
[44] Highly realistic MPEG-4 compliant facial animation with charisma
School of Computing and Mathematical Sciences, Liverpool John Moores University, Byrom Street, L3 3AF, Liverpool, United Kingdom
Proc Int Conf Comput Commun Networks ICCCN,
[45] Optimized MPEG-4 animation encoder for motion capture data
Preda, Marius
Jovanova, Blagica
Arsov, Ivica
Preteux, Francoise
WEB3D 2007 - 12TH INTERNATIONAL CONFERENCE ON 3D WEB TECHNOLOGY, PROCEEDINGS, 2007, : 181 - 190
[46] Evaluation of neural network architectures for MPEG-4 video traffic prediction
Abdennour, Adel
IEEE TRANSACTIONS ON BROADCASTING, 2006, 52 (02) : 184 - 192
[47] Highly Realistic MPEG-4 Compliant Facial Animation with Charisma
El Rhalibi, Abdennour
Carter, Chris
Cooper, Simon
Merabti, Madjid
2011 20TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2011,
[48] Virtual character within MPEG-4 Animation Framework eXtension
Preda, M
Preteux, F
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (07) : 975 - 988
[49] Inner lip feature extraction for MPEG-4 facial animation
Wu, ZL
Aleksic, PS
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 633 - 636
[50] Survey and Evaluation of MPEG-4 Based 3D Character Animation Frameworks
Duarte, Ricardo L. Parreira
El Rhalibi, Abdennour
Carter, Christopher
Merabti, Madjid
2013 5TH INTERNATIONAL CONFERENCE ON GAMES AND VIRTUAL WORLDS FOR SERIOUS APPLICATIONS (VS-GAMES), 2013,

← 1 2 3 4 5 →