Speech driven MPEG-4 based face animation via neural network

被引:0
|
作者
Chen, YQ [1 ]
Gao, W
Wang, ZQ
Zuo, L
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci, Harbin 150001, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, some clustering and machine teaming methods are combined together to learn the correspondence between speech acoustic and MPEG-4 based face animation parameters. The features of audio and image sequences can be extracted from the large recorded audio-visual database. The face animation parameter (FAP) sequences can be computed and then clustered to FAP patterns. An artificial neural network (ANN) was trained to map the linear predictive coefficients (LPC) and some prosodic features of an individual's natural speech to FAP patterns. The performance of our system shows that the proposed teaming algorithm is suitable, which can greatly improve the realism of real time face animation during speech.
引用
收藏
页码:1108 / 1113
页数:6
相关论文
共 50 条
  • [41] Text2Video: Text-driven facial animation using MPEG-4
    Rurainsky, J
    Eisert, P
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 : 492 - 500
  • [42] MPEG-4 facial animation technology: Survey, implementation, and results
    Abrantes, GA
    Pereira, F
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (02) : 290 - 305
  • [43] Learning and synthesizing MPEG-4 compatible 3-D face animation from video sequence
    Gao, W
    Chen, YQ
    Wang, R
    Shan, SG
    Jiang, DL
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (11) : 1119 - 1128
  • [44] Highly realistic MPEG-4 compliant facial animation with charisma
    School of Computing and Mathematical Sciences, Liverpool John Moores University, Byrom Street, L3 3AF, Liverpool, United Kingdom
    Proc Int Conf Comput Commun Networks ICCCN,
  • [45] Optimized MPEG-4 animation encoder for motion capture data
    Preda, Marius
    Jovanova, Blagica
    Arsov, Ivica
    Preteux, Francoise
    WEB3D 2007 - 12TH INTERNATIONAL CONFERENCE ON 3D WEB TECHNOLOGY, PROCEEDINGS, 2007, : 181 - 190
  • [46] Evaluation of neural network architectures for MPEG-4 video traffic prediction
    Abdennour, Adel
    IEEE TRANSACTIONS ON BROADCASTING, 2006, 52 (02) : 184 - 192
  • [47] Highly Realistic MPEG-4 Compliant Facial Animation with Charisma
    El Rhalibi, Abdennour
    Carter, Chris
    Cooper, Simon
    Merabti, Madjid
    2011 20TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2011,
  • [48] Virtual character within MPEG-4 Animation Framework eXtension
    Preda, M
    Preteux, F
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (07) : 975 - 988
  • [49] Inner lip feature extraction for MPEG-4 facial animation
    Wu, ZL
    Aleksic, PS
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 633 - 636
  • [50] Survey and Evaluation of MPEG-4 Based 3D Character Animation Frameworks
    Duarte, Ricardo L. Parreira
    El Rhalibi, Abdennour
    Carter, Christopher
    Merabti, Madjid
    2013 5TH INTERNATIONAL CONFERENCE ON GAMES AND VIRTUAL WORLDS FOR SERIOUS APPLICATIONS (VS-GAMES), 2013,