Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation

被引:0
|
作者
Bozkurt, Elif [1 ]
Erdem, Cigdem Eroglu [1 ]
Erzin, Engin [2 ]
Erdem, T. [1 ]
Oezkan, Mehmet [1 ]
机构
[1] TUBITAK MAM TEKSEB, A-205, Gebze, Kocaeli, Turkey
[2] Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkey
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural looking lip animation, synchronized with incoming speech, is essential for realistic character animation. In this work, we evaluate the performance of phone and viseme based acoustic units, with and without context information, for generating realistic lip synchronization using HMM based recognition systems. We conclude via objective evaluations that utilization of viseme based units with context information outperforms the other methods.
引用
收藏
页码:422 / +
页数:2
相关论文
共 50 条
  • [31] EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
    Li, Hao
    Kang, Yongguo
    Wang, Zhenyu
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3077 - 3081
  • [32] Developing phoneme-based lip-reading sentences system for silent speech recognition
    El-Bialy, Randa
    Chen, Daqing
    Fenghour, Souheil
    Hussein, Walid
    Xiao, Perry
    Karam, Omar H.
    Li, Bo
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 129 - 138
  • [33] Dynamic mapping method based speech driven face animation system
    Yin, PR
    Tao, JH
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 755 - 763
  • [34] Comparison of Grapheme and Phoneme Based Acoustic Modeling in LVCSR Task in Slovak
    Mirilovic, Michal
    Juhar, Jozef
    Cizmar, Anton
    MULTIMODAL SIGNAL: COGNITIVE AND ALGORITHMIC ISSUES, 2009, 5398 : 242 - 247
  • [35] Speech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation
    Asami, Taichi
    Kobashikawa, Satoshi
    Masataki, Hirokazu
    Yoshioka, Osamu
    Takahashi, Satoshi
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1758 - 1761
  • [36] A comparison of grapheme and phoneme-based units for Spanish spoken term detection
    Tejedor, Javier
    Wang, Dong
    Frankel, Joe
    King, Simon
    Coias, Jose
    SPEECH COMMUNICATION, 2008, 50 (11-12) : 980 - 991
  • [37] HMM BASED SPEECH-DRIVEN 3D TONGUE ANIMATION
    Luo, Changwei
    Yu, Jun
    Li, Xian
    Zhang, Leilei
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4377 - 4381
  • [38] Phoneme Set Design Based on Integrated Acoustic and Linguistic Features for Second Language Speech Recognition
    Wang, Xiaoyun
    Kato, Tsuneo
    Yamamoto, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (04): : 857 - 864
  • [39] Simple Acoustic Features Can Explain Phoneme-Based Predictions of Cortical Responses to Speech
    Daube, Christoph
    Ince, Robin A. A.
    Gross, Joachim
    CURRENT BIOLOGY, 2019, 29 (12) : 1924 - +
  • [40] Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech
    Uluskan S.
    Sangwan A.
    Hansen J.H.L.
    International Journal of Speech Technology, 2017, 20 (4) : 799 - 811