Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation

被引：0

作者：

Bozkurt, Elif ^{[1
]}

Erdem, Cigdem Eroglu ^{[1
]}

Erzin, Engin ^{[2
]}

Erdem, T. ^{[1
]}

Oezkan, Mehmet ^{[1
]}

机构：

[1] TUBITAK MAM TEKSEB, A-205, Gebze, Kocaeli, Turkey

[2] Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkey

来源：

2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Natural looking lip animation, synchronized with incoming speech, is essential for realistic character animation. In this work, we evaluate the performance of phone and viseme based acoustic units, with and without context information, for generating realistic lip synchronization using HMM based recognition systems. We conclude via objective evaluations that utilization of viseme based units with context information outperforms the other methods.

引用

页码：422 / +

页数：2

共 50 条

[31] EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
Li, Hao
Kang, Yongguo
Wang, Zhenyu
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3077 - 3081
[32] Developing phoneme-based lip-reading sentences system for silent speech recognition
El-Bialy, Randa
Chen, Daqing
Fenghour, Souheil
Hussein, Walid
Xiao, Perry
Karam, Omar H.
Li, Bo
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 129 - 138
[33] Dynamic mapping method based speech driven face animation system
Yin, PR
Tao, JH
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 755 - 763
[34] Comparison of Grapheme and Phoneme Based Acoustic Modeling in LVCSR Task in Slovak
Mirilovic, Michal
Juhar, Jozef
Cizmar, Anton
MULTIMODAL SIGNAL: COGNITIVE AND ALGORITHMIC ISSUES, 2009, 5398 : 242 - 247
[35] Speech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation
Asami, Taichi
Kobashikawa, Satoshi
Masataki, Hirokazu
Yoshioka, Osamu
Takahashi, Satoshi
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1758 - 1761
[36] A comparison of grapheme and phoneme-based units for Spanish spoken term detection
Tejedor, Javier
Wang, Dong
Frankel, Joe
King, Simon
Coias, Jose
SPEECH COMMUNICATION, 2008, 50 (11-12) : 980 - 991
[37] HMM BASED SPEECH-DRIVEN 3D TONGUE ANIMATION
Luo, Changwei
Yu, Jun
Li, Xian
Zhang, Leilei
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4377 - 4381
[38] Phoneme Set Design Based on Integrated Acoustic and Linguistic Features for Second Language Speech Recognition
Wang, Xiaoyun
Kato, Tsuneo
Yamamoto, Seiichi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (04): : 857 - 864
[39] Simple Acoustic Features Can Explain Phoneme-Based Predictions of Cortical Responses to Speech
Daube, Christoph
Ince, Robin A. A.
Gross, Joachim
CURRENT BIOLOGY, 2019, 29 (12) : 1924 - +
[40] Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech
Uluskan S.
Sangwan A.
Hansen J.H.L.
International Journal of Speech Technology, 2017, 20 (4) : 799 - 811

← 1 2 3 4 5 →