Speaker Dependent Real-Time Vowel Recognition Algorithm for Lip Sync in Digital Contents

被引：0

作者：

Hwang, Sun-Min ^{[1
]}

Song, Bok-Hee ^{[2
]}

Yun, Han-Hyung ^{[1
]}

机构：

[1] Korea Univ Tech & Edu, Sch CSE, Chonansi, South Korea

[2] Korea Univ Tech & Edu, Dept Ind Design Eng, Chonansi, South Korea

来源：

2013 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS) | 2013年

关键词：

vowel recognition; speech analysis; lip synching; animation;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Previous results of researches related speech recognition, especially vowel recognitions, can be applied for synchronizing the mouth movement with a dialogue in digital contents such as animations and e-learning contents. Since the mouth movement of objects has to be synchronized with a dialogue exactly, lip-sync is one of tedious works for animators and a time consuming work. The mismatch or artificialness between mouth shape of characters and speaking reduces the immersion in the contents. This paper proposes a new technique to automatically perform lip synching for a computer generated character to match a real speech or a dialogue using real time vowel recognition with the formant analysis. The proposed algorithm should be one of speaker dependent speech recognition since our concern is only for a certain voice actor related a certain character. The result shows that the vowels are recognized from a voice actor' speech with real time and the average of the recognition ratio is 97.3%.

引用

页数：4

共 50 条

[1] Speaker Adaptive Real-Time Korean Single Vowel Recognition for an Animation Producing
Whang, Sun-Min
Song, Bok-Hee
Yun, Han-Kyung
FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 633 - 641
[2] A real-time lip sync system using a genetic algorithm for automatic neural network configuration
Zoric, G
Pandzic, IS
2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 1367 - 1370
[3] Speaker pruning algorithm for real-time speaker identification
Kinnunen, T
Karpov, E
Fränti, P
AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 639 - 646
[4] REAL-TIME SYNTHESIS OF OPTICAL LIP PATTERNS FROM VOWEL SOUNDS
ERBER, NP
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S76 - S76
[5] REAL-TIME SYNTHESIS OF OPTICAL LIP SHAPES FROM VOWEL SOUNDS
ERBER, NP
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (05): : 1542 - 1544
[6] Design and performance evaluation of real-time fingerprint recognition for digital contents protection
Kang, YK
Kwon, PJ
Kim, H
Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1and 2, 2004, : 683 - 687
[7] Real-time speaker-dependent syllable recognition system of complete vocabulary of Chinese
Chen, Tao
Li, Changli
Mo, Fuyuan
Shengxue Xuebao/Acta Acustica, 1993, 18 (03): : 161 - 171
[8] FRAMEWORK DEVELOPMENT OF REAL-TIME LIP SYNC ANIMATION ON VISEME BASED HUMAN SPEECH
Hoon, Loh Ngiik
Rahman, Khairul Aidil Azlin Abd.
Chai, Wang Yin
JURNAL TEKNOLOGI, 2015, 75 (04): : 43 - 48
[9] Seeing the Sound: Multilingual Lip Sync for Real-Time Face-to-Face Translation
Oskooei, Amirkia Rafiei
Aktas, Mehmet S.
Keles, Mustafa
COMPUTERS, 2025, 14 (01)
[10] A study on the real-time fingerprints recognition mechanism for digital contents protection for interaction on web
Kang, YK
Kim, MH
IASTED: PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, 2003, : 456 - 460

← 1 2 3 4 5 →