Speaker Dependent Real-Time Vowel Recognition Algorithm for Lip Sync in Digital Contents

被引:0
|
作者
Hwang, Sun-Min [1 ]
Song, Bok-Hee [2 ]
Yun, Han-Hyung [1 ]
机构
[1] Korea Univ Tech & Edu, Sch CSE, Chonansi, South Korea
[2] Korea Univ Tech & Edu, Dept Ind Design Eng, Chonansi, South Korea
关键词
vowel recognition; speech analysis; lip synching; animation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Previous results of researches related speech recognition, especially vowel recognitions, can be applied for synchronizing the mouth movement with a dialogue in digital contents such as animations and e-learning contents. Since the mouth movement of objects has to be synchronized with a dialogue exactly, lip-sync is one of tedious works for animators and a time consuming work. The mismatch or artificialness between mouth shape of characters and speaking reduces the immersion in the contents. This paper proposes a new technique to automatically perform lip synching for a computer generated character to match a real speech or a dialogue using real time vowel recognition with the formant analysis. The proposed algorithm should be one of speaker dependent speech recognition since our concern is only for a certain voice actor related a certain character. The result shows that the vowels are recognized from a voice actor' speech with real time and the average of the recognition ratio is 97.3%.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Speaker Adaptive Real-Time Korean Single Vowel Recognition for an Animation Producing
    Whang, Sun-Min
    Song, Bok-Hee
    Yun, Han-Kyung
    FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 633 - 641
  • [2] A real-time lip sync system using a genetic algorithm for automatic neural network configuration
    Zoric, G
    Pandzic, IS
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 1367 - 1370
  • [3] Speaker pruning algorithm for real-time speaker identification
    Kinnunen, T
    Karpov, E
    Fränti, P
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 639 - 646
  • [4] REAL-TIME SYNTHESIS OF OPTICAL LIP PATTERNS FROM VOWEL SOUNDS
    ERBER, NP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S76 - S76
  • [5] REAL-TIME SYNTHESIS OF OPTICAL LIP SHAPES FROM VOWEL SOUNDS
    ERBER, NP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (05): : 1542 - 1544
  • [6] Design and performance evaluation of real-time fingerprint recognition for digital contents protection
    Kang, YK
    Kwon, PJ
    Kim, H
    Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1and 2, 2004, : 683 - 687
  • [7] Real-time speaker-dependent syllable recognition system of complete vocabulary of Chinese
    Chen, Tao
    Li, Changli
    Mo, Fuyuan
    Shengxue Xuebao/Acta Acustica, 1993, 18 (03): : 161 - 171
  • [8] FRAMEWORK DEVELOPMENT OF REAL-TIME LIP SYNC ANIMATION ON VISEME BASED HUMAN SPEECH
    Hoon, Loh Ngiik
    Rahman, Khairul Aidil Azlin Abd.
    Chai, Wang Yin
    JURNAL TEKNOLOGI, 2015, 75 (04): : 43 - 48
  • [9] Seeing the Sound: Multilingual Lip Sync for Real-Time Face-to-Face Translation
    Oskooei, Amirkia Rafiei
    Aktas, Mehmet S.
    Keles, Mustafa
    COMPUTERS, 2025, 14 (01)
  • [10] A study on the real-time fingerprints recognition mechanism for digital contents protection for interaction on web
    Kang, YK
    Kim, MH
    IASTED: PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, 2003, : 456 - 460