The approach of Chinese speech triseme recognition for human mouth animation

被引:0
|
作者
Ouyang, Jianjun [1 ]
Xu, Ming [2 ]
Huang, Yunsen [2 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China
[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different from text driven and phoneme based human mouth synthesis approaches, this paper presents the novel natural speech driven mouth animation approach. To capture the context information of continuously speaking mouth shapes, the triseme based modeling technique is employed for acquiring the trisemic HMMs. To obtain the robust model parameters with the limited training data, the states tying procedure is introduced. Considering the compatibility and ambiguity issues, the visemic questions which assigned in the leaf nodes of decision tree are generated that based on the training data. With the modeled HMM parameters, the viterbi beam searching algorithm is applied to time align the trisemic sequences. Mapping the recognized trisemes to the corresponding MPEG-4 FAPs-represented mouth shapes, the speaking mouth can be finally animated through a smoothing process. In terms of the proposed evaluation criterion, the experimental results illustrate that the recognition accuracy is applicable and also the aligning speed is acceptable in human vision.
引用
收藏
页码:666 / +
页数:2
相关论文
共 50 条
  • [31] MICROPROCESSOR APPROACH TO SPEECH RECOGNITION
    SEYMOUR, J
    GATWARD, JF
    ACUSTICA, 1977, 37 (01): : 57 - 58
  • [32] AN APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
    PAY, BE
    EVANS, CR
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1981, 14 (01): : 13 - 27
  • [33] Watch Your Mouth: Silent Speech Recognition with Depth Sensing
    Wang, Xue
    Su, Zixiong
    Rekimoto, Jun
    Zhang, Yang
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [34] Integration of speech and language processing in Chinese continuous speech recognition
    ZHAO Li ZOU Cairong WU Zhenyang(Department of Radio Engineering
    Chinese Journal of Acoustics, 2002, (04) : 343 - 351
  • [35] A Non-Linguistic Approach for Human Emotion Recognition from Speech
    Spyrou, Evaggelos
    Vernikos, Ioannis
    Nikopoulou, Rozalia
    Mylonas, Phivos
    2018 9TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2018, : 244 - 248
  • [36] A coupled HMM approach to video-realistic speech animation
    Xie, Lei
    Liu, Zhi-Qiang
    PATTERN RECOGNITION, 2007, 40 (08) : 2325 - 2340
  • [37] A new approach to Chinese person names recognition based on part-of-speech detecting
    Wang, YY
    He, ZS
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2969 - 2972
  • [38] Flexible human speech recognition
    Pols, LCW
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 273 - 283
  • [39] An Approach for Improving Automatic Mouth Emotion Recognition
    Biondi, Giulio
    Franzoni, Valentina
    Gervasi, Osvaldo
    Perri, Damiano
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2019, PT I: 19TH INTERNATIONAL CONFERENCE, SAINT PETERSBURG, RUSSIA, JULY 1-4, 2019, PROCEEDINGS, PT I, 2019, 11619 : 649 - 664
  • [40] Chinese dialect speech recognition: a comprehensive survey
    Qiang Li
    Qianyu Mai
    Mandou Wang
    Mingjuan Ma
    Artificial Intelligence Review, 57