Lip temporal pattern analysis for automatic visual speech recognition

被引:0
|
作者
Xie, L [1 ]
Cai, XL [1 ]
Fu, ZH [1 ]
Jiang, DM [1 ]
Zhao, RC [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
关键词
visual speech recognition; lipreading; feature extraction; lip temporal pattern;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TenipoRA1 Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.
引用
收藏
页码:703 / 706
页数:4
相关论文
共 50 条
  • [1] A hybrid approach for automatic lip localization and viseme classification to enhance visual speech recognition
    Mahdi, Walid
    Werda, Salah
    Ben Hamadou, Abdelmajid
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2008, 15 (03) : 253 - 266
  • [2] A hybrid approach for automatic lip localization and viseme classification to enhance visual speech recognition
    Multimedia Information Systems and Advanced Computing Laboratory, High Institute of Computer Science and Multimedia, University of Sfax, Sfax, Tunisia
    Integr. Comput. Aided Eng., 2008, 3 (253-266):
  • [3] Analysis of lip geometric features for audio-visual speech recognition
    Kaynak, MN
    Zhi, Q
    Cheok, AD
    Sengupta, K
    Han, Z
    Chung, KC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2004, 34 (04): : 564 - 570
  • [4] Analysis of HMM temporal evolution for automatic speech recognition and verification
    Casar, Marta
    Fonollosa, Jose A. R.
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 359 - 366
  • [5] Improved Lip Contour Extraction For Visual Speech Recognition
    Chalamala, Srinivasa Rao
    Gudla, Balakrishna
    Yegnanarayana, B.
    Sheela, Anitha K.
    2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2015, : 459 - 462
  • [6] Lip location normalized training for visual speech recognition
    Vanegas, Oscar
    Tokuda, Keiichi
    Kitamura, Tadashi
    IEICE Transactions on Information and Systems, 2000, 383 -D (11) : 1969 - 1977
  • [7] Lip location normalized training for visual speech recognition
    Vanegas, O
    Tokuda, K
    Kitamura, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (11): : 1969 - 1977
  • [8] Visual Lip Contour Detection for the Purpose of Speech Recognition
    Dalka, Piotr
    Bratoszewski, Piotr
    Czyzewski, Andrzej
    2014 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS (ICSES), 2014,
  • [9] Lip-Based Visual Speech Recognition System
    Frisky, Aufaclav Zatu Kusuma
    Wang, Chien-Yao
    Santoso, Andri
    Wang, Jia-Ching
    49TH ANNUAL IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2015, : 315 - 319
  • [10] Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification
    Casar, Marta
    Fonollosa, Jose A. R.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 613 - 616