Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

被引:0
|
作者
Nguyen Thien Chuong [1 ]
Chaloupka, Josef [1 ]
机构
[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic
关键词
Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents our research on visual feature extraction with some special treatment for dealing with Vietnamese language. The effect of linear discriminant analysis (LDA) when training with different sets of basic class will be examined. For improving the visual features, we proposed two types of visual front end for automatic lip-reading: (a) 1-Stage LDA visual front end; and (b) hierarchical LDA (HLDA) visual front end. We also compare four different types of visual feature on an isolated word visual only speech recognition of Vietnamese task using our recorded audio-visual speech database. Experiments on our database show that the proposed visual front end improves up to 8% of recognition accuracy and the HLDA visual front end outperform the other.
引用
收藏
页码:459 / 463
页数:5
相关论文
共 50 条
  • [21] Shape Feature Analysis for Visual Speech and Speaker Recognition
    Gui, Jiaping
    Wang, Shilin
    APPLIED INFORMATICS AND COMMUNICATION, PT III, 2011, 226 : 167 - 174
  • [22] Visual speech recognition with loosely synchronized feature streams
    Saenko, K
    Livescu, K
    Siracusa, M
    Wilson, K
    Glass, J
    Darrell, T
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1424 - 1431
  • [23] Improved Lip Contour Extraction For Visual Speech Recognition
    Chalamala, Srinivasa Rao
    Gudla, Balakrishna
    Yegnanarayana, B.
    Sheela, Anitha K.
    2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2015, : 459 - 462
  • [24] VISUAL-ONLY RECOGNITION OF NORMAL, WHISPERED AND SILENT SPEECH
    Petridis, Stavros
    Shen, Jie
    Cetin, Doruk
    Pantic, Maja
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6219 - 6223
  • [25] VISEME DEFINITIONS COMPARISON FOR VISUAL-ONLY SPEECH RECOGNITION
    Cappelletta, Luca
    Harte, Naomi
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2109 - 2118
  • [26] APPEARANCE FEATURE EXTRACTION VERSUS IMAGE TRANSFORM-BASED APPROACH FOR VISUAL SPEECH RECOGNITION
    Sagheer, Alaa
    Tsuruta, Naoyuki
    Taniguchi, Rin-Ichiro
    Maeda, Sakashi
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (01) : 101 - 122
  • [27] Isolated Speech Recognition and Its Transformation in Visual Signs
    Qaisar, Saeed Mian
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2019, 14 (02) : 955 - 964
  • [28] Isolated Speech Recognition and Its Transformation in Visual Signs
    Saeed Mian Qaisar
    Journal of Electrical Engineering & Technology, 2019, 14 : 955 - 964
  • [29] Visual Word Recognition
    Protopapas, Athanassios
    INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2014, 49 (02) : 273 - 274
  • [30] VISUAL WORD RECOGNITION
    LUKATELA, G
    PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 1991, 53 (01): : 1 - 2