Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

被引：0

作者：

Nguyen Thien Chuong ^{[1
]}

Chaloupka, Josef ^{[1
]}

机构：

[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic

来源：

2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2013年

关键词：

Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents our research on visual feature extraction with some special treatment for dealing with Vietnamese language. The effect of linear discriminant analysis (LDA) when training with different sets of basic class will be examined. For improving the visual features, we proposed two types of visual front end for automatic lip-reading: (a) 1-Stage LDA visual front end; and (b) hierarchical LDA (HLDA) visual front end. We also compare four different types of visual feature on an isolated word visual only speech recognition of Vietnamese task using our recorded audio-visual speech database. Experiments on our database show that the proposed visual front end improves up to 8% of recognition accuracy and the HLDA visual front end outperform the other.

引用

页码：459 / 463

页数：5

共 50 条

[21] Shape Feature Analysis for Visual Speech and Speaker Recognition
Gui, Jiaping
Wang, Shilin
APPLIED INFORMATICS AND COMMUNICATION, PT III, 2011, 226 : 167 - 174
[22] Visual speech recognition with loosely synchronized feature streams
Saenko, K
Livescu, K
Siracusa, M
Wilson, K
Glass, J
Darrell, T
TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1424 - 1431
[23] Improved Lip Contour Extraction For Visual Speech Recognition
Chalamala, Srinivasa Rao
Gudla, Balakrishna
Yegnanarayana, B.
Sheela, Anitha K.
2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2015, : 459 - 462
[24] VISUAL-ONLY RECOGNITION OF NORMAL, WHISPERED AND SILENT SPEECH
Petridis, Stavros
Shen, Jie
Cetin, Doruk
Pantic, Maja
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6219 - 6223
[25] VISEME DEFINITIONS COMPARISON FOR VISUAL-ONLY SPEECH RECOGNITION
Cappelletta, Luca
Harte, Naomi
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2109 - 2118
[26] APPEARANCE FEATURE EXTRACTION VERSUS IMAGE TRANSFORM-BASED APPROACH FOR VISUAL SPEECH RECOGNITION
Sagheer, Alaa
Tsuruta, Naoyuki
Taniguchi, Rin-Ichiro
Maeda, Sakashi
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (01) : 101 - 122
[27] Isolated Speech Recognition and Its Transformation in Visual Signs
Qaisar, Saeed Mian
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2019, 14 (02) : 955 - 964
[28] Isolated Speech Recognition and Its Transformation in Visual Signs
Saeed Mian Qaisar
Journal of Electrical Engineering & Technology, 2019, 14 : 955 - 964
[29] Visual Word Recognition
Protopapas, Athanassios
INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2014, 49 (02) : 273 - 274
[30] VISUAL WORD RECOGNITION
LUKATELA, G
PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 1991, 53 (01): : 1 - 2

← 1 2 3 4 5 →