COMPARISONS OF VISUAL FEATURES EXTRACTION TOWARDS AUTOMATIC LIP READING

被引：0

作者：

Butt, Waqqas Ur Rehman ^{[1
]}

Lombardi, Luca ^{[1
]}

机构：

[1] Univ Pavia, I-27100 Pavia, Italy

来源：

EDULEARN13: 5TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES | 2013年

关键词：

Lip Reading; Visual Feature Extraction; Active Shape Model (ASM); Adaptive Appearance model (AAM); Snakes; SCALE-SPACE; FACE;

D O I：

暂无

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

The human face is a dynamic object and has a high degree of variability in its appearance, which makes feature extraction a difficult problem in computer vision. A wide variety of techniques have been proposed. Lip reading computing plays a very important role in automatic speech recognition. For automatic lip reading, there are many competing methods for feature extraction. Often, because of the complexity of the task these methods are tested on only quite restricted datasets. Visual information derived from visual features and most important accurate lip extraction, can provide features invariant to noise perturbation for speech recognition systems and can be also used in a wide variety of applications. There are many techniques available to extract the visual features. Lip reading performance degrades dramatically due to the speaker variability encoded in the visual features. In this paper we compare the most used strategies, both High-Level and Low-Level analysis. In, High-Level Visual speech Analysis we will describe the Active shape model (ASM) and Active Appearance Model (AAM) and for the low-level approach, we will consider pixel-base methods for identification and extraction of salient visual features and compare them. Active Contour Model (ACM) known as "SNAKES" and Comparison of Model based and Image based methods are also described. This paper has put forward a way to select and extract visual features effectively for automatic lip reading.

引用

页码：2188 / 2196

页数：9

共 50 条

[1] Automatic Lip Reading by Using Multimodal Visual Features
Takahashi, Shohei
Ohya, Jun
INTELLIGENT ROBOTS AND COMPUTER VISION XXXI: ALGORITHMS AND TECHNIQUES, 2014, 9025
[2] Visual speech features representation for automatic lip-reading
Sagheer, A
Tsuruta, N
Taniguchi, RK
Maeda, S
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 781 - 784
[3] ROI Processing for Visual Features Extraction in Lip-reading
Wang, Xiaoping
Hao, Yufeng
Fu, Degang
Yuan, Chunwei
2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 178 - +
[4] Lip features automatic extraction
Lievin, M
Luthon, F
1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, 1998, : 168 - 172
[5] Automatic lip localization and feature extraction for lip-reading
Werda, Salah
Mahdi, Walid
Ben Hamadou, Abdehnajid
VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 268 - +
[6] Extraction of Features for Lip-reading Using Autoencoders
Palecek, Karel
SPEECH AND COMPUTER, 2014, 8773 : 209 - 216
[7] Investigation of Effectiveness of Ensemble Features for Visual Lip Reading
Krishnachandran, M.
Ayyappan, Sonal
2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 530 - 533
[8] Lip Movements Recognition Towards An Automatic Lip Reading System for Myanmar Consonants
Thein, Thein
San, Kalyar Myo
2018 12TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2018,
[9] Lip feature extraction towards an automatic speechreading system
Zhang, X
Mersereau, RM
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 226 - 229
[10] Visual units and confusion modelling for automatic lip-reading
Howell, Dominic
Cox, Stephen
Theobald, Barry
IMAGE AND VISION COMPUTING, 2016, 51 : 1 - 12

← 1 2 3 4 5 →