COMPARISONS OF VISUAL FEATURES EXTRACTION TOWARDS AUTOMATIC LIP READING

Citations: 0
Authors
Butt, Waqqas Ur Rehman [1]
Lombardi, Luca [1]
Affiliation
[1] Univ Pavia, I-27100 Pavia, Italy
Keywords
Lip Reading; Visual Feature Extraction; Active Shape Model (ASM); Active Appearance Model (AAM); Snakes; SCALE-SPACE; FACE;
DOI
Not available
Chinese Library Classification (CLC) number
G40 [Education];
Subject classification code
040101 ; 120403 ;
Abstract
The human face is a dynamic object with a high degree of variability in its appearance, which makes facial feature extraction a difficult problem in computer vision. Lip reading plays an important role in automatic speech recognition, and many competing feature extraction methods have been proposed for it. Because of the complexity of the task, these methods are often tested only on quite restricted datasets. Visual information, and above all accurate lip extraction, can provide features that are invariant to noise perturbation for speech recognition systems and can also be used in a wide variety of other applications. Lip reading performance, however, degrades dramatically because of the speaker variability encoded in the visual features. In this paper we compare the most widely used strategies, covering both high-level and low-level analysis. For high-level visual speech analysis, we describe the Active Shape Model (ASM) and the Active Appearance Model (AAM); for the low-level approach, we consider pixel-based methods for identifying and extracting salient visual features and compare them. The Active Contour Model (ACM), known as "snakes", and a comparison of model-based and image-based methods are also described. The paper proposes a way to select and extract visual features effectively for automatic lip reading.
Pages: 2188-2196
Number of pages: 9