Automatic Lip Reading in the Dutch Language Using Active Appearance Models on High Speed Recordings

被引:0
|
作者
Chitu, Alin Gavril [1 ]
Driel, Karin [1 ]
Rothkrantz, Leon J. M. [1 ]
机构
[1] Delft Univ Technol, Man Machine Interact Grp, Dept Mediamat, NL-2628 CD Delft, Netherlands
来源
TEXT, SPEECH AND DIALOGUE | 2010年 / 6231卷
关键词
lip reading; active appearance models; high speed recordings; data corpus; NDUTAVSC; Dutch;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents our work on lip reading in the Dutch language. The results are based on a new data corpus recorded at 100Hz in our group. The NDUTAVSC corpus is to date the largest corpus build for lip reading in Dutch. For parameterising the input data we use Active Appearance Models. Based on the results of AAM we define a set of high level geometric features which are used for training recognizer systems for different recognition tasks, such as fixed length digits strings, random length letters strings, random word sequences, fixed topic continuous speech and random continuous speech. We show that our approach gives great improvements compared to previous results. We also investigate the influence of the high speed recordings on the performance of the recognition. We show that in the case of high speech rate the use of higher speed recordings is compulsory.
引用
收藏
页码:259 / 266
页数:8
相关论文
共 50 条
  • [1] Automatic landmarking of cephalograms using active appearance models
    Vucinic, Predrag
    Trpovski, Zeljen
    Scepan, Ivana
    EUROPEAN JOURNAL OF ORTHODONTICS, 2010, 32 (03) : 233 - 241
  • [3] AUTOMATIC SEGMENTATION OF LUMBAR VERTEBRAE ON DIGITISED RADIOGRAPHS USING ACTIVE APPEARANCE MODELS
    Roberts, M.
    Cootes, T.
    Pacheco, E.
    Oh, T.
    Adams, J.
    OSTEOPOROSIS INTERNATIONAL, 2009, 20 : S268 - S269
  • [4] Mobile Phone Security using Automatic Lip Reading
    Lesani, Fatemeh Sadat
    Ghazvini, Faranak Fotouhi
    Dianat, Rouhollah
    2015 9TH INTERNATIONAL CONFERENCE ON E-COMMERCE IN DEVELOPING COUNTRIES: WITH FOCUS ON E-BUSINESS (ECDC), 2015,
  • [5] Automatic Lip Reading by Using Multimodal Visual Features
    Takahashi, Shohei
    Ohya, Jun
    INTELLIGENT ROBOTS AND COMPUTER VISION XXXI: ALGORITHMS AND TECHNIQUES, 2014, 9025
  • [6] Automatic segmentation of jaw tissues in CT using active appearance models and semi-automatic landmarking
    Rueda, Sylvia
    Gil, Jose Antonio
    Pichery, Raphael
    Alcaniz, Mariano
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2006, PT 1, 2006, 4190 : 167 - 174
  • [7] Rapid and automatic 3D face modeling using active appearance models
    Fan, Xiaojiu
    Peng, Qiang
    Chen, Jim X
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2009, 21 (05): : 668 - 673
  • [8] Automatic Segmentation of a Fetal Echocardiogram Using Modified Active Appearance Models and Sparse Representation
    Guo, Yi
    Wang, Yuanyuan
    Nie, Siqing
    Yu, Jinhua
    Chen, Ping
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2014, 61 (04) : 1121 - 1133
  • [9] Combining active appearance models and morphological operators using a pipeline for automatic myocardium extraction
    Pfeifer, B
    Hanser, F
    Trieb, T
    Hintermüller, C
    Seger, M
    Fischer, G
    Modre, R
    Tilg, B
    FUNCTIONAL IMAGING AND MODELING OF HEART, PROCEEDINGS, 2005, 3504 : 44 - 53