Vowel Recognition from RGB-D Facial Information

被引:3
|
作者
Carlos Castillo, Jose [1 ]
Encinar, Irene P. [1 ]
Conti-Morera, Alfonso [1 ]
Castro Gonzalez, Alvaro [1 ]
Angel Salichs, Miguel [1 ]
机构
[1] Univ Carlos III Madrid, Roboticslab, Madrid 28911, Spain
关键词
Apraxia of speech; Visual recognition; Classification; RGB-D;
D O I
10.1007/978-3-319-40114-0_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the main concerns in developed countries is population ageing. Elder people are susceptible of suffering conditions which reduce quality of life such as apraxia of speech, a burden that requires prolongued therapy. Our proposal is intended to be a first step towards automated solutions that assist speech therapy through detecting mouth poses. This work proposes a system for vowel poses recognition from an RGB-D camera that provides 2D and 3D information. 2D data is fed into a face recognition approach able to accurately locate and characterize the mouth in the image space. The approach also uses 3D real world measures obtained after pairing the 2D detection with the 3D information. Both information sources are processed by a set of classifiers to ascertain the best option for vowel recognition.
引用
收藏
页码:225 / 232
页数:8
相关论文
共 50 条
  • [21] ACTION RECOGNITION IN RGB-D EGOCENTRIC VIDEOS
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3410 - 3414
  • [22] Object Recognition in Noisy RGB-D Data
    Carlos Rangel, Jose
    Morell, Vicente
    Cazorla, Miguel
    Orts-Escolano, Sergio
    Garcia Rodriguez, Jose
    BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 261 - 270
  • [23] On RGB-D Face Recognition using Kinect
    Goswami, Gaurav
    Bharadwaj, Samarth
    Vatsa, Mayank
    Singh, Richa
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON BIOMETRICS: THEORY, APPLICATIONS AND SYSTEMS (BTAS), 2013,
  • [24] Structured Images for RGB-D Action Recognition
    Wang, Pichao
    Wang, Shuang
    Gao, Zhimin
    Hou, Yonghong
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1005 - 1014
  • [25] RGB-D based Face Reconstruction and Recognition
    Hsu, Gee-Sern
    Liu, Yu-Lun
    Peng, Hsiao-Chia
    Chung, Sheng-Luen
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 339 - 344
  • [26] Analysis of facial motions by means of RGB-D data
    Luguev T.S.
    Luguev I.V.
    Pattern Recognition and Image Analysis, 2015, 25 (03) : 466 - 469
  • [27] Dynamic Facial Dataset Capture and Processing for Visual Speech Recognition using an RGB-D Sensor
    Ahmed, Naveed
    Lataifeh, Mohammed
    Junejo, Imran
    IAENG International Journal of Computer Science, 2020, 47 (04) : 1 - 6
  • [28] Learning Coupled Classifiers with RGB images for RGB-D object recognition
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    PATTERN RECOGNITION, 2017, 61 : 433 - 446
  • [29] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
    Weiyao, Xu
    Muqing, Wu
    Min, Zhao
    Ting, Xia
    IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164
  • [30] Hand part labeling and gesture recognition from RGB-D data
    Yao, Yuan
    Zhang, Linjian
    Qiao, Wenbao
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2013, 25 (12): : 1810 - 1817