Vowel Recognition from RGB-D Facial Information

Cited by: 3
Authors
Carlos Castillo, Jose [1]
Encinar, Irene P. [1]
Conti-Morera, Alfonso [1]
Castro Gonzalez, Alvaro [1]
Angel Salichs, Miguel [1]
Affiliations
[1] Univ Carlos III Madrid, Roboticslab, Madrid 28911, Spain
Keywords
Apraxia of speech; Visual recognition; Classification; RGB-D
DOI
10.1007/978-3-319-40114-0_25
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
One of the main concerns in developed countries is population ageing. Older people are susceptible to conditions that reduce quality of life, such as apraxia of speech, a burden that requires prolonged therapy. Our proposal is intended as a first step towards automated solutions that assist speech therapy by detecting mouth poses. This work proposes a system for vowel pose recognition from an RGB-D camera that provides 2D and 3D information. The 2D data is fed into a face recognition approach able to accurately locate and characterize the mouth in image space. The approach also uses real-world 3D measurements obtained by pairing the 2D detection with the 3D information. Both information sources are processed by a set of classifiers to ascertain the best option for vowel recognition.
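The abstract does not say how the 2D mouth detection is paired with the 3D data. As an illustration only, and not the authors' implementation, the sketch below assumes a standard pinhole camera model: each detected mouth-corner pixel is back-projected with its depth reading, and the distance between the two 3D points gives a real-world mouth width. The function names, the intrinsics (`fx`, `fy`, `cx`, `cy`), and the sample values are all hypothetical.

```python
import math


def back_project(u, v, depth_m, fx, fy, cx, cy):
    """Map pixel (u, v) with depth in metres to a 3D camera-frame point.

    Assumes an ideal pinhole camera with focal lengths (fx, fy) and
    principal point (cx, cy), all in pixels. These are assumed inputs,
    not values taken from the paper.
    """
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return (x, y, depth_m)


def mouth_width_m(left_px, right_px, depth_left_m, depth_right_m, fx, fy, cx, cy):
    """Euclidean distance in metres between two back-projected mouth corners."""
    p_left = back_project(left_px[0], left_px[1], depth_left_m, fx, fy, cx, cy)
    p_right = back_project(right_px[0], right_px[1], depth_right_m, fx, fy, cx, cy)
    return math.dist(p_left, p_right)


# Hypothetical example: mouth corners 40 pixels apart, both 0.5 m from the camera.
width = mouth_width_m((300, 240), (340, 240), 0.5, 0.5,
                      fx=500.0, fy=500.0, cx=320.0, cy=240.0)
```

A measurement like this could serve as one feature alongside the 2D mouth descriptors before classification. Which features and classifiers the authors actually used is not stated in the abstract.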
Pages: 225-232
Page count: 8