Deep learning-based sign language recognition system using both manual and non-manual components fusion

被引:1
|
作者
Jebali, Maher [1 ]
Dakhli, Abdesselem [1 ]
Bakari, Wided [1 ]
机构
[1] Univ Hail, Comp Sci Dept, POB 2440, Hail 100190, Saudi Arabia
来源
AIMS MATHEMATICS | 2024年 / 9卷 / 01期
关键词
CNN; CTC; recurrent neural network; sign language recognition; head pose;
D O I
10.3934/math.2024105
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Sign language is regularly adopted by speech-impaired or deaf individuals to convey information; however, it necessitates substantial exertion to acquire either complete knowledge or skill. Sign language recognition (SLR) has the intention to close the gap between the users and the non-users of sign language by identifying signs from video speeches. This is a fundamental but arduous task as sign language is carried out with complex and often fast hand gestures and motions, facial expressions and impressionable body postures. Nevertheless, non-manual features are currently being examined since numerous signs have identical manual components but vary in non-manual components. To this end, we suggest a novel manual and non-manual SLR system (MNM-SLR) using a convolutional neural network (CNN) to get the benefits of multi-cue information towards a significant recognition rate. Specifically, we suggest a model for a deep convolutional, long short-term memory network that simultaneously exploits the non-manual features, which is summarized by utilizing the head pose, as well as a model of the embedded dynamics of manual features. Contrary to other frequent works that focused on depth cameras, multiple camera visuals and electrical gloves, we employed the use of RGB, which allows individuals to communicate with a deaf person through their personal devices. As a result, our framework achieves a high recognition rate with an accuracy of 90.12% on the SIGNUM dataset and 94.87% on RWTH-PHOENIX-Weather 2014 dataset.
引用
收藏
页码:2105 / 2122
页数:18
相关论文
共 50 条
  • [31] Polite appearances: How non-manual features convey politeness in British Sign Language
    Mapson, Rachel
    JOURNAL OF POLITENESS RESEARCH-LANGUAGE BEHAVIOUR CULTURE, 2014, 10 (02): : 157 - 184
  • [32] A Multi-layer Model for Sign Language's Non-Manual Gestures Generation
    El Ghoul, Oussama
    Jemni, Mohamed
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, ICCHP 2014, PT II, 2014, 8548 : 466 - 473
  • [33] Difference in the production of Non-Manual Expressions for fluent signers in Brazilian Sign Language as first or second language
    Hanada, Leticia Kaori
    Barbosa, Plinio Almeida
    REVISTA DE ESTUDOS DA LINGUAGEM, 2022, 30 (01) : 53 - 84
  • [34] PEDAGOGICAL CHALLENGES IN TEACHING NON-MANUAL FEATURES TO LEARNERS OF IRISH SIGN LANGUAGE (ISL) AS A SECOND LANGUAGE
    Patrick, A. Matthews
    EDULEARN11: 3RD INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2011, : 4708 - 4715
  • [35] Exploiting Association Rules Mining to inform the use of non-manual features in sign language processing
    Smith, Robert G.
    SIGN LANGUAGE & LINGUISTICS, 2025,
  • [36] Addressing the Cardinals Puzzle: New Insights from Non-Manual Markers in Italian Sign Language
    Mantovan, Lara
    Geraci, Carlo
    Cardinaletti, Anna
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [37] Efhamni: A Deep Learning-Based Saudi Sign Language Recognition Application
    Al Khuzayem, Lama
    Shafi, Suha
    Aljahdali, Safia
    Alkhamesie, Rawan
    Alzamzami, Ohoud
    SENSORS, 2024, 24 (10)
  • [38] A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition
    Adaloglou, Nikolas
    Chatzis, Theocharis
    Papastratis, Ilias
    Stergioulas, Andreas
    Papadopoulos, Georgios Th.
    Zacharopoulou, Vassia
    Xydopoulos, George J.
    Atzakas, Klimnis
    Papazachariou, Dimitris
    Daras, Petros
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1750 - 1762
  • [39] A sensing data and deep learning-based sign language recognition approach
    Hao, Wei
    Hou, Chen
    Zhang, Zhihao
    Zhai, Xueyu
    Wang, Li
    Lv, Guanghao
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118
  • [40] Mouth features as non-manual cues for the categorization of lexical and productive signs in French Sign Language (LSF)
    Balvet, Antonio
    Sallandre, Marie-Anne
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,