A multiple stream architecture for the recognition of signs in Brazilian sign language in the context of health

被引:4
|
作者
da Silva, Diego R. B. [1 ]
de Araujo, Tiago Maritan U. [2 ]
do Rego, Thais Gaudencio [2 ]
Brandao, Manuella Aschoff Cavalcanti [2 ]
Goncalves, Luiz Marcos Garcia [1 ]
机构
[1] Univ Fed Rio Grande do Norte, Natal, Brazil
[2] Univ Fed Paraiba, Joao Pessoa, Brazil
关键词
Sign language; Datasets; Deep learning; Neural networks; Libras; RECOMMENDATION SYSTEM; MEDICAL IMAGES;
D O I
10.1007/s11042-023-16332-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deaf people communicate naturally through sign languages and often face barriers to communicating with hearing people and accessing information in written languages. These communication difficulties are aggravated in the health domain, especially in a hospital emergency, when human sign language interpreters are unavailable. This paper proposes a solution for automatically recognizing signs in Brazilian Sign Language (Libras) in the health context to reduce this problem. The idea is that the system could assist in the communication between a Deaf patient and his doctor in the future. Our solution involves a multiple-stream architecture that combines convolutional and recurrent neural networks, dealing with sign languages' visual phonemes individual and specialized ways. The first stream uses the optical flow as input for capturing information about the "movement" of the sign; the second stream extracts kinematic and postural features, including "handshapes" and "facial expressions"; and the third stream process the raw RGB images to address additional attributes about the sign not captured in the previous streams. Thus, we can process more spatiotemporal features that discriminate the classes during the training stage. The computational results show that the solution can recognize signs in Libras in the health context, with an average accuracy, precision, recall, and f1-score of 99.80%, 99.81%, 99.80%, and 99.80%, respectively. Our system also performed better than other works in the literature, obtaining an average accuracy of 100% in an Argentine Sign Language (LSA) dataset, which is usually used for comparison purposes.
引用
收藏
页码:19767 / 19785
页数:19
相关论文
共 50 条
  • [21] Real-Time Sign Language Recognition Based on Video Stream
    Zhao, Kai
    Zhang, Kejun
    Zhai, Yu
    Wang, Daotong
    Su, Jianbo
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7469 - 7474
  • [22] A Proposed Hybrid Sensor Architecture for Arabic Sign Language Recognition
    ElBadawy, Menna
    Elons, A. Samir
    Sheded, Hwaida
    Tolba, Mohamed F.
    INTELLIGENT SYSTEMS'2014, VOL 2: TOOLS, ARCHITECTURES, SYSTEMS, APPLICATIONS, 2015, 323 : 721 - 730
  • [23] THE ROLE OF BRAZILIAN SIGN LANGUAGE INTERPRETERS IN THE CONTEXT OF BASIC EDUCATION IN BRAZIL
    do Carmo, Livia Silveira
    de Freitas Reis, Marlene Barbosa
    HUMANIDADES & INOVACAO, 2022, 9 (13): : 10 - 20
  • [24] Real-time sign language recognition based on video stream
    Zhao K.
    Zhang K.
    Zhai Y.
    Wang D.
    Su J.
    International Journal of Systems, Control and Communications, 2021, 12 (02) : 158 - 174
  • [25] Context Matters: Self-Attention for Sign Language Recognition
    Slimane, Fares Ben
    Bouguessa, Mohamed
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7884 - 7891
  • [26] Gesture recognition: A review focusing on sign language in a mobile context
    Neiva, Davi Hirafuji
    Zanchettin, Cleber
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 103 : 159 - 183
  • [27] Multiple Proposals for Continuous Arabic Sign Language Recognition
    Hassan, Mohamed
    Assaleh, Khaled
    Shanableh, Tamer
    SENSING AND IMAGING, 2019, 20 (1):
  • [28] Multiple Proposals for Continuous Arabic Sign Language Recognition
    Mohamed Hassan
    Khaled Assaleh
    Tamer Shanableh
    Sensing and Imaging, 2019, 20
  • [29] Using multiple sensors for mobile sign language recognition
    Brashear, H
    Starner, T
    Lukowicz, P
    Junker, H
    SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, PROCEEDINGS, 2003, : 45 - 52
  • [30] Sign Language Recognition Using Multiple Kernel Learning: A Case Study of Pakistan Sign Language
    Shah, Farman
    Shah, Muhammad Saqlain
    Akram, Waseem
    Manzoor, Awais
    Mahmoud, Rasha Orban
    Abdelminaam, Diaa Salama
    IEEE ACCESS, 2021, 9 : 67548 - 67558