Efficient Sign Language Recognition System and Dataset Creation Method Based on Deep Learning and Image Processing

被引:2
|
作者
Cavalcante Carneiro, A. L. [1 ]
Silva, L. Brito [1 ]
Pinheiro Salvadeo, D. H. [1 ]
机构
[1] State Univ Sao Paulo, Dept Stat Appl Math & Computat, Av 24A,1515, BR-13506700 Rio Claro, SP, Brazil
关键词
Deep learning; sign language recognition; convolutional neural networks; image classification; object detection;
D O I
10.1117/12.2601018
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
New deep-learning architectures are created every year, achieving state-of-the-art results in image recognition and leading to the belief that, in a few years, complex tasks such as sign language translation will be considerably easier, serving as a communication tool for the hearing-impaired community. On the other hand, these algorithms still need a lot of data to be trained and the dataset creation process is expensive, time-consuming, and slow. Thereby, this work aims to investigate techniques of digital image processing and machine learning that can be used to create a sign language dataset effectively. We argue about data acquisition, such as the frames per second rate to capture or subsample the videos, the background type, preprocessing, and data augmentation, using convolutional neural networks and object detection to create an image classifier and comparing the results based on statistical tests. Different datasets were created to test the hypotheses, containing 14 words used daily and recorded by different smartphones in the RGB color system. We achieved an accuracy of 96.38% on the test set and 81.36% on the validation set containing more challenging conditions, showing that 30 FPS is the best frame rate subsample to train the classifier, geometric transformations work better than intensity transformations, and artificial background creation is not effective to model generalization. These trade-offs should be considered in future work as a cost-benefit guideline between computational cost and accuracy gain when creating a dataset and training a sign recognition model.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] ESMAANI: A Static and Dynamic Arabic Sign Language Recognition System Based on Machine and Deep Learning Models
    Hisham, Essam
    Saleh, Sherine Nagy
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [42] An Image Compression Processing Method Based On Deep Learning
    Liu Ruihua
    Zhou Quan
    Xiao Huachao
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 342 - 346
  • [43] Efhamni: A Deep Learning-Based Saudi Sign Language Recognition Application
    Al Khuzayem, Lama
    Shafi, Suha
    Aljahdali, Safia
    Alkhamesie, Rawan
    Alzamzami, Ohoud
    SENSORS, 2024, 24 (10)
  • [44] Hand Landmark-Based Sign Language Recognition Using Deep Learning
    John, Jerry
    Sherif, Bismin, V
    MACHINE LEARNING AND AUTONOMOUS SYSTEMS, 2022, 269 : 147 - 157
  • [45] Survey on sign language recognition in context of vision-based and deep learning
    Subburaj, S.
    Murugavalli, S.
    Measurement: Sensors, 2022, 23
  • [46] Traffic Sign Recognition by Image Preprocessing and Deep Learning
    Khamdamov, U. R.
    Umarov, M. A.
    Khalilov, S. P.
    Kayumov, A. A.
    Abidova, F. Sh.
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT II, 2024, 14532 : 81 - 92
  • [47] Efficient Tracking Method to Make a Real Time Sign Language Recognition System
    Jebali, Maher
    Dalle, Patrice
    Jemni, Mohamed
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, ICCHP 2014, PT II, 2014, 8548 : 454 - 457
  • [48] A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition
    Adaloglou, Nikolas
    Chatzis, Theocharis
    Papastratis, Ilias
    Stergioulas, Andreas
    Papadopoulos, Georgios Th.
    Zacharopoulou, Vassia
    Xydopoulos, George J.
    Atzakas, Klimnis
    Papazachariou, Dimitris
    Daras, Petros
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1750 - 1762
  • [49] A Model for Qur'anic Sign Language Recognition Based on Deep Learning Algorithms
    AbdElghfar, Hany A. A.
    Ahmed, Abdelmoty M. M.
    Alani, Ali A. A.
    AbdElaal, Hammam M.
    Bouallegue, Belgacem
    Khattab, Mahmoud M.
    Tharwat, Gamal
    Youness, Hassan A. A.
    JOURNAL OF SENSORS, 2023, 2023
  • [50] A sensing data and deep learning-based sign language recognition approach
    Hao, Wei
    Hou, Chen
    Zhang, Zhihao
    Zhai, Xueyu
    Wang, Li
    Lv, Guanghao
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118