Efficient Sign Language Recognition System and Dataset Creation Method Based on Deep Learning and Image Processing

被引:2
|
作者
Cavalcante Carneiro, A. L. [1 ]
Silva, L. Brito [1 ]
Pinheiro Salvadeo, D. H. [1 ]
机构
[1] State Univ Sao Paulo, Dept Stat Appl Math & Computat, Av 24A,1515, BR-13506700 Rio Claro, SP, Brazil
关键词
Deep learning; sign language recognition; convolutional neural networks; image classification; object detection;
D O I
10.1117/12.2601018
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
New deep-learning architectures are created every year, achieving state-of-the-art results in image recognition and leading to the belief that, in a few years, complex tasks such as sign language translation will be considerably easier, serving as a communication tool for the hearing-impaired community. On the other hand, these algorithms still need a lot of data to be trained and the dataset creation process is expensive, time-consuming, and slow. Thereby, this work aims to investigate techniques of digital image processing and machine learning that can be used to create a sign language dataset effectively. We argue about data acquisition, such as the frames per second rate to capture or subsample the videos, the background type, preprocessing, and data augmentation, using convolutional neural networks and object detection to create an image classifier and comparing the results based on statistical tests. Different datasets were created to test the hypotheses, containing 14 words used daily and recorded by different smartphones in the RGB color system. We achieved an accuracy of 96.38% on the test set and 81.36% on the validation set containing more challenging conditions, showing that 30 FPS is the best frame rate subsample to train the classifier, geometric transformations work better than intensity transformations, and artificial background creation is not effective to model generalization. These trade-offs should be considered in future work as a cost-benefit guideline between computational cost and accuracy gain when creating a dataset and training a sign recognition model.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] RECOGNITION OF SIGN LANGUAGE GESTURES USING DEEP LEARNING
    Manoj, R.
    Karthick, R. E.
    Priyadharshini, Indira R.
    Renuka, G.
    Monica
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 508 - 516
  • [32] Deep Learning Methods for Indian Sign Language Recognition
    Likhar, Pratik
    Bhagat, Neel Kamal
    Rathna, G. N.
    2020 IEEE 10TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2020,
  • [33] Isolated Sign Language Recognition Using Deep Learning
    Das, Sukanya
    Yadav, Sumit Kumar
    Samanta, Debasis
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 343 - 356
  • [34] Fall Behavior Recognition Based on Deep Learning and Image Processing
    Xu, He
    Shen, Leixian
    Zhang, Qingyun
    Cao, Guoxu
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2018, 9 (04) : 1 - 15
  • [35] Image based Arabic Sign Language recognition
    Mohandes, M
    Deriche, M
    ISSPA 2005: The 8th International Symposium on Signal Processing and its Applications, Vols 1 and 2, Proceedings, 2005, : 86 - 89
  • [36] A computer vision-based system for recognition and classification of Urdu sign language dataset
    Zahid H.
    Rashid M.
    Syed S.A.
    Ullah R.
    Asif M.
    Khan M.
    Mujeeb A.A.
    Khan A.H.
    PeerJ Computer Science, 2022, 8
  • [37] Image based recognition of Pakistan sign language
    Raees, Muhammad
    Ullah, Sehat
    Rahman, Sami Ur
    Rabbi, Ihsan
    JOURNAL OF ENGINEERING RESEARCH, 2016, 4 (01): : 22 - 41
  • [38] A computer vision-based system for recognition and classification of Urdu sign language dataset
    Zahid, Hira
    Rashid, Munaf
    Syed, Sidra Abid
    Ullah, Rafi
    Asif, Muhammad
    Khan, Muzammil
    Mujeeb, Amenah Abdul
    Khan, Ali Haider
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [39] Deep learning pathways for automatic sign language processing
    Toshpulatov, Mukhiddin
    Lee, Wookey
    Jun, Jaesung
    Lee, Suan
    PATTERN RECOGNITION, 2025, 164
  • [40] An Efficient Method for Sign Language Recognition from Image Using Convolutional Neural Network
    Kotarski, Sebastian
    Maleszka, Bernadetta
    MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 99 - 108