Deep neural learning techniques with long short-term memory for gesture recognition

被引:1
|
作者
Deepak Kumar Jain
Aniket Mahanti
Pourya Shamsolmoali
Ramachandran Manikandan
机构
[1] Chongqing University of Posts and Telecommunications,Key Laboratory of Intelligent Air
[2] University of Auckland,Ground Cooperative Control for Universities in Chongqing, College of Automation
[3] Shanghai Jiao Tong University,Institute of Image Processing and Pattern Recognition
[4] SASTRA Deemed University,School of Computing
来源
关键词
Gesture recognition; Shift invariant; Convolutional deep structured; Neural learning; Long short-term memory; Bivariate fully recurrent deep neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Gesture recognition is a kind of biometric which has assumed great significance in the field of computer vision for communicating information through human activities. To recognize the various gestures and achieve efficient classification, an efficient computational machine learning technique is required. The Shift Invariant Convolutional Deep Structured Neural Learning with Long Short-Term Memory (SICDSNL–LSTM) and Bivariate Fully Recurrent Deep Neural Network with Long Short-Term Memory (BFRDNN–LSTM) have been introduced for classifying human activities with efficient accuracy and minimal time complexity. The SICDSNL–LSTM technique collects gesture data (a kind of biometric) from the dataset and gives it to the input layers of Shift Invariant Convolutional Deep Structured Neural Learning. The SICDSNL–LSTM technique uses two hidden layers for performing regression and classification. In the regression process, dice similarity is used for measuring the relationship between data and output classes. In the second process, the input data are classified into dissimilar classes for each time step using LSTM unit with soft-step activation function. The soft-step activation function uses ‘forget gate’ for removing the less significant data from the cell state. This is also used to make a decision to display the output at a particular time step and to remove other class results. Then, LSTM output is given to the output layers, and convolutional deep neural learning is performed to classify the gesture. Based on the classification results, human activity and gesture are recognized with high accuracy. The BFRDNN–LSTM technique also performs regression in the first hidden layers using bivariate correlation to find relationship between data and classes. The LSTM unit in BFRDNN–LSTM technique uses Gaussian activation function in the second hidden layers for categorizing incoming data into various classes at each time step. In this proposed BFRDNN–LSTM method, fully recurrent deep neural network utilizes gradient descent function to minimize the error rate at the output layers and to increase the accuracy of the gesture recognition. Both SICDSNL–LSTM and BFRDNN–LSTM techniques automatically learn the features and the data to minimize time complexity in gesture recognition. Experimental evaluation is carried out using factors such as gesture recognition accuracy, false-positive rate and time complexity with a number of data.
引用
收藏
页码:16073 / 16089
页数:16
相关论文
共 50 条
  • [31] Deep Learning for Price Movement Prediction Using Convolutional Neural Network and Long Short-Term Memory
    Yang, Can
    Zhai, Junjie
    Tao, Guihua
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [32] Recognition Method of Massage Techniques Based on Attention Mechanism and Convolutional Long Short-Term Memory Neural Network
    Zhu, Shengding
    Lei, Jingtao
    Chen, Dongdong
    SENSORS, 2022, 22 (15)
  • [33] Bangla Compound Character Recognition by Combining Deep Convolutional Neural Network with Bidirectional Long Short-Term Memory
    Hasan, Md Jahid
    Wahid, Md Ferdous
    Alom, Md Shahin
    2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2019,
  • [34] Multimodal Dimensional Affect Recognition Using Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks
    Pei, Ercheng
    Yang, Le
    Jiang, Dongmei
    Sahli, Hichem
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 208 - 214
  • [35] Toward Transportation Mode Recognition Using Deep Convolutional and Long Short-Term Memory Recurrent Neural Networks
    Qin, Yanjun
    Luo, Haiyong
    Zhao, Fang
    Wang, Chenxing
    Wang, Jiaqi
    Zhang, Yuexia
    IEEE ACCESS, 2019, 7 : 142353 - 142367
  • [36] CONVOLUTIONAL, LONG SHORT-TERM MEMORY, FULLY CONNECTED DEEP NEURAL NETWORKS
    Sainath, Tara N.
    Vinyals, Oriol
    Senior, Andrew
    Sak, Hasim
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4580 - 4584
  • [37] CONSTRUCTING LONG SHORT-TERM MEMORY BASED DEEP RECURRENT NEURAL NETWORKS FOR LARGE VOCABULARY SPEECH RECOGNITION
    Li, Xianggang
    Wu, Xihong
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4520 - 4524
  • [38] Reinforcement learning with long short-term memory
    Bakker, B
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1475 - 1482
  • [39] A Deep Neural Network Model for Short-Term Load Forecast Based on Long Short-Term Memory Network and Convolutional Neural Network
    Tian, Chujie
    Ma, Jian
    Zhang, Chunhong
    Zhan, Panpan
    ENERGIES, 2018, 11 (12)
  • [40] Application of Deep Learning Long Short-Term Memory in Energy Demand Forecasting
    Al Khafaf, Nameer
    Jalili, Mandi
    Sokolowski, Peter
    ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 : 31 - 42