Cascading Pose Features with CNN-LSTM for Multiview Human Action Recognition

被引:14
|
作者
Malik, Najeeb ur Rehman [1 ]
Abu-Bakar, Syed Abdul Rahman [1 ]
Sheikh, Usman Ullah [1 ]
Channa, Asma [2 ]
Popescu, Nirvana [2 ]
机构
[1] Univ Teknol Malaysia, Comp Vis Video & Image Proc Lab, ECE Dept, Johor Baharu 81310, Malaysia
[2] Univ Politehn Bucuresti, Comp Sci Dept, Bucharest 060042, Romania
来源
SIGNALS | 2023年 / 4卷 / 01期
基金
欧盟地平线“2020”;
关键词
human action recognition (HAR); deep learning; CNN-LSTM; REPRESENTATION;
D O I
10.3390/signals4010002
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human Action Recognition (HAR) is a branch of computer vision that deals with the identification of human actions at various levels including low level, action level, and interaction level. Previously, a number of HAR algorithms have been proposed based on handcrafted methods for action recognition. However, the handcrafted techniques are inefficient in case of recognizing interaction level actions as they involve complex scenarios. Meanwhile, the traditional deep learning-based approaches take the entire image as an input and later extract volumes of features, which greatly increase the complexity of the systems; hence, resulting in significantly higher computational time and utilization of resources. Therefore, this research focuses on the development of an efficient multi-view interaction level action recognition system using 2D skeleton data with higher accuracy while reducing the computation complexity based on deep learning architecture. The proposed system extracts 2D skeleton data from the dataset using the OpenPose technique. Later, the extracted 2D skeleton features are given as an input directly to the Convolutional Neural Networks and Long Short-Term Memory (CNN-LSTM) architecture for action recognition. To reduce the complexity, instead of passing the whole image, only extracted features are given to the CNN-LSTM architecture, thus eliminating the need for feature extraction. The proposed method was compared with other existing methods, and the outcomes confirm the potential of the proposed technique. The proposed OpenPose-CNNLSTM achieved an accuracy of 94.4% for MCAD (Multi-camera action dataset) and 91.67% for IXMAS (INRIA Xmas Motion Acquisition Sequences). Our proposed method also significantly decreases the computational complexity by reducing the number of inputs features to 50.
引用
收藏
页码:40 / 55
页数:16
相关论文
共 50 条
  • [31] Myoelectric Human Computer Interaction Using CNN-LSTM Neural Network for Dynamic Hand Gestures Recognition
    Li, Qiyu
    Langari, Reza
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5947 - 5949
  • [32] Myoelectric human computer interaction using CNN-LSTM neural network for dynamic hand gesture recognition
    Li, Qiyu
    Langari, Reza
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (03) : 4207 - 4221
  • [33] Human Behavior Recognition Based on CNN-LSTM Hybrid and Multi-Sensing Feature Information Fusion
    Fan C.
    Journal of Combinatorial Mathematics and Combinatorial Computing, 2023, 118 : 143 - 154
  • [34] Robust Pose Features for Action Recognition
    Lee, Hyungtae
    Morariu, Vlad I.
    Davis, Larry S.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 365 - 372
  • [35] Emotion Recognition from Facial Expression Using Hybrid CNN-LSTM Network
    Mohana, M.
    Subashini, P.
    Krishnaveni, M.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (08)
  • [36] Dynamic Two Hand Gesture Recognition using CNN-LSTM based networks
    Sharma, Vaidehi
    Jaiswal, Mohita
    Sharma, Abhishek
    Saini, Sandeep
    Tomar, Raghuvir
    2021 IEEE INTERNATIONAL SYMPOSIUM ON SMART ELECTRONIC SYSTEMS (ISES 2021), 2021, : 224 - 229
  • [37] RECOGNITION OF ATRIAL FIBRILLATION BASED ON CNN-LSTM AND LAPLACIAN SUPPORT VECTOR MACHINE
    Wang, Ying
    Li, Yongjian
    Chen, Meng
    Huo, Rui
    Liu, Lei
    Liang, Yesong
    Wei, Shoushui
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2024, 24 (04)
  • [38] Facial Expression Recognition in Videos An CNN-LSTM based Model for Video Classification
    Abdullah, Muhammad
    Ahmad, Mobeen
    Han, Dongil
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [39] MALICIOUS URL RECOGNITION AND DETECTION USING ATTENTION-BASED CNN-LSTM
    Peng, Yongfang
    Tian, Shengwei
    Yu, Long
    Lv, Yalong
    Wang, Ruijin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (11) : 5580 - 5593
  • [40] A Hybrid CNN-LSTM Network for Hand Gesture Recognition with Surface EMG Signals
    Cai, Zehua
    Zhu, Yuesheng
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878