Residual deep gated recurrent unit-based attention framework for human activity recognition by exploiting dilated features

Cited by: 1
Authors
Pandey, Ajeet [1]
Kumar, Piyush [1]
Affiliation
[1] Natl Inst Technol Patna, Comp Sci & Engn, Patna 800005, Bihar, India
Source
VISUAL COMPUTER | 2024, Vol. 40, No. 12
Keywords
Dilated convolutional neural network; Gated recurrent unit; Attention mechanism; Action recognition; Residual mechanism; NETWORK; LSTM;
DOI
10.1007/s00371-024-03266-w
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Human activity recognition (HAR) in video streams has become a thriving research area in computer vision and pattern recognition. Activity recognition in real-world video is demanding owing to a lack of data covering variations in motion and style, and to cluttered backgrounds. Current HAR approaches primarily apply pre-trained weights of various deep learning (DL) models to describe frame appearance during the learning phase. This affects the assessment of feature discrepancies, such as the separation between temporal and visual cues. To address this issue, a residual deep gated recurrent unit (RD-GRU)-enabled attention framework with a dilated convolutional neural network (DiCNN) is introduced in this article. The approach targets potentially informative content in the input video frames to recognize the distinct activities in the videos. The DiCNN is used to capture crucial, distinctive features. In this network, a skip connection is employed with the DiCNN to update the information so that it retains more knowledge than a shallow layer. These features are then fed into an attention module to capture additional high-level discriminative patterns and signs associated with actions. The attention mechanism is followed by an RD-GRU that learns long video sequences to enhance performance. Accuracy, precision, recall, and F1-score are used to evaluate the introduced model on four diverse benchmark datasets: UCF11, UCF Sports, JHMDB, and THUMOS. On these datasets it achieves accuracies of 98.54%, 99.31%, 82.47%, and 95.23%, respectively. These results support the validity of the proposed work compared with state-of-the-art (SOTA) methods.
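The abstract names two building blocks, dilated convolution and a skip (residual) connection, without giving their form. The following is a minimal sketch of the general idea only, not the authors' implementation: it works on a 1D signal in plain Python rather than on video frames, and the function names `dilated_conv1d`, `residual_dilated_block`, and `receptive_field` are hypothetical and chosen for illustration.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Zero-padded, length-preserving 1D dilated convolution (cross-correlation).

    Kernel taps are spaced `dilation` samples apart, so the same number of
    weights covers a wider stretch of the input.
    """
    k = len(kernel)
    span = dilation * (k - 1)      # distance between first and last tap
    left = span // 2               # zeros added before the signal
    padded = [0.0] * left + list(signal) + [0.0] * (span - left)
    return [
        sum(kernel[j] * padded[i + j * dilation] for j in range(k))
        for i in range(len(signal))
    ]


def residual_dilated_block(signal, kernel, dilation=1):
    """Dilated convolution, then ReLU, then a skip connection adding the input back."""
    conv = dilated_conv1d(signal, kernel, dilation)
    activated = [max(0.0, v) for v in conv]            # ReLU non-linearity
    return [x + y for x, y in zip(signal, activated)]  # skip connection


def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 dilated convolutions."""
    return 1 + sum(d * (kernel_size - 1) for d in dilations)


if __name__ == "__main__":
    x = [1, 2, 3, 4, 5]
    print(dilated_conv1d(x, [1, 0, -1], dilation=2))
    print(residual_dilated_block(x, [1, 0, -1], dilation=2))
    print(receptive_field(3, [1, 2, 4]))
```

Because the skip connection adds the input back to the block output, the original signal is still available to later layers even where the ReLU zeroes the convolution response. Stacking layers with growing dilation widens the receptive field without adding weights per layer.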
Pages: 8693 - 8712
Number of pages: 20
Related papers
50 in total
  • [21] A Deep Dilated Convolutional Self-attention Model for Multimodal Human Activity Recognition
    Wang, Shengzhi
    Xiao, Shuo
    Wang, Yu
    Jiang, Haifeng
    Zhang, Guopeng
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 791 - 797
  • [22] An Attention Minimal Gated Unit-Based Causality Analysis Framework for Root Cause Diagnosis of Faults in Nonstationary Industrial Processes
    Ma, Liang
    Peng, Yifei
    Peng, Kaixiang
    IEEE SENSORS JOURNAL, 2025, 25 (04) : 6952 - 6966
  • [23] Facial expression recognition based on bidirectional gated recurrent units within deep residual network
    Shen, Wenjuan
    Li, Xiaoling
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2020, 13 (04) : 527 - 543
  • [25] ACBiGRU-DAO: Attention Convolutional Bidirectional Gated Recurrent Unit-based Dynamic Arithmetic Optimization for Air Quality Prediction
    Panneerselvam, Vinoth
    Thiagarajan, Revathi
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (37) : 86804 - 86820
  • [26] Human action recognition using attention based LSTM network with dilated CNN features
    Muhammad, Khan
    Mustaqeem
    Ullah, Amin
    Imran, Ali Shariq
    Sajjad, Muhammad
    Kiran, Mustafa Servet
    Sannino, Giovanna
    de Albuquerque, Victor Hugo C.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 820 - 830
  • [27] Self-attention transformer unit-based deep learning framework for skin lesions classification in smart healthcare
    Rezaee, Khosro
    Zadeh, Hossein Ghayoumi
    DISCOVER APPLIED SCIENCES, 2024, 6 (01)
  • [28] Attention-Based Residual BiLSTM Networks for Human Activity Recognition
    Zhang, Junjie
    Liu, Yuanhao
    Yuan, Hua
    IEEE ACCESS, 2023, 11 : 94173 - 94187
  • [29] END-TO-END LANGUAGE RECOGNITION USING ATTENTION BASED HIERARCHICAL GATED RECURRENT UNIT MODELS
    Padi, Bharat
    Mohan, Anand
    Ganapathy, Sriram
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5966 - 5970
  • [30] An effective framework of human abnormal behaviour recognition and tracking using multiscale dilated assisted residual attention network
    Vidya, Queen Mary
    Selvakumar, S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247