Residual deep gated recurrent unit-based attention framework for human activity recognition by exploiting dilated features

被引:1
|
作者
Pandey, Ajeet [1 ]
Kumar, Piyush [1 ]
机构
[1] Natl Inst Technol Patna, Comp Sci & Engn, Patna 800005, Bihar, India
来源
VISUAL COMPUTER | 2024年 / 40卷 / 12期
关键词
Dilated convolutional neural network; Gated recurrent unit; Attention mechanism; Action recognition; Residual mechanism; NETWORK; LSTM;
D O I
10.1007/s00371-024-03266-w
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human activity recognition (HAR) in video streams becomes a thriving research area in computer vision and pattern recognition. Activity recognition in actual video is quite demanding due to a lack of data with respect to motion, way or style, and cluttered background. The current HAR approaches primarily apply pre-trained weights of various deep learning (DL) models for the apparent description of frames during the learning phase. It impacts the assessment of feature discrepancies, like the separation between both the temporal and visual cues. To address this issue, a residual deep gated recurrent unit (RD-GRU)-enabled attention framework with a dilated convolutional neural network (DiCNN) is introduced in this article. This approach particularly targets potential information in the input video frame to recognize the distinct activities in the videos. The DiCNN network is used to capture the crucial, unique features. In this network, the skip connection segment is employed with DiCNN to update the information that retains more knowledge than a shallow layer. Moreover, these features are fed into an attention module to capture the added high-level discriminative action associated with patterns and signs. The attention mechanism is followed by an RD-GRU to learn the long video sequences in order to enhance the performance. The performance metrics, namely accuracy, precision, recall, and f1-score, are used to evaluate the performance of the introduced model on four diverse benchmark datasets: UCF11, UCF Sports, JHMDB, and THUMOS. On these datasets it achieves an accuracy of 98.54%, 99.31%, 82.47%, and 95.23%, respectively. This illustrates the validity of the proposed work compared with state-of-the-art (SOTA) methods.
引用
收藏
页码:8693 / 8712
页数:20
相关论文
共 50 条
  • [41] Rotating machinery fault classification based on one-dimensional residual network with attention mechanism and bidirectional gated recurrent unit
    Dong, Zhilin
    Zhao, Dezun
    Cui, Lingli
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (08)
  • [42] Deep Triplet Networks with Attention for Sensor-based Human Activity Recognition
    Khaertdinov, Bulat
    Ghaleb, Esam
    Asteriadis, Stylianos
    2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), 2021,
  • [43] Radar Human Activity Recognition with an Attention-Based Deep Learning Network
    Huan, Sha
    Wu, Limei
    Zhang, Man
    Wang, Zhaoyue
    Yang, Chao
    SENSORS, 2023, 23 (06)
  • [44] Modified deep residual network architecture deployed on serverless framework of IoT platform based on human activity recognition application
    Keshavarzian, Alireza
    Sharifian, Saeed
    Seyedin, Sanaz
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 14 - 28
  • [45] A Novel Deep Multifeature Extraction Framework Based on Attention Mechanism Using Wearable Sensor Data for Human Activity Recognition
    Wang, Yang
    Xu, Hongji
    Liu, Yunxia
    Wang, Mengmeng
    Wang, Yuhao
    Yang, Yang
    Zhou, Shuang
    Zeng, Jiaqi
    Xu, Jie
    Li, Shijie
    Li, Jianjun
    IEEE SENSORS JOURNAL, 2023, 23 (07) : 7188 - 7198
  • [46] SKELETON-BASED VIEW INVARIANT DEEP FEATURES FOR HUMAN ACTIVITY RECOGNITION
    Dhiman, Chhavi
    Saxena, Manan
    Vishwakarma, Dinesh Kumar
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), 2019, : 225 - 230
  • [47] Hybrid data augmentation and deep attention-based dilated convolutional-recurrent neural networks for speech emotion recognition
    Pham, Nhat Truong
    Dang, Duc Ngoc Minh
    Nguyen, Ngoc Duy
    Nguyen, Thanh Thi
    Nguyen, Hai
    Manavalan, Balachandran
    Lim, Chee Peng
    Nguyen, Sy Dzung
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [48] Significance of handcrafted features in human activity recognition with attention-based RNN models
    Abraham, Sonia
    James, Rekha K.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (10) : 1151 - 1163
  • [50] A Seismic Sensor based Human Activity Recognition Framework using Deep Learning
    Choudhary, Priyankar
    Goel, Neeraj
    Saini, Mukesh
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,