Human Action Recognition Using Key-Frame Attention-Based LSTM Networks

被引:1
|
作者
Yang, Changxuan [1 ]
Mei, Feng [1 ]
Zang, Tuo [1 ]
Tu, Jianfeng [1 ]
Jiang, Nan [1 ]
Liu, Lingfeng [1 ,2 ]
机构
[1] East China JiaoTong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] Jiangxi Minxuan Intelligent Technol Co Ltd, Nanchang 330029, Peoples R China
关键词
action recognition; ARMA; attention mechanism; key-frame extraction; K-means; LSTM; REPRESENTATION;
D O I
10.3390/electronics12122622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition is a classical problem in computer vision and machine learning, and the task of effectively and efficiently recognising human actions is a concern for researchers. In this paper, we propose a key-frame-based approach to human action recognition. First, we designed a key-frame attention-based LSTM network (KF-LSTM) using the attention mechanism, which can be combined with LSTM to effectively recognise human action sequences by assigning different weight scale values to give more attention to key frames. In addition, we designed a new key-frame extraction method by combining an automatic segmentation model based on the autoregressive moving average (ARMA) algorithm and the K-means clustering algorithm. This method effectively avoids the possibility of inter-frame confusion in the temporal sequence of key frames of different actions and ensures that the subsequent human action recognition task proceeds smoothly. The dataset used in the experiments was acquired with an IMU sensor-based motion capture device, and we separately extracted the motion features of each joint using a manual method and then performed collective inference.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Novel Attention-Based Key-Frame Determination Method
    Shih, Huang-Chia
    IEEE TRANSACTIONS ON BROADCASTING, 2013, 59 (03) : 556 - 562
  • [2] KEY-FRAME EXTRACTION AND KEY-FRAME RATE DETERMINATION USING HUMAN ATTENTION MODELING
    Shih, Huang-Chia
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [3] Temporal-scale Convolutional Networks for Human Action Recognition Based on Key-Frame Extraction
    Wei, Zhao-qiang
    Kong, Yong-qiang
    Wei, Zhen-gang
    Zhang, Xiao-long
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 484 - 489
  • [4] Human action recognition using two-stream attention based LSTM networks
    Dai, Cheng
    Liu, Xingang
    Lai, Jinfeng
    APPLIED SOFT COMPUTING, 2020, 86
  • [5] A new Approach to Speed up in Action Recognition Based on Key-frame Extraction
    Azouji, Neda
    Azimifar, Zohreh
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 219 - 222
  • [6] Advancing human action recognition: A hybrid approach using attention-based LSTM and 3D CNN
    Saoudi, El Mehdi
    Jaafari, Jaafar
    Andaloussi, Said Jai
    SCIENTIFIC AFRICAN, 2023, 21
  • [7] Attention-based LSTM Network for Wearable Human Activity Recognition
    Sun, Bo
    Liu, Meiqin
    Zheng, Ronghao
    Zhang, Senlin
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8677 - 8682
  • [8] Recognition of Ironic Sentences in Twitter using Attention-Based LSTM
    Martini, Andrianarisoa Tojo
    Farrukh, Makhmudov
    Ge, Hongwei
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (08) : 7 - 11
  • [9] Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection
    Song, Sijie
    Lan, Cuiling
    Xing, Junliang
    Zeng, Wenjun
    Liu, Jiaying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3459 - 3471
  • [10] Key-Frame based Fvent Recognition in Unconstrained Videos using Temporal Features
    Jana, Prithwish
    Bhaumik, Swarnabja
    Mohanta, Partha Pratim
    PROCEEDINGS OF 2019 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2019, : 349 - 354