Skeleton-Guided Action Recognition with Multistream 3D Convolutional Neural Network for Elderly-Care Robot

Cited by: 1
Authors
Zhang, Dawei [1]
Zhang, Yanmin [2]
Zhou, Meng [1]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
action recognition; deep learning; service robots; 2-STREAM;
DOI
10.1002/aisy.202300326
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the arrival of a global aging society, elderly-care robots are attracting growing interest, and action recognition allows them to provide better care services. This article presents a skeleton-guided action recognition framework with a multistream 3D convolutional neural network for elderly-care robots. Two parallel dual-stream lightweight networks (Light-SlowFast, with ResNet-18 backbones) are proposed to enhance feature extraction for human actions while reducing computation. Two different modes of skeleton input video are constructed to improve recognition accuracy through decision fusion. A feature fusion layer and a sliding window mechanism are designed, and two cross-entropy losses are used to supervise training. A dataset named elder care action recognition (EC-AR), covering different categories of action, is built. Experimental results on the HMDB-51 and EC-AR datasets demonstrate that the proposed framework outperforms existing methods. The method is also applied to a prototype elderly-care robot, and tests in home scenarios show that it retains high recognition accuracy and good real-time performance. © 2023 WILEY-VCH GmbH
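The abstract mentions decision fusion across the two skeleton input modes but does not state the fusion rule. As a rough illustration only, the sketch below assumes a weighted average of the per-class softmax scores from two streams; the function names, the `weight_a` parameter, and the example logits are all hypothetical and are not taken from the article.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)

def fuse_decisions(logits_a, logits_b, weight_a=0.5):
    """Late (decision-level) fusion of two classifiers.

    Assumption: a weighted average of class probabilities. The article
    may use a different rule; this is only one common choice.
    """
    probs = weight_a * softmax(logits_a) + (1.0 - weight_a) * softmax(logits_b)
    return int(np.argmax(probs)), probs

# Hypothetical per-class scores from the two skeleton input modes.
logits_mode_1 = np.array([2.0, 0.5, 0.1])
logits_mode_2 = np.array([0.2, 0.3, 3.0])

label, probs = fuse_decisions(logits_mode_1, logits_mode_2)
print(label, probs)
```

Averaging probabilities rather than raw logits keeps each stream's contribution on the same scale, which is why the sketch applies softmax before combining.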
Pages: 11
Related papers
50 in total
  • [31] Automatic 3D Pollen Recognition Based on Convolutional Neural Network
    Wang, Zhuo
    Wang, Zixuan
    Wang, Likai
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [32] Temporal Residual Feature Learning for Efficient 3D Convolutional Neural Network on Action Recognition Task
    Wang, Haonan
    Mei, Yuchen
    Lin, Jun
    Wang, Zhongfeng
    2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 123 - 128
  • [33] Viewpoint guided multi-stream neural network for skeleton action recognition
    Yicheng He
    Zixi Liang
    Shaocong He
    Yonghua Wang
    Ming Yin
    Multimedia Tools and Applications, 2024, 83 : 6783 - 6802
  • [34] Viewpoint guided multi-stream neural network for skeleton action recognition
    He, Yicheng
    Liang, Zixi
    He, Shaocong
    Wang, Yonghua
    Yin, Ming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 6783 - 6802
  • [35] Action Recognition Based on Features Fusion and 3D Convolutional Neural Networks
    Liu, Lulu
    Hu, Fangyu
    Zhou, Jiahui
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 178 - 181
  • [36] TIME-ASYMMETRIC 3D CONVOLUTIONAL NEURAL NETWORKS FOR ACTION RECOGNITION
    Wu, Chengjie
    Han, Jiayue
    Li, Xiaoqiang
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 21 - 25
  • [37] Basketball technique action recognition using 3D convolutional neural networks
    Wang, Jingfei
    Zuo, Liang
    Martinez, Carlos Cordente
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [38] An efficient attention module for 3d convolutional neural networks in action recognition
    Jiang, Guanghao
    Jiang, Xiaoyan
    Fang, Zhijun
    Chen, Shanshan
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7043 - 7057
  • [39] SPATIOTEMPORAL PYRAMID POOLING IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR ACTION RECOGNITION
    Cheng, Cheng
    Lv, Pin
    Su, Bing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3468 - 3472
  • [40] An efficient attention module for 3d convolutional neural networks in action recognition
    Guanghao Jiang
    Xiaoyan Jiang
    Zhijun Fang
    Shanshan Chen
    Applied Intelligence, 2021, 51 : 7043 - 7057