Machine Learning Based Action Recognition with Modular CNN

被引:0
|
作者
Huang, Shi-Zong [1 ]
Chiu, Ching-Te [1 ]
Chang, Yu-Jen [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Commun, Hsinchu, Taiwan
关键词
Action Recognition; Deep Convolutional Networks; Real-time Computing; Dynamic Sampling Learning;
D O I
10.1109/APSIPAASC58517.2023.10317425
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When building models for action recognition, 3D convolutional neural networks (CNNs) are commonly used. However, 3D CNNs also increase the model parameters significantly. We propose two methods, image segmentation and dynamic sampling learning to reduce network parameters and required memory access. Using image segmentation to keep the location of the action and remove the background of each image reduces the size of the feature map. Dynamic sampling learning allows the model to learn from low sampling rates without adding additional parameters, and to maintain performance while reducing the number of images. In order to implement the overall model in hardware for edge devices, we limit the kernel sizes of the 2D convolution layers and 3D convolution layers in the model to only 3x3 and 3x3x3 respectively. We perform experiments on HMDB51 [1] and UCF101 [2] datasets respectively with our proposed model. The accuracy of our proposed method achieve 7.2% and 5.9% reduction compared with DS-GRU2021 [3]. However, the number of parameters of our model is 30% fewer and execution speed x180 faster than DS-GRU2021 [3].
引用
收藏
页码:211 / 216
页数:6
相关论文
共 50 条
  • [1] Dynamic action recognition based on dynemes and Extreme Learning Machine
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1890 - 1898
  • [2] Aircraft recognition using modular extreme learning machine
    Rong, Hai-Jun
    Jia, Ya-Xin
    Zhao, Guang-She
    NEUROCOMPUTING, 2014, 128 : 166 - 174
  • [3] Multisource learning for skeleton-based action recognition using deep LSTM and CNN
    Cui, Ran
    Zhu, Aichun
    Hua, Gang
    Yin, Hongsheng
    Liu, Haiqiang
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
  • [4] Research on classification and recognition of badminton batting action based on machine learning
    Sun Liwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (05) : 6241 - 6252
  • [5] A Machine Learning Approach for Human Action Recognition
    Susmitha, A.
    Sunanda
    Mohanty, Mihir Narayan
    Hota, Sarbeswara
    AMBIENT INTELLIGENCE IN HEALTH CARE, ICAIHC 2022, 2023, 317 : 443 - 450
  • [6] P-CNN: Pose-based CNN Features for Action Recognition
    Cheron, Guilhem
    Laptev, Ivan
    Schmid, Cordelia
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3218 - 3226
  • [7] Towards a Modular Machine Learning Architecture for Human Activity Recognition
    Schroth, Marc
    Birkenmaier, Dennis
    Stork, Wilhelm
    2024 IEEE SENSORS APPLICATIONS SYMPOSIUM, SAS 2024, 2024,
  • [8] Research on basketball players' action recognition based on interactive system and machine learning
    Li, Jin
    Gu, Deping
    Journal of Intelligent and Fuzzy Systems, 2021, 40 (02): : 2029 - 2039
  • [9] Human action recognition using extreme learning machine based on visual vocabularies
    Minhas, Rashid
    Baradarani, Aryaz
    Seifzadeh, Sepideh
    Wu, Q. M. Jonathan
    NEUROCOMPUTING, 2010, 73 (10-12) : 1906 - 1917
  • [10] Research on basketball players' action recognition based on interactive system and machine learning
    Li, Jin
    Gu, Deping
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 2029 - 2039