Machine Learning Based Action Recognition with Modular CNN

被引:0
|
作者
Huang, Shi-Zong [1 ]
Chiu, Ching-Te [1 ]
Chang, Yu-Jen [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Commun, Hsinchu, Taiwan
关键词
Action Recognition; Deep Convolutional Networks; Real-time Computing; Dynamic Sampling Learning;
D O I
10.1109/APSIPAASC58517.2023.10317425
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When building models for action recognition, 3D convolutional neural networks (CNNs) are commonly used. However, 3D CNNs also increase the model parameters significantly. We propose two methods, image segmentation and dynamic sampling learning to reduce network parameters and required memory access. Using image segmentation to keep the location of the action and remove the background of each image reduces the size of the feature map. Dynamic sampling learning allows the model to learn from low sampling rates without adding additional parameters, and to maintain performance while reducing the number of images. In order to implement the overall model in hardware for edge devices, we limit the kernel sizes of the 2D convolution layers and 3D convolution layers in the model to only 3x3 and 3x3x3 respectively. We perform experiments on HMDB51 [1] and UCF101 [2] datasets respectively with our proposed model. The accuracy of our proposed method achieve 7.2% and 5.9% reduction compared with DS-GRU2021 [3]. However, the number of parameters of our model is 30% fewer and execution speed x180 faster than DS-GRU2021 [3].
引用
收藏
页码:211 / 216
页数:6
相关论文
共 50 条
  • [31] HybridNet: Integrating GCN and CNN for skeleton-based action recognition
    Wenjie Yang
    Jianlin Zhang
    Jingju Cai
    Zhiyong Xu
    Applied Intelligence, 2023, 53 : 574 - 585
  • [32] Continuous Action Recognition Based on Hybrid CNN-LDCRF Model
    Lei, Jun
    Li, Guohui
    Li, Shuohao
    Tu, Dan
    Guo, Qiang
    2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 63 - 69
  • [33] Gait Recognition Based on GF-CNN and Metric Learning
    Wen, Junqin
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2020, 16 (05): : 1105 - 1112
  • [34] Residual Learning Based CNN for Gesture Recognition in Robot Interaction
    Han, Hua
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (02): : 385 - 398
  • [35] CNN based Transfer Learning for Historical Chinese Character Recognition
    Tang, Yejun
    Peng, Liangrui
    Xu, Qian
    Wang, Yanwei
    Furuhata, Akio
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 25 - 29
  • [36] Similarity Learning for CNN-Based ASL Alphabet Recognition
    Fierro Radilla, Atoany Nazareth
    Perez Daniel, Karina Ruby
    Benitez-Garcia, Gibran
    Najera Garcia, Pedro
    Valdez, Ramona Fuentes
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 633 - 645
  • [37] Semi-CNN Architecture for Effective Spatio-Temporal Learning in Action Recognition
    Leong, Mei Chee
    Prasad, Dilip K.
    Lee, Yong Tsui
    Lin, Feng
    APPLIED SCIENCES-BASEL, 2020, 10 (02):
  • [38] LEARNING GEOMETRIC FEATURES WITH DUAL - STREAM CNN FOR 3D ACTION RECOGNITION
    Thien Huynh-The
    Hua, Cam-Hao
    Nguyen Anh Tu
    Kim, Dong-Seong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2353 - 2357
  • [39] Face recognition based on extreme learning machine
    Zong, Weiwei
    Huang, Guang-Bin
    NEUROCOMPUTING, 2011, 74 (16) : 2541 - 2551
  • [40] Machine Learning based Face Recognition System
    Srinivas, N.
    Suryanarayana, Vadhri
    Babu, B. Hari
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (03) : 1532 - 1539