Machine Learning Based Action Recognition with Modular CNN

Cited by: 0

Authors:

Huang, Shi-Zong [1]

Chiu, Ching-Te [1]

Chang, Yu-Jen [1]

Affiliation:

[1] Natl Tsing Hua Univ, Dept Commun, Hsinchu, Taiwan

Source:

2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC | 2023

Keywords:

Action Recognition; Deep Convolutional Networks; Real-time Computing; Dynamic Sampling Learning;

DOI:

10.1109/APSIPAASC58517.2023.10317425

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

3D convolutional neural networks (CNNs) are commonly used to build action recognition models, but they significantly increase the number of model parameters. We propose two methods, image segmentation and dynamic sampling learning, to reduce network parameters and required memory access. Image segmentation keeps the location of the action and removes the background of each image, which reduces the size of the feature map. Dynamic sampling learning allows the model to learn from low sampling rates without adding parameters and to maintain performance while reducing the number of images. To implement the overall model in hardware for edge devices, we limit the kernel sizes of the 2D and 3D convolution layers to 3x3 and 3x3x3, respectively. We evaluate the proposed model on the HMDB51 [1] and UCF101 [2] datasets. Compared with DS-GRU2021 [3], its accuracy is 7.2% and 5.9% lower, respectively; however, our model has 30% fewer parameters and runs 180x faster.
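The parameter growth mentioned in the abstract can be illustrated with a small calculation. The sketch below is not the authors' model: the channel counts (64 in, 64 out) and the helper names `conv_params` and `sample_frames` are hypothetical, and the uniform frame subsampling stands in for a low sampling rate only; it is not the paper's dynamic sampling learning.

```python
def conv_params(c_in, c_out, kernel):
    """Weights plus biases of one convolution layer with the given kernel shape."""
    n = c_in * c_out
    for k in kernel:
        n *= k
    return n + c_out  # one bias per output channel


def sample_frames(num_frames, rate):
    """Indices kept when taking every `rate`-th frame (plain uniform subsampling)."""
    return list(range(0, num_frames, rate))


# Hypothetical layer width: 64 input and 64 output channels.
p2d = conv_params(64, 64, (3, 3))     # 3x3 kernel
p3d = conv_params(64, 64, (3, 3, 3))  # 3x3x3 kernel

print(p2d, p3d)              # 36928 110656
print(sample_frames(16, 4))  # [0, 4, 8, 12]
```

With the same channel widths, the 3x3x3 layer carries roughly three times the weights of the 3x3 layer, and a lower sampling rate shrinks the number of frames the network has to process.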


Pages: 211-216

Page count: 6
