Recognizing micro actions in videos by learning multi-layer local features

被引:1
|
作者
Mi, Yang [1 ]
Liu, Zhihao [2 ]
Zhao, Kai [3 ]
Wang, Song [4 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Dept Data Sci & Engn, Beijing 100083, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing 100044, Peoples R China
[3] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[4] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29201 USA
关键词
Action recognition; Micro action; Lower-level layers; Local features; ACTION RECOGNITION; CNN;
D O I
10.1016/j.patrec.2022.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing micro actions such as slight head shaking or hand clapping from videos can be challenging since they only involve small movements of local body parts. In this paper, we propose to fuse features from both higher-level and lower-level layers of convolutional neural networks for improving the accuracy of micro-action recognition. Deep features in higher-level layers have been shown to be effective in rec-ognizing general actions, such as biking and jumping, that involve relatively large movements. Different from features in higher-level layers, features in lower-level layers are usually of higher resolution and can help capture small motions in micro actions. In this paper, we employ class-discriminative information as a guidance in lower-level layers to learn local features that are highly associated with micro-action regions. In the experiments, we evaluate the proposed method on two micro-action video datasets and achieve new state-of-the-art performance. We also test the proposed method on two general-action video datasets with promising performance.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:55 / 62
页数:8
相关论文
共 50 条
  • [1] Recognizing the staircase of multi-layer dwelling
    Chen, Yeming
    Xinjianzhu/New Architecture, 1998, (60): : 31 - 32
  • [2] Leveraging Textural Features for Recognizing Actions in Low Quality Videos
    Rahman, Saimunur
    See, John
    Ho, Chiung Ching
    9TH INTERNATIONAL CONFERENCE ON ROBOTIC, VISION, SIGNAL PROCESSING AND POWER APPLICATIONS: EMPOWERING RESEARCH AND INNOVATION, 2017, 398 : 237 - 245
  • [3] Learning specific and conserved features of multi-layer networks
    Wu, Wenming
    Yang, Tao
    Ma, Xiaoke
    Zhang, Wensheng
    Li, He
    Huang, Jianbin
    Li, Yanni
    Cui, Jiangtao
    INFORMATION SCIENCES, 2023, 622 : 930 - 945
  • [4] A local supervised learning algorithm for multi-layer perceptrons
    Vlachos, DS
    ICNAAM 2004: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2004, 2004, : 452 - 454
  • [5] Recognizing Micro-Actions and Reactions from Paired Egocentric Videos
    Yonetani, Ryo
    Kitani, Kris M.
    Sato, Yoichi
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2629 - 2638
  • [6] RECOGNIZING MICRO ACTIONS IN VIDEOS: LEARNING MOTION DETAILS VIA SEGMENT-LEVEL TEMPORAL PYRAMID
    Mi, Yang
    Wang, Song
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1036 - 1041
  • [7] Local learning for multi-layer, multi-component predictive system
    Al-Jubouri, Bassma
    Gabrys, Bogdan
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 723 - 732
  • [8] Research on recognizing flotation states based on image texture features and multi-layer SVMs
    Wang, Jie-Sheng
    Gao, Xian-Wen
    Zhang, Yong
    Kongzhi yu Juece/Control and Decision, 2010, 25 (10): : 1523 - 1526
  • [9] Local Invariance Representation Learning Algorithm with Multi-layer Extreme Learning Machine
    Jia, Xibin
    Li, Xiaobo
    Du, Hua
    Bhanu, Bir
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 505 - 513
  • [10] Learning deep representation and discriminative features for clustering of multi-layer networks
    Wu, Wenming
    Ma, Xiaoke
    Wang, Quan
    Gong, Maoguo
    Gao, Quanxue
    NEURAL NETWORKS, 2024, 170 : 405 - 416