Semantic features and high-order physical features fusion for action recognition

Cited by: 5
Authors
Xia, Limin [1]
Ma, Wentao [1]
Feng, Lu [1]
Affiliations
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Action recognition; Attention mechanism; Semantic adaptation; Feature fusion; Two-stream network; EFFICIENT; NETWORK; JOINT
DOI
10.1007/s10586-021-03346-9
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Human action recognition (HAR) is one of the most challenging tasks in computer vision because of factors such as complex backgrounds and ambiguous actions. To tackle these issues, we propose a novel action recognition framework called Semantic Feature and High-order Physical Feature Fusion (SF-HPFF). Concretely, we first compute an attention pooling module with a low-rank approximation to remove information from irrelevant complex backgrounds and thus capture the target motion region of interest. On this basis, motion features based on the physical characteristics of the flow field and semantic features based on word embedding are developed to distinguish ambiguous behaviors. These features are low-dimensional and highly discriminative, which helps to reduce the computational burden significantly while maintaining excellent recognition performance. Finally, a cascaded convolutional fusion network is adopted to fuse the features and perform classification. Multiple experimental results show that the proposed SF-HPFF outperforms state-of-the-art action recognition methods.
Pages: 3515 - 3529
Page count: 15
Related papers
50 in total
  • [41] High-order nonnegative blind source separation based on edge features
    Zhao, Mingzhan
    Zheng, Weipeng
    Lv, Yingli
    Du, Chunmei
    Wang, Zhiliang
    Xu, Xiaojun
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4163 - 4170
  • [42] Image retrieval by spatial topology distances with high-order shape features
    Yang, HC
    DOCUMENT RECOGNITION AND RETRIEVAL IX, 2002, 4670 : 82 - 88
  • [43] Vortex beams with high-order cylindrical polarization: features of focal distributions
    Svetlana Nikolaevna Khonina
    Applied Physics B, 2019, 125
  • [44] Multiple depth-levels features fusion enhanced network for action recognition
    Wang, Shengquan
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
  • [45] Deep learning network model based on fusion of spatiotemporal features for action recognition
    Ge Yang
    Wu-xing Zou
    Multimedia Tools and Applications, 2022, 81 : 9875 - 9896
  • [46] Hybrid features for skeleton-based action recognition based on network fusion
    Chen, Zhangmeng
    Pan, Junjun
    Yang, Xiaosong
    Qin, Hong
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [47] A deep multimodal network based on bottleneck layer features fusion for action recognition
    Tej Singh
    Dinesh Kumar Vishwakarma
    Multimedia Tools and Applications, 2021, 80 : 33505 - 33525
  • [48] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
    Weiyao, Xu
    Muqing, Wu
    Min, Zhao
    Ting, Xia
    IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164
  • [49] Multi-Layered Deep Learning Features Fusion for Human Action Recognition
    Kiran, Sadia
    Khan, Muhammad Attique
    Javed, Muhammad Younus
    Alhaisoni, Majed
    Tariq, Usman
    Nam, Yunyoung
    Damaševičius, Robertas
    Sharif, Muhammad
    Computers, Materials and Continua, 2021, 69 (03) : 4061 - 4075
  • [50] Semantic segmentation based on fusion of features and classifiers
    Xue, Yanbing
    Geng, Huiqiang
    Zhang, Hua
    Xue, Zhenshan
    Xu, Guangping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22199 - 22211