Semantic features and high-order physical features fusion for action recognition

Cited by: 5
Authors
Xia, Limin [1]
Ma, Wentao [1]
Feng, Lu [1]
Affiliations
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action recognition; Attention mechanism; Semantic adaptation; Feature fusion; Two-stream network; EFFICIENT; NETWORK; JOINT;
DOI
10.1007/s10586-021-03346-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Human action recognition (HAR) is one of the most challenging tasks in computer vision, owing to factors such as complex backgrounds and ambiguous actions. To tackle these issues, we propose a novel action recognition framework called Semantic Feature and High-order Physical Feature Fusion (SF-HPFF). Concretely, we first apply an attention pooling module with a low-rank approximation to remove information from irrelevant complex backgrounds and thus capture the target motion region of interest. On this basis, motion features based on the physical characteristics of the flow field and semantic features based on word embedding are developed to distinguish ambiguous behaviors. These features are low-dimensional and highly discriminative, which helps to reduce the computational burden significantly while maintaining excellent recognition performance. Finally, a cascaded convolutional fusion network is adopted to fuse the features and perform classification. Results from multiple experiments validate that the proposed SF-HPFF outperforms state-of-the-art action recognition methods.
Pages: 3515-3529
Page count: 15