Action Status Based Novel Relative Feature Representations for Interaction Recognition

被引:10
|
作者
Li Yanshan [1 ,2 ]
Guo Tianyu [1 ,2 ]
Liu Xing [1 ,2 ]
Luo Wenhan [3 ]
Xie Weixin [1 ,2 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen 518000, Peoples R China
[2] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518000, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Human action analysis; Interaction recognition; Action status; Multi-stream network; Relative feature representations;
D O I
10.1049/cje.2020.00.088
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Skeleton-based action recognition has always been an important research topic in computer vision. Most of the researchers in this field currently pay more attention to actions performed by a single person while there is very little work dedicated to the identification of interactions between two people. However, the practical application of interaction recognition is actually more critical in our society considering that actions are often performed by multiple people. How to design an effective scheme to learn discriminative spatial and temporal representations for skeleton-based interaction recognition is still a challenging problem. Focusing on the characteristics of skeleton data for interactions, we first define the moving distance to distinguish the action status of the participants. Then some view-invariant relative features are proposed to fully represent the spatial and temporal relationship of the skeleton sequence. Further, a new coding method is proposed to obtain the novel relative feature representations. Finally, we design a three-stream CNN model to learn deep features for interaction recognition. We evaluate our method on SBU dataset, NTU RGB+D 60 dataset and NTU RGB+D 120 dataset. The experimental results also verify that our method is effective and exhibits great robustness compared with current state-of-the-art methods.
引用
收藏
页码:168 / 180
页数:13
相关论文
共 50 条
  • [31] STV-based video feature processing for action recognition
    Wang, Jing
    Xu, Zhijie
    SIGNAL PROCESSING, 2013, 93 (08) : 2151 - 2168
  • [32] Action Recognition of Temporal Segment Network Based on Feature Fusion
    Li H.
    Ding Y.
    Li C.
    Zhang S.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
  • [33] Temporal Feature Weighting for Prototype-Based Action Recognition
    Mauthner, Thomas
    Roth, Peter M.
    Bischof, Horst
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 566 - 579
  • [34] Human action recognition based on AdaBoost algorithm for feature extraction
    Ji, Xiaofei
    Zhou, Lu
    Li, Yibo
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 801 - 805
  • [35] Temporal Segment Networks Based on Feature Propagation for Action Recognition
    Shi Y.
    Zeng Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (04): : 582 - 589
  • [36] Video Based Action Recognition using Spatial and Temporal Feature
    Dai, Cheng
    Liu, Xingang
    Zhong, Luhao
    Yu, Tao
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 635 - 638
  • [37] Research on Human Action Recognition Algorithm Based on Sine Feature
    Zhang, Haiyun
    Dou, Huazhou
    Li, Bo
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [38] Human Action Recognition Based on Motion Feature and Manifold Learning
    Wang, Jun
    Xia, Limin
    Ma, Wentao
    IEEE ACCESS, 2021, 9 : 89287 - 89299
  • [39] Parallel Absolute-Relative Feature Based Phonotactic Language Recognition
    Liu, Weiwei
    Zhang, Wei-Qiang
    Li, Zhiyi
    Liu, Jia
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 59 - 63
  • [40] Learning feature representations for an object recognition system
    Welke, Kai
    Oztop, Erhan
    Ude, Ales
    Dillmann, Ruediger
    Cheng, Gordon
    2006 6TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, VOLS 1 AND 2, 2006, : 290 - +