Multi-perspective and multi-modality joint representation and recognition model for 3D action recognition

被引:48
|
作者
Gao, Z. [1 ,2 ]
Zhang, H. [1 ,2 ]
Xu, G. P. [1 ,2 ]
Xue, Y. B. [1 ,2 ]
机构
[1] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
3D Action recognition; Difference motion history image; Multi-perspective projection; Multi-modality feature; PHOG; MMJRR; 3-D OBJECT RETRIEVAL;
D O I
10.1016/j.neucom.2014.06.085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we proposed multi-perspective and multi-modality discriminated and joint representation and recognition model for 3D action recognition. Specifically, for depth and RGB image sequence, we construct a novel difference motion history image, and then propose multi-perspective projections to capture the target motion process, after that, pyramid histogram of orientated gradients is extracted for each projection to describe the target motion, finally, multi-perspective and multi-modality discriminated and joint representation and recognition, model is proposed to recognize human action. Large scale experimental results on challenging and public DHA 3D and MSR-Action3D action datasets show that the performances of our difference motion history image on two modalities are much better than traditional motion history image, at the same time, our description scheme is also very robust and efficient, what is more, our proposed multi-perspective and multi-modality discriminated and joint representation and recognition model further improves the performance, which outperforms the state-of-the-art methods, and whose best performances on MSR-Action3D and DHA datasets reach 90.5% and 98.2% respectively. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:554 / 564
页数:11
相关论文
共 50 条
  • [41] HarMI: Human Activity Recognition Via Multi-Modality Incremental Learning
    Zhang, Xiao
    Yu, Hongzheng
    Yang, Yang
    Gu, Jingjing
    Li, Yujun
    Zhuang, Fuzhen
    Yu, Dongxiao
    Ren, Zhaochun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (03) : 939 - 951
  • [42] Multi-modality Network with Visual and Geometrical Information for Micro Emotion Recognition
    Guo, Jianzhu
    Zhou, Shuai
    Wu, Jinlin
    Wan, Jun
    Zhu, Xiangyu
    Lei, Zhen
    Li, Stan Z.
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 814 - 819
  • [43] MULTI-MODALITY RECOGNITION OF HUMAN FACE AND EAR BASED ON DEEP LEARNING
    Fan, Ting-Yu
    Mu, Zhi-Chun
    Yang, Ru-Yin
    2017 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2017, : 38 - 42
  • [44] Pedestrian recognition by using a kernel-based multi-modality approach
    Sirbu, Adela-Maria
    Rogozan, Alexandrina
    Diosan, Laura
    Bensrhair, Abdelaziz
    16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, : 258 - 263
  • [45] Multi-Modality Mobile Image Recognition Based on Thermal and Visual Cameras
    Lai, Jui-Hsin
    Lin, Chung-Ching
    Chen, Chun-Fu
    Lin, Ching-Yung
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 477 - 482
  • [46] FREQUENCYGRAMS AND MULTI- FEATURE JOINT SPARSE REPRESENTATION FOR ACTION AND GESTURE RECOGNITION
    Sandhan, Tushar
    Choi, Jin Young
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1450 - 1454
  • [47] MMJN: Multi-Modal Joint Networks for 3D Shape Recognition
    Nie, Weizhi
    Liang, Qi
    Liu, An-An
    Mao, Zhendong
    Li, Yangyang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 908 - 916
  • [48] Hidden Markov models for multi-perspective radar target recognition
    Cui, Jingjing
    Gudnason, Jon
    Brookes, Mike
    2008 IEEE RADAR CONFERENCE, VOLS. 1-4, 2008, : 1937 - 1941
  • [49] A Novel Two-Stream Transformer-Based Framework for Multi-Modality Human Action Recognition
    Shi, Jing
    Zhang, Yuanyuan
    Wang, Weihang
    Xing, Bin
    Hu, Dasha
    Chen, Liangyin
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [50] Integral vision: a multi-perspective approach to the recognition of graduate attributes
    Haigh, Martin
    Clifford, Valerie A.
    HIGHER EDUCATION RESEARCH & DEVELOPMENT, 2011, 30 (05) : 573 - 584