Multi-perspective and multi-modality joint representation and recognition model for 3D action recognition

Cited by: 48
Authors
Gao, Z. [1 ,2 ]
Zhang, H. [1 ,2 ]
Xu, G. P. [1 ,2 ]
Xue, Y. B. [1 ,2 ]
Affiliations
[1] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Tianjin 300384, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
3D Action recognition; Difference motion history image; Multi-perspective projection; Multi-modality feature; PHOG; MMJRR; 3-D OBJECT RETRIEVAL;
DOI
10.1016/j.neucom.2014.06.085
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a multi-perspective and multi-modality discriminated and joint representation and recognition model for 3D action recognition. Specifically, for depth and RGB image sequences, we construct a novel difference motion history image and then apply multi-perspective projections to capture the target's motion process. A pyramid histogram of oriented gradients (PHOG) is then extracted from each projection to describe the target's motion. Finally, the multi-perspective and multi-modality discriminated and joint representation and recognition model is used to recognize human actions. Large-scale experiments on the challenging, public DHA and MSR-Action3D 3D action datasets show that our difference motion history image performs much better than the traditional motion history image on both modalities, and that our description scheme is robust and efficient. Moreover, the proposed model further improves performance and outperforms state-of-the-art methods, with best results of 90.5% on MSR-Action3D and 98.2% on DHA. (C) 2014 Elsevier B.V. All rights reserved.
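The abstract does not define the difference motion history image, so the sketch below illustrates only the traditional motion history image that it uses as a baseline: each pixel is set to a maximum value when it moves and decays by one step per frame otherwise. The function name `motion_history_image`, the `threshold` and `tau` parameters, and the nested-list frame layout are assumptions made for illustration and do not come from the paper.

```python
def motion_history_image(frames, threshold=10.0, tau=None):
    """Traditional motion history image (baseline only, not the paper's variant).

    frames: sequence of grayscale frames, each a list of rows of numbers.
    threshold: assumed minimum absolute frame difference that counts as motion.
    tau: assumed decay duration in frames; defaults to the number of transitions.
    Returns an image with values in [0, 1]; more recent motion is brighter.
    """
    if len(frames) < 2:
        raise ValueError("need at least two frames")
    height, width = len(frames[0]), len(frames[0][0])
    if tau is None:
        tau = len(frames) - 1

    mhi = [[0.0] * width for _ in range(height)]
    for prev, curr in zip(frames, frames[1:]):
        for y in range(height):
            for x in range(width):
                if abs(curr[y][x] - prev[y][x]) > threshold:
                    # Motion detected: reset the pixel to the maximum value.
                    mhi[y][x] = float(tau)
                else:
                    # No motion: let the stored history fade by one step.
                    mhi[y][x] = max(mhi[y][x] - 1.0, 0.0)

    # Normalize so the result does not depend on the choice of tau.
    return [[value / tau for value in row] for row in mhi]


# A bright dot moving left to right across a 1 x 4 image over four frames.
frames = [
    [[255, 0, 0, 0]],
    [[0, 255, 0, 0]],
    [[0, 0, 255, 0]],
    [[0, 0, 0, 255]],
]
mhi = motion_history_image(frames)
```

In the resulting image, pixels the dot left earlier are dimmer than pixels it touched recently, which is the temporal cue that a gradient descriptor such as PHOG can then summarize.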
Pages: 554-564
Number of pages: 11
Related papers
50 in total
  • [31] Advanced Multi-Perspective Enrolment in Finger Vein Recognition
    Prommegger, Bernhard
    Uhl, Andreas
    2020 8TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF 2020), 2020,
  • [32] Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
    Zhu, Xiaoguang
    Zhu, Ye
    Wang, Haoyu
    Wen, Honglin
    Yan, Yan
    Liu, Peilin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
  • [33] Convolutional non-local spatial-temporal learning for multi-modality action recognition
    Ren, Ziliang
    Yuan, Huaqiang
    Wei, Wenhong
    Zhao, Tiezhu
    Zhang, Qieshi
    ELECTRONICS LETTERS, 2022, 58 (20) : 765 - 767
  • [34] MULTI-MODALITY ANALYSIS OF A 3D PRINTED BIOCOMPATIABLE POLYMER SCAFFOLD
    Sutherland, Nigel
    Shen, Yihong
    Li, Qin
    Zhang, Lihai
    Mo, Xiumei
    van Gaal, William Joseph, III
    Barlis, Peter
    Poon, Eric
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2022, 79 (09) : 2018 - 2018
  • [35] ON THE CAPABILITIES OF A MULTI-MODALITY 3D BIOPRINTER FOR CUSTOMIZED BIOMEDICAL DEVICES
    Ravi, Prashanth
    Shiakolas, Panos S.
    Welch, Tre
    Saini, Tushar
    Guleserian, Kristine
    Batra, Ankit K.
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2015, VOL 2A, 2016,
  • [36] Multi-modality 3D object detection in autonomous driving: A review
    Tang, Yingjuan
    He, Hongwen
    Wang, Yong
    Mao, Zan
    Wang, Haoyu
    NEUROCOMPUTING, 2023, 553
  • [37] A 3D Object Detection Based on Multi-Modality Sensors of USV
    Wu, Yingying
    Qin, Huacheng
    Liu, Tao
    Liu, Hao
    Wei, Zhiqiang
    APPLIED SCIENCES-BASEL, 2019, 9 (03):
  • [38] Learning Disentangled Representation for Multi-View 3D Object Recognition
    Huang, Jingjia
    Yan, Wei
    Li, Ge
    Li, Thomas
    Liu, Shan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 646 - 659
  • [39] OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
    Cheng, Xize
    Jin, Tao
    Li, Linjun
    Lin, Wang
    Duan, Xinyu
    Zhao, Zhou
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6592 - 6607
  • [40] Modality Mixer for Multi-modal Action Recognition
    Lee, Sumin
    Woo, Sangmin
    Park, Yeonju
    Nugroho, Muhammad Adi
    Kim, Changick
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3297 - 3306