A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities

被引:21
|
作者
Wang, Huogen [1 ,2 ]
Song, Zhanjie [3 ]
Li, Wanqing [2 ]
Wang, Pichao [4 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Univ Wollongong, Adv Multimedia Res Lab, Wollongong, NSW 2522, Australia
[3] Tianjin Univ, Sch Math, Tianjin 300350, Peoples R China
[4] Alibaba Grp US Inc, Bellevue, WA 98004 USA
基金
中国国家自然科学基金;
关键词
action recognition; weighted rank pooling; weighted dynamic image; 3D convolutional LSTM network; canonical correlation analysis;
D O I
10.3390/s20113305
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ((MI)-I-2) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases).
引用
收藏
页码:1 / 25
页数:25
相关论文
共 50 条
  • [21] HYBRID ANALYSIS OF A LARGE-SCALE NETWORK BY NODE-TEARING
    TONG, MD
    CHEN, WK
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1986, 321 (05): : 273 - 287
  • [22] Hybrid, large-scale wireless sensor network for missile defense
    Katopodis, Panagiotis
    Katsis, Grigorios
    Walker, Owens
    Tummala, Murati
    Michael, J. Bret
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING, VOLS 1 AND 2, 2007, : 516 - 520
  • [23] ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition
    Wan, Jun
    Lin, Chi
    Wen, Longyin
    Li, Yunan
    Miao, Qiguang
    Escalera, Sergio
    Anbarjafari, Gholamreza
    Guyon, Isabelle
    Guo, Guodong
    Li, Stan Z.
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3422 - 3433
  • [24] Human Action Recognition using Meta Learning for RGB and Depth Information
    Amiri, S. Mohsen
    Pourazad, Mahsa T.
    Nasiopoulos, Panos
    Leung, Victor C. M.
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 363 - 367
  • [25] Training Convolutional Neural Network for Sketch Recognition on Large-Scale Dataset
    Zhou, Wen
    Jia, Jinyuan
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (01) : 82 - 89
  • [26] Toward Large-Scale Face Recognition Using Social Network Context
    Stone, Zak
    Zickler, Todd
    Darrell, Trevor
    PROCEEDINGS OF THE IEEE, 2010, 98 (08) : 1408 - 1415
  • [27] Integration of HIS, RIS, Modalities and a large-scale PACS
    Kotter, E
    Schrader, U
    Allmann, KH
    Einert, A
    Schneider, B
    Langer, M
    CAR '97 - COMPUTER ASSISTED RADIOLOGY AND SURGERY, 1997, 1134 : 538 - 543
  • [28] Large-scale Semantic Mapping and Reasoning with Heterogeneous Modalities
    Pronobis, Andrzej
    Jensfelt, Patric
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 3515 - 3522
  • [29] Fusion Based Deep CNN for Improved Large-Scale Image Action Recognition
    Lavinia, Yukhe
    Vo, Holly H.
    Verma, Abhishek
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 609 - 614
  • [30] Human action recognition with a large-scale brain-inspired photonic computer
    Piotr Antonik
    Nicolas Marsal
    Daniel Brunner
    Damien Rontani
    Nature Machine Intelligence, 2019, 1 : 530 - 537