A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities

被引:21
|
作者
Wang, Huogen [1 ,2 ]
Song, Zhanjie [3 ]
Li, Wanqing [2 ]
Wang, Pichao [4 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Univ Wollongong, Adv Multimedia Res Lab, Wollongong, NSW 2522, Australia
[3] Tianjin Univ, Sch Math, Tianjin 300350, Peoples R China
[4] Alibaba Grp US Inc, Bellevue, WA 98004 USA
基金
中国国家自然科学基金;
关键词
action recognition; weighted rank pooling; weighted dynamic image; 3D convolutional LSTM network; canonical correlation analysis;
D O I
10.3390/s20113305
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ((MI)-I-2) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases).
引用
收藏
页码:1 / 25
页数:25
相关论文
共 50 条
  • [31] A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition
    Deng, Andong
    Yang, Taojiannan
    Chen, Chen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20462 - 20474
  • [32] Human action recognition with a large-scale brain-inspired photonic computer
    Antonik, Piotr
    Marsal, Nicolas
    Brunner, Daniel
    Rontani, Damien
    NATURE MACHINE INTELLIGENCE, 2019, 1 (11) : 530 - 537
  • [33] Efficient large-scale action recognition in videos using extreme learning machines
    Varol, Gul
    Salah, Albert Ali
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 8274 - 8282
  • [34] Human Action Recognition in Large-Scale Datasets Using Histogram of Spatiotemporal Gradients
    Reddy, Kishore K.
    Cuntoor, Naresh
    Perera, Amitha
    Hoogs, Anthony
    2012 IEEE NINTH INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL-BASED SURVEILLANCE (AVSS), 2012, : 106 - 111
  • [35] Fast Hybrid Network Reconfiguration for Large-Scale Lossless Interconnection Networks
    Tasoulas, Evangelos
    Gran, Ernst Gunnar
    Skeie, Tor
    Johnsen, Bjorn Dag
    15TH IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (IEEE NCA 2016), 2016, : 101 - 108
  • [36] Analysis of Large-Scale Hybrid Peer-to-Peer Network Topology
    Xie, Chao
    Pan, Yi
    GLOBECOM 2006 - 2006 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2006,
  • [37] Train rescheduling for large-scale disruptions in a large-scale railway network
    Zhang, Chuntian
    Gao, Yuan
    Cacchiani, Valentina
    Yang, Lixing
    Gao, Ziyou
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2023, 174
  • [38] Learning Large-scale Subsurface Simulations with a Hybrid Graph Network Simulator
    Wu, Tailin
    Wang, Qinchen
    Zhang, Yinan
    Ying, Rex
    Cao, Kaidi
    Sosic, Rok
    Jalali, Ridwan
    Hamam, Hassan
    Maucec, Marko
    Leskovec, Jure
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4184 - 4194
  • [39] Hybrid Communication Network Architectures for Monitoring Large-Scale Wind Turbine
    Ahmed, Mohamed A.
    Kim, Young-Chon
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2013, 8 (06) : 1626 - 1636
  • [40] Large-scale network visualization
    Abello, J
    Koutsofios, E
    Gansner, ER
    North, SC
    COMPUTER GRAPHICS-US, 1999, 33 (03): : 13 - 15