MCDGait: multimodal co-learning distillation network with spatial-temporal graph reasoning for gait recognition in the wild

被引:0
|
作者
Xiong, Jianbo [1 ]
Zou, Shinan [1 ]
Tang, Jin [1 ]
Tjahjadi, Tardi [2 ]
机构
[1] Cent South Univ, Sch Automation, Changsha, Peoples R China
[2] Univ Warwick, Sch Engn, Coventry, England
来源
VISUAL COMPUTER | 2024年 / 40卷 / 10期
关键词
Biometrics; Human identification; Gait recognition; Multimodal co-learning distillation; Spatial-temporal graph reasoning;
D O I
10.1007/s00371-024-03426-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Gait recognition in the wild has attracted the attention of the academic community. However, existing unimodal algorithms cannot achieve the same performance on in-the-wild datasets as in-the-lab datasets because unimodal data have many limitations in-the-wild environments. Therefore, we propose a multimodal approach combining silhouettes and skeletons and formulate the multimodal gait recognition problem as a multimodal co-learning problem. In particular, we propose a multimodal co-learning distillation network (MCDGait) that integrates two sub-networks processing unimodal data into a single fusion network. Based on the semantic consistency of different modalities and the paradigm of deep mutual learning, the performance of the entire network is continuously improved via the bidirectional knowledge distillation between the sub-networks and fusion network. Inspired by the observation that specific body parts or joints exhibit unique motion characteristics and have linkage with other parts or joints during walking, we propose a spatial-temporal graph reasoning module (ST-GRM). This module represents the parts or joints as graph nodes and the motion linkages between them as edges. By utilizing dynamic graph generator, the module implicitly captures the dynamic changes of the human body. Based on the generated graphs, the independent spatial-temporal linkage feature of each part and the interactive spatial-temporal linkage feature are aggregated simultaneously. Extensive experiments conducted on two in-the-wild datasets demonstrate the state-of-the-art performance of the proposed method. The average rank-1 accuracy on datasets Gait3D and GREW is 50.90% and 58.06%, respectively. The source code can be obtained from https://github.com/BoyeXiong/MCDGait.
引用
收藏
页码:7221 / 7234
页数:14
相关论文
共 50 条
  • [41] Graph Attention Spatial-Temporal Network for Deep Learning Based Mobile Traffic Prediction
    He, Kaiwen
    Huang, Yufen
    Chen, Xu
    Zhou, Zhi
    Yu, Shuai
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [42] MST-GAT: A multimodal spatial-temporal graph attention network for time series anomaly detection
    Ding, Chaoyue
    Sun, Shiliang
    Zhao, Jing
    INFORMATION FUSION, 2023, 89 : 527 - 536
  • [43] STA-GCN:Spatial Temporal Adaptive Graph Convolutional Network for Gait Emotion Recognition
    Chen, Chuang
    Sun, Xiao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1385 - 1390
  • [44] Spatial-temporal attention with graph and general neural network-based sign language recognition
    Miah, Abu Saleh Musa
    Hasan, Md. Al Mehedi
    Okuyama, Yuichi
    Tomioka, Yoichi
    Shin, Jungpil
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [45] Emotion recognition using spatial-temporal EEG features through convolutional graph attention network
    Li, Zhongjie
    Zhang, Gaoyan
    Wang, Longbiao
    Wei, Jianguo
    Dang, Jianwu
    JOURNAL OF NEURAL ENGINEERING, 2023, 20 (01)
  • [46] Multi-granular spatial-temporal synchronous graph convolutional network for robust action recognition
    Li, Chang
    Huang, Qian
    Mao, Yingchi
    Li, Xing
    Wu, Jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257
  • [47] Automatic Traffic Anomaly Detection on the Road Network with Spatial-Temporal Graph Neural Network Representation Learning
    Zhang, Hengyuan
    Zhao, Suyao
    Liu, Ruiheng
    Wang, Wenlong
    Hong, Yixin
    Hu, Runjiu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [48] A novel spatial-temporal graph convolution network based on temporal embedding graph structure learning for multivariate time series prediction
    Lei, Tianyang
    Li, Jichao
    Yang, Kewei
    Gong, Chang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [49] Spatial-temporal interaction learning based two-stream network for action recognition
    Liu, Tianyu
    Ma, Yujun
    Yang, Wenhan
    Ji, Wanting
    Wang, Ruili
    Jiang, Ping
    INFORMATION SCIENCES, 2022, 606 : 864 - 876
  • [50] Meta-learning based spatial-temporal graph attention network for traffic signal control
    Wang, Min
    Wu, Libing
    Li, Man
    Wu, Dan
    Shi, Xiaochuan
    Ma, Chao
    KNOWLEDGE-BASED SYSTEMS, 2022, 250