Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

Authors
Mogan, Jashila Nair [1]
Lee, Chin Poo [1]
Lim, Kian Ming [2]
Affiliations
[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL
DOI
10.1109/ACCESS.2024.3439602
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
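The two mechanisms named in the abstract, averaging gait frames without regard to gait cycles and concatenating flattened per-model features, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `windowed_gei` and `fuse_features`, the `window` and `stride` parameters, and all array shapes are assumptions, and the record does not state the window size, the backbone models, or the feature dimensions.

```python
import numpy as np


def windowed_gei(frames, window, stride=None):
    """Average silhouette frames over fixed-size windows.

    frames: array of shape (T, H, W), one silhouette per frame.
    window: number of consecutive frames averaged together (assumed parameter).
    stride: step between window starts; defaults to non-overlapping windows.
    No gait-cycle detection is done: windows are cut purely by frame index.
    """
    frames = np.asarray(frames, dtype=np.float32)
    stride = window if stride is None else stride
    if frames.shape[0] < window:
        raise ValueError("sequence is shorter than the window")
    starts = range(0, frames.shape[0] - window + 1, stride)
    return np.stack([frames[s:s + window].mean(axis=0) for s in starts])


def fuse_features(feature_sets):
    """Feature-level fusion: flatten each model's output and concatenate.

    feature_sets: list of arrays, each with the batch size as first dimension.
    Returns an array of shape (batch, sum of flattened feature sizes).
    """
    flat = [np.asarray(f).reshape(np.asarray(f).shape[0], -1) for f in feature_sets]
    return np.concatenate(flat, axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    silhouettes = (rng.random((10, 4, 3)) > 0.5).astype(np.float32)
    geis = windowed_gei(silhouettes, window=5)
    print(geis.shape)  # two windows of five frames each

    # Stand-ins for features from a CNN branch and a Transformer branch.
    cnn_features = rng.random((2, 8, 2, 2))
    vit_features = rng.random((2, 16))
    print(fuse_features([cnn_features, vit_features]).shape)
```

In a full pipeline the fused vector would be passed to a final MLP classifier, as the abstract describes; that stage is omitted here because the record gives no details about its layers.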
Pages: 108573-108583
Page count: 11