Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

被引:0
|
作者
Mogan, Jashila Nair [1 ]
Lee, Chin Poo [1 ]
Lim, Kian Ming [2 ]
机构
[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL;
D O I
10.1109/ACCESS.2024.3439602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
引用
收藏
页码:108573 / 108583
页数:11
相关论文
共 50 条
  • [31] Feature-level fusion approaches based on multimodal EEG data for depression recognition
    Cai, Hanshu
    Qu, Zhidiao
    Li, Zhe
    Zhang, Yi
    Hu, Xiping
    Hu, Bin
    INFORMATION FUSION, 2020, 59 (59) : 127 - 138
  • [32] Finger-vein and fingerprint recognition based on a feature-level fusion method
    Yang, Jinfeng
    Hong, Bofeng
    FIFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2013), 2013, 8878
  • [33] Feature-level Fusion of Deep Convolutional Neural Networks for Sketch Recognition on Smartphones
    Boyaci, Emel
    Sert, Mustafa
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
  • [34] Alternative Deep Learning Architectures for Feature-Level Fusion in Human Activity Recognition
    Maitre, Julien
    Bouchard, Kevin
    Gaboury, Sebastien
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (05): : 2076 - 2086
  • [35] Palm vein recognition based on multi-sampling and feature-level fusion
    Yan, Xuekui
    Kang, Wenxiong
    Deng, Feiqi
    Wu, Qiuxia
    NEUROCOMPUTING, 2015, 151 : 798 - 807
  • [36] Remote Sensing Crop Water Stress Determination Using CNN-ViT Architecture
    Lehouel, Kawtar
    Saber, Chaima
    Bouziani, Mourad
    Yaagoubi, Reda
    AI, 2024, 5 (02) : 618 - 634
  • [37] Alternative Deep Learning Architectures for Feature-Level Fusion in Human Activity Recognition
    Julien Maitre
    Kevin Bouchard
    Sébastien Gaboury
    Mobile Networks and Applications, 2021, 26 : 2076 - 2086
  • [38] Speech emotion classification using feature-level and classifier-level fusion
    Siba Prasad Mishra
    Pankaj Warule
    Suman Deb
    Evolving Systems, 2024, 15 : 541 - 554
  • [39] Speech emotion classification using feature-level and classifier-level fusion
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    EVOLVING SYSTEMS, 2024, 15 (02) : 541 - 554
  • [40] Multimodal feature fusion for CNN-based gait recognition: an empirical comparison
    Francisco M. Castro
    Manuel J. Marín-Jiménez
    Nicolás Guil
    Nicolás Pérez de la Blanca
    Neural Computing and Applications, 2020, 32 : 14173 - 14193