Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

被引:0
|
作者
Mogan, Jashila Nair [1 ]
Lee, Chin Poo [1 ]
Lim, Kian Ming [2 ]
机构
[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL;
D O I
10.1109/ACCESS.2024.3439602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
引用
收藏
页码:108573 / 108583
页数:11
相关论文
共 50 条
  • [21] CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection
    Sharif, Md. Haidar
    Jiao, Lei
    Omlin, Christian W.
    SENSORS, 2023, 23 (18)
  • [22] Feature-level fusion in personal identification
    Gao, Y
    Maggs, M
    2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 468 - 473
  • [23] An extension of iterative scaling for joint decision-level and feature-level fusion in ensemble classification
    Miller, DJ
    Pal, S
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 61 - 66
  • [24] SPONTANEOUS FACIAL EXPRESSION RECOGNITION BY USING FEATURE-LEVEL FUSION OF VISIBLE AND THERMAL INFRARED IMAGES
    Wang, Zhaoyu
    Wang, Shangfei
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [25] A novel feature-level fusion scheme with multimodal attention CNN for heart sound classification
    Ranipa, Kalpeshkumar
    Zhu, Wei -Ping
    Swamy, M. N. S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 248
  • [26] Traumatic Brain Injury Structure Detection Using Advanced Wavelet Transformation Fusion Algorithm with Proposed CNN-ViT
    Abdullah, Ansar
    Siddique, Ansar
    Fatima, Zulaikha
    Shaukat, Kamran
    INFORMATION, 2024, 15 (10)
  • [27] Accurate Human Recognition by Score-Level and Feature-Level Fusion Using Palm-Phalanges Print
    Gopal
    Srivastava, Smriti
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (02) : 543 - 554
  • [28] Wood species identification using feature-level fusion scheme
    Zhao, Peng
    Dou, Gang
    Chen, Guang-Sheng
    OPTIK, 2014, 125 (03): : 1144 - 1148
  • [29] Dynamic human fatigue detection using feature-level fusion
    Fan, Xiao
    Yin, Bao-Cai
    Sun, Yan-Feng
    IMAGE AND SIGNAL PROCESSING, 2008, 5099 : 94 - 102
  • [30] Feature-level Fusion of Global and Local Features for Finger-vein Recognition
    Yang, Jinfeng
    Zhang, Xu
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 1702 - 1705