Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

Authors
Mogan, Jashila Nair [1]
Lee, Chin Poo [1]
Lim, Kian Ming [2]
Affiliations
[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL
DOI
10.1109/ACCESS.2024.3439602
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
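
The two data-handling steps described in the abstract, averaging gait frames irrespective of gait cycles and concatenating flattened per-model features, can be sketched roughly as follows. This is a minimal NumPy illustration under stated assumptions, not the authors' implementation: the function names `windowed_gei` and `fuse_features`, the `window` and `stride` parameters, and all array shapes are hypothetical, since the abstract does not give the window length, the overlap, or the backbone output sizes. The CNN and Transformer backbones and the MLP classifiers are not modeled here; random arrays stand in for their outputs.

```python
import numpy as np


def windowed_gei(frames, window, stride):
    """Average each run of `window` consecutive silhouette frames.

    Windows are taken at fixed offsets, ignoring gait-cycle boundaries.
    `window` and `stride` are assumed parameters, not values from the paper.
    frames: array of shape (T, H, W). Returns (num_windows, H, W).
    """
    frames = np.asarray(frames, dtype=np.float32)
    if frames.ndim != 3:
        raise ValueError("frames must have shape (T, H, W)")
    if window < 1 or stride < 1:
        raise ValueError("window and stride must be positive")
    if len(frames) < window:
        raise ValueError("sequence is shorter than one window")
    starts = range(0, len(frames) - window + 1, stride)
    return np.stack([frames[s:s + window].mean(axis=0) for s in starts])


def fuse_features(feature_maps):
    """Feature-level fusion: flatten each model's output per sample,
    then concatenate along the feature axis.

    feature_maps: list of arrays, each of shape (N, ...), same N.
    Returns an array of shape (N, total_features).
    """
    flat = [np.asarray(f).reshape(len(f), -1) for f in feature_maps]
    return np.concatenate(flat, axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical sequence of 10 binary silhouettes, 64 x 44 pixels each.
    silhouettes = (rng.random((10, 64, 44)) > 0.5).astype(np.float32)
    geis = windowed_gei(silhouettes, window=4, stride=2)

    # Stand-ins for backbone outputs (shapes are illustrative only).
    cnn_features = rng.random((len(geis), 8, 2, 2))
    vit_features = rng.random((len(geis), 16))
    fused = fuse_features([cnn_features, vit_features])
    print(geis.shape, fused.shape)
```

In the pipeline the abstract describes, the fused vector would then be passed to a further MLP for subject classification; that stage is omitted from this sketch.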
Pages: 108573-108583
Number of pages: 11