mIV3Net: modified inception V3 network for hand gesture recognition

被引:0
|
作者
Bhumika Karsh
R. H. Laskar
R. K. Karsh
机构
[1] NIT Silchar,Speech and Image Processing Laboratory, Department of ECE
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Hand gesture recognition (HGR); Inception V3; Sign language recognition; Deep learning; Transfer learning; Human–computer interaction (HCI);
D O I
暂无
中图分类号
学科分类号
摘要
Hand gesture plays an important role in communication among the hearing and speech disorders people. Hand gesture recognition (HGR) is the backbone of human–computer interaction (HCI). Most of the reported hand gesture recognition techniques suffer due to the complex backgrounds. As per the literature, most of the existing HGR methods have only selected a few inter-class similar gestures for recognition performance. This paper proposes a two-phase deep learning-based HGR system to mitigate the complex background issue and consider all gesture classes. In the first phase, inception V3 architecture is improved and named mIV3Net: modified inception V3 network to reduce the computational resource requirement. In the second phase, mIV3Net has been fine-tuned to offer more attention to prominent features. As a result, better abstract knowledge has been used for gesture recognition. Hence, the proposed algorithm has more discrimination characteristics. The efficacy of the proposed two-phase-based HGR system is validated and generalized through experimentation using five publicly available standard datasets: MUGD, ISL, ArSL, NUS-I, and NUS-II. The accuracy values of the proposed system on five datasets in the above order are 97.14%, 99.3%, 97.4%, 99%, and 99.8%, which indicates significant improvement, i.e., 12.58%, 2.54%, 2.73%, 0.56%, and 2.02%, respectively, than the state-of-the-art HGR systems.
引用
收藏
页码:10587 / 10613
页数:26
相关论文
共 50 条
  • [21] Tomato Plant Leaf Disease Detection Using Inception V3
    Baheti, Harsh
    Thakare, Anuradha
    Bhople, Yash
    Darekar, Sudarshan
    Dodmani, Om
    INTELLIGENT SYSTEMS AND APPLICATIONS, ICISA 2022, 2023, 959 : 49 - 60
  • [22] 基于Inception V3的图像状态分类技术
    王旖旎
    液晶与显示, 2020, (04) : 389 - 394
  • [23] 基于GoogLeNet Inception V3的迁移学习研究
    薛晨兴
    张军
    邢家源
    无线电工程, 2020, 50 (02) : 118 - 122
  • [24] Classification of brain tumors using wavelet transform and Inception v3 convolutional neural network model
    Kaya, Zihni
    Aslan, Zafer
    Gunes, Ali
    Okatan, Ali
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2024, 39 (03): : 1945 - 1952
  • [25] Next-Gen Dynamic Hand Gesture Recognition: MediaPipe, Inception-v3 and LSTM-Based Enhanced Deep Learning Model
    Kwon, Oh-Jin
    Kim, Jaeho
    Jamil, Sonain
    Lee, Jinhee
    Ullah, Faiz
    ELECTRONICS, 2024, 13 (16)
  • [26] Survey on 3D Hand Gesture Recognition
    Cheng, Hong
    Yang, Lu
    Liu, Zicheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (09) : 1659 - 1673
  • [27] 3D separable convolutional neural network for dynamic hand gesture recognition
    Hu, Zhongxu
    Hu, Youmin
    Liu, Jie
    Wu, Bo
    Han, Dongmin
    Kurfess, Thomas
    NEUROCOMPUTING, 2018, 318 : 151 - 161
  • [28] 基于Inception V3的火灾探测算法
    高杨
    西藏科技, 2021, (08) : 66 - 68+71
  • [29] Hand gesture recognition using T-CombNET: a net neural network model
    Lamar, Marcus Vinicius
    Bhuiyan, Md. Shoaib
    Iwata, Akira
    IEICE Transactions on Information and Systems, 2000, 383 -D (11) : 1986 - 1995
  • [30] VI-NET: A hybrid deep convolutional neural network using VGG and inception V3 model for copy-move forgery classification
    Kumar, Sanjeev
    Gupta, Suneet K.
    Kaur, Manjit
    Gupta, Umesh
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89