mIV3Net: modified inception V3 network for hand gesture recognition

被引：0

作者：

Bhumika Karsh

R. H. Laskar

R. K. Karsh

机构：

[1] NIT Silchar,Speech and Image Processing Laboratory, Department of ECE

来源：

Multimedia Tools and Applications | 2024年 / 83卷

关键词：

Hand gesture recognition (HGR); Inception V3; Sign language recognition; Deep learning; Transfer learning; Human–computer interaction (HCI);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Hand gesture plays an important role in communication among the hearing and speech disorders people. Hand gesture recognition (HGR) is the backbone of human–computer interaction (HCI). Most of the reported hand gesture recognition techniques suffer due to the complex backgrounds. As per the literature, most of the existing HGR methods have only selected a few inter-class similar gestures for recognition performance. This paper proposes a two-phase deep learning-based HGR system to mitigate the complex background issue and consider all gesture classes. In the first phase, inception V3 architecture is improved and named mIV3Net: modified inception V3 network to reduce the computational resource requirement. In the second phase, mIV3Net has been fine-tuned to offer more attention to prominent features. As a result, better abstract knowledge has been used for gesture recognition. Hence, the proposed algorithm has more discrimination characteristics. The efficacy of the proposed two-phase-based HGR system is validated and generalized through experimentation using five publicly available standard datasets: MUGD, ISL, ArSL, NUS-I, and NUS-II. The accuracy values of the proposed system on five datasets in the above order are 97.14%, 99.3%, 97.4%, 99%, and 99.8%, which indicates significant improvement, i.e., 12.58%, 2.54%, 2.73%, 0.56%, and 2.02%, respectively, than the state-of-the-art HGR systems.

引用

页码：10587 / 10613

页数：26

共 50 条

[21] Tomato Plant Leaf Disease Detection Using Inception V3
Baheti, Harsh
Thakare, Anuradha
Bhople, Yash
Darekar, Sudarshan
Dodmani, Om
INTELLIGENT SYSTEMS AND APPLICATIONS, ICISA 2022, 2023, 959 : 49 - 60
[22] 基于Inception V3的图像状态分类技术
王旖旎
液晶与显示, 2020, (04) : 389 - 394
[23] 基于GoogLeNet Inception V3的迁移学习研究
薛晨兴
张军
邢家源
无线电工程, 2020, 50 (02) : 118 - 122
[24] Classification of brain tumors using wavelet transform and Inception v3 convolutional neural network model
Kaya, Zihni
Aslan, Zafer
Gunes, Ali
Okatan, Ali
JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2024, 39 (03): : 1945 - 1952
[25] Next-Gen Dynamic Hand Gesture Recognition: MediaPipe, Inception-v3 and LSTM-Based Enhanced Deep Learning Model
Kwon, Oh-Jin
Kim, Jaeho
Jamil, Sonain
Lee, Jinhee
Ullah, Faiz
ELECTRONICS, 2024, 13 (16)
[26] Survey on 3D Hand Gesture Recognition
Cheng, Hong
Yang, Lu
Liu, Zicheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (09) : 1659 - 1673
[27] 3D separable convolutional neural network for dynamic hand gesture recognition
Hu, Zhongxu
Hu, Youmin
Liu, Jie
Wu, Bo
Han, Dongmin
Kurfess, Thomas
NEUROCOMPUTING, 2018, 318 : 151 - 161
[28] 基于Inception V3的火灾探测算法
高杨
西藏科技, 2021, (08) : 66 - 68+71
[29] Hand gesture recognition using T-CombNET: a net neural network model
Lamar, Marcus Vinicius
Bhuiyan, Md. Shoaib
Iwata, Akira
IEICE Transactions on Information and Systems, 2000, 383 -D (11) : 1986 - 1995
[30] VI-NET: A hybrid deep convolutional neural network using VGG and inception V3 model for copy-move forgery classification
Kumar, Sanjeev
Gupta, Suneet K.
Kaur, Manjit
Gupta, Umesh
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89

← 1 2 3 4 5 →