Convolutional Neural Network applied in mime speech recognition using sEMG data

被引:0
|
作者
Ai, Qing [1 ]
Zhang, Wei [1 ]
Zhang, Bixuan [1 ]
Li, Guang [1 ]
Yang, Meng [2 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
[2] China Univ Min & Technol, Sch Informat Engn, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
关键词
Mime speech recognition; surface electromyography; convolutional neural network;
D O I
10.1109/cac48633.2019.8996926
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Decoding speaking intention from the cervical and facial muscle activity enables speech recognition independent of acoustic signals, thus allowing silent communication between human and computing devices. This research aims to design and optimize the convolutional network classifier for sEMG mime speech signals. The muscles involved in the vocalization and timbre modulation are found according to the anatomical map. Six-channel signals have been collected from the corresponding position using the sEMG signal acquisition device. The original signals are subjected to pre-processing, including noise reduction, active segment detection and interpolation, to form training and testing sets. Convolutional network models are applied to figure out the influence of structural parameters, such as convolution kernel size, the number of convolution kernels and the depth of network, on the recognition accuracy. Based on repeated trials, the optimal convolution network is provided, which provide above 80% accuracy rate.
引用
收藏
页码:3347 / 3352
页数:6
相关论文
共 50 条
  • [21] Multiresolution Convolutional Neural Network For Robust Speech Recognition
    Naderi, Navid
    Nasersharif, Babak
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1459 - 1464
  • [22] Speech emotion recognition based on spiking neural network and convolutional neural network
    Du, Chengyan
    Liu, Fu
    Kang, Bing
    Hou, Tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147
  • [23] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    Christy, A.
    Vaithyasubramanian, S.
    Jesudoss, A.
    Praveena, M. D. Anto
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 381 - 388
  • [24] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    A. Christy
    S. Vaithyasubramanian
    A. Jesudoss
    M. D. Anto Praveena
    International Journal of Speech Technology, 2020, 23 : 381 - 388
  • [25] Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network
    Dua, Sakshi
    Kumar, Sethuraman Sambath
    Albagory, Yasser
    Ramalingam, Rajakumar
    Dumka, Ankur
    Singh, Rajesh
    Rashid, Mamoon
    Gehlot, Anita
    Alshamrani, Sultan S.
    AlGhamdi, Ahmed Saeed
    APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [26] Emotion Classification Based on Convolutional Neural Network Using Speech Data
    Vrebcevic, N.
    Mijic, I.
    Petrinovic, D.
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1007 - 1012
  • [27] Feature selection of mime speech recognition using surface electromyography data
    Zhang, Ming
    Zhang, Wei
    Zhang, Bixuan
    Wang, You
    Li, Guang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3173 - 3178
  • [28] The Impact of Load Style Variation on Gait Recognition Based on sEMG Images Using a Convolutional Neural Network
    Zhang, Xianfu
    Hu, Yuping
    Luo, Ruimin
    Li, Chao
    Tang, Zhichuan
    SENSORS, 2021, 21 (24)
  • [29] A Fuzzy Neural Network Applied in the Speech Recognition System
    Zhang, Xueying
    Wang, Peng
    Li, Gaoyun
    Hou, Wenjun
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 14 - +
  • [30] Facial Expression Recognition using Convolutional Neural Network with Data Augmentation
    Ahmed, Tawsin Uddin
    Hossain, Sazzad
    Hossain, Mohammad Shahadat
    Ul Islam, Raihan
    Andersson, Karl
    2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 336 - 341