A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters

被引:1
|
作者
Mao, Congmin [1 ]
Liu, Sujing [1 ]
机构
[1] Hebei GEO Univ, Huaxin Coll, 69 Wufan Rd,Airport Ind Pk, Shijiazhuang 050700, Hebei, Peoples R China
关键词
English; speech feature parameters; back- propagation neural network; speech recognition; mel- frequency cepstral coefficient;
D O I
10.20965/jaciii.2024.p0679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, from the perspective of English speech feature parameters, two feature parameters, the melfrequency cepstral coefficient (MFCC) and filter bank (Fbank), were selected to identify English speech. The algorithms used for recognition employed the classical back-propagation neural network (BPNN), recurrent neural network (RNN), and long short-term memory (LSTM) that were obtained by improving RNN. The three recognition algorithms were compared in the experiments, and the effects of the two feature parameters on the performance of the recognition algorithms were also compared. The LSTM model had the best identification performance among the three neural networks under different experimental environments; the neural network model using the MFCC feature parameter outperformed the neural network using the Fbank feature parameter; the LSTM model had the highest correct rate and the highest speed, while the RNN model ranked second, and the BPNN model ranked worst. The results confirm that the recognition can achieve higher speech recognition accuracy compared to other neural networks.
引用
收藏
页码:679 / 684
页数:6
相关论文
共 50 条
  • [31] Noisy Speech Recognition Based On RBF Neural Network
    Yan Gang
    Kong Haidong
    Yu Yang
    Zheng Xiaoxia
    ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING, PTS 1-3, 2011, 271-273 : 597 - 602
  • [32] Speech Emotion Recognition Based on Deep Neural Network
    Zhu, Zijiang
    Hu, Yi
    Li, Junshan
    Li, Jianjun
    Wang, Junhua
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 154 - 154
  • [33] Automatic Image and Speech Recognition Based on Neural Network
    Krol, Dariusz
    Szlachetko, Boguslaw
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2010, 3 (02) : 1 - 17
  • [34] COMPARISON OF SEVERAL SPEECH SIGNAL FEATURE PARAMETERS FOR AUTOMATIC SPEECH RECOGNITION
    PARTALO, M
    SIJERCIC, Z
    SPEECH COMMUNICATION, 1989, 8 (04) : 347 - 353
  • [35] Design of a Quantization Algorithm of the Speech Feature Parameters for the Distributed Speech Recognition
    Lee, Joonseok
    Kang, Sangwon
    Yoon, Byungsik
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (04): : 217 - 223
  • [36] Study on Emotional Feature Analysis and Recognition in Chinese and English Speech Signal
    Wang Yutai
    Li Bo
    Wang Yanan
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOL. 3, 2008, : 1753 - 1757
  • [37] Speech emotion recognition based on spiking neural network and convolutional neural network
    Du, Chengyan
    Liu, Fu
    Kang, Bing
    Hou, Tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147
  • [38] Neural network approach to speech recognition
    Lee, Y.C.
    Chen, H.H.
    Sun, G.Z.
    Neural Networks, 1988, 1 (1 SUPPL)
  • [39] EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION
    Baby, Deepak
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Van hamme, Hugo
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4485 - 4489
  • [40] Quantum neural network in speech recognition
    Fei, L
    Zhao, SM
    Zheng, BY
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1267 - 1270