A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters

被引:1
|
作者
Mao, Congmin [1 ]
Liu, Sujing [1 ]
机构
[1] Hebei GEO Univ, Huaxin Coll, 69 Wufan Rd,Airport Ind Pk, Shijiazhuang 050700, Hebei, Peoples R China
关键词
English; speech feature parameters; back- propagation neural network; speech recognition; mel- frequency cepstral coefficient;
D O I
10.20965/jaciii.2024.p0679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, from the perspective of English speech feature parameters, two feature parameters, the melfrequency cepstral coefficient (MFCC) and filter bank (Fbank), were selected to identify English speech. The algorithms used for recognition employed the classical back-propagation neural network (BPNN), recurrent neural network (RNN), and long short-term memory (LSTM) that were obtained by improving RNN. The three recognition algorithms were compared in the experiments, and the effects of the two feature parameters on the performance of the recognition algorithms were also compared. The LSTM model had the best identification performance among the three neural networks under different experimental environments; the neural network model using the MFCC feature parameter outperformed the neural network using the Fbank feature parameter; the LSTM model had the highest correct rate and the highest speed, while the RNN model ranked second, and the BPNN model ranked worst. The results confirm that the recognition can achieve higher speech recognition accuracy compared to other neural networks.
引用
收藏
页码:679 / 684
页数:6
相关论文
共 50 条
  • [21] Seismic facies analysis based on speech recognition feature parameters
    Xie, Tao
    Zheng, Xiaodong
    Zhang, Yan
    GEOPHYSICS, 2017, 82 (03) : O23 - O35
  • [22] Speech recognition in English cultural promotion via recurrent neural network
    Wang, Jian
    PERSONAL AND UBIQUITOUS COMPUTING, 2020, 24 (02) : 237 - 246
  • [23] Speech recognition in English cultural promotion via recurrent neural network
    Jian Wang
    Personal and Ubiquitous Computing, 2020, 24 : 237 - 246
  • [24] Donggan speech recognition based on deep neural network
    Xu, Haiyan
    Yang, Hongwu
    You, Yuren
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 354 - 358
  • [25] An Optimal Method for Speech Recognition Based on Neural Network
    Ishak, Mohamad Khairi
    Madsen, Dag oivind
    Al-Zahrani, Fahad Ahmed
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (02): : 1951 - 1961
  • [26] Primi Speech Recognition Based on Deep Neural Network
    Hu, Wenjun
    Fu, Meijun
    Pan, Wenlin
    2016 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2016, : 667 - 671
  • [27] Indonesian speech recognition based on Deep Neural Network
    Yang, Ruolin
    Yang, Jian
    Lu, Yu
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 36 - 41
  • [28] Continuous Speech Recognition based on Convolutional Neural Network
    Zhang, Qing-qing
    Liu, Yong
    Pan, Jie-lin
    Yan, Yong-hong
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [29] The spiking neural network based on fMRI for speech recognition
    Song, Yihua
    Guo, Lei
    Man, Menghua
    Wu, Youxi
    PATTERN RECOGNITION, 2024, 155
  • [30] A Neural Network based on Sequence Learning for Speech Recognition
    Elmisery, Fathy A.
    Starzyk, Janusz A.
    ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2007, : 139 - +