A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters

被引:1
|
作者
Mao, Congmin [1 ]
Liu, Sujing [1 ]
机构
[1] Hebei GEO Univ, Huaxin Coll, 69 Wufan Rd,Airport Ind Pk, Shijiazhuang 050700, Hebei, Peoples R China
关键词
English; speech feature parameters; back- propagation neural network; speech recognition; mel- frequency cepstral coefficient;
D O I
10.20965/jaciii.2024.p0679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, from the perspective of English speech feature parameters, two feature parameters, the melfrequency cepstral coefficient (MFCC) and filter bank (Fbank), were selected to identify English speech. The algorithms used for recognition employed the classical back-propagation neural network (BPNN), recurrent neural network (RNN), and long short-term memory (LSTM) that were obtained by improving RNN. The three recognition algorithms were compared in the experiments, and the effects of the two feature parameters on the performance of the recognition algorithms were also compared. The LSTM model had the best identification performance among the three neural networks under different experimental environments; the neural network model using the MFCC feature parameter outperformed the neural network using the Fbank feature parameter; the LSTM model had the highest correct rate and the highest speed, while the RNN model ranked second, and the BPNN model ranked worst. The results confirm that the recognition can achieve higher speech recognition accuracy compared to other neural networks.
引用
收藏
页码:679 / 684
页数:6
相关论文
共 50 条
  • [1] A Neural Network Based Nonlinear Feature Transformation for Speech Recognition
    Hu, Hongbing
    Zahorian, Stephen A.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1533 - +
  • [2] Speech Recognition Based on Deep Tensor Neural Network and Multifactor Feature
    Shan, Yahui
    Liu, Min
    Zhan, Qingran
    Du, Shixuan
    Wang, Jing
    Xie, Xiang
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 650 - 654
  • [3] A Recognition Method Based on Speech Feature Parameters-English Teaching Practice
    Zhu, Lili
    Yan, Xiujing
    Wang, Jing
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [4] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
    Banik, Manoj
    Kotwal, Mohammed Rokibul Alam
    Hassan, Foyzul
    Islam, Gazi Md. Moshfiqul
    Rahman, Sharif Mohammad Musfiqur
    Hasan, Mohammad Mahedi
    Muhammad, Ghulam
    Huda, Mohammad Nurul
    PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 624 - 627
  • [5] Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition
    Han, Kun
    He, Yanzhang
    Bagchi, Deblin
    Fosler-Lussier, Eric
    Wang, DeLiang
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2484 - 2488
  • [6] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [7] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [8] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [9] Efficient distribution of feature parameters for speech recognition in network environments
    Yoon, JS
    Lee, GH
    Kim, HK
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2005, PT 1, 2005, 3767 : 477 - 488
  • [10] Articulatory feature extraction for speech recognition using neural network
    Huda, Mohammad Nurul
    Hasan, Mohammad Mahedi
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Muhammad, Ghulam
    Rahman, Chowdhury Mofizur
    International Review on Computers and Software, 2011, 6 (01) : 25 - 31