English speech recognition based on deep learning with multiple features

被引:2
|
作者
Zhaojuan Song
机构
[1] School of Translation Studies of Qufu Normal University,
来源
Computing | 2020年 / 102卷
关键词
Deep neural network; Fusion; Speech recognition; Multiple features; 68T10; 68T35; 68T50;
D O I
暂无
中图分类号
学科分类号
摘要
English is one of the widely used languages, with the shrinking of the global village, the smart home, the in-vehicle voice system and voice recognition software with English as the recognition language have gradually entered people’s field of vision, and have obtained the majority of users’ love by the practical accuracy. And deep learning technology in many tasks with its hierarchical feature learning ability and data modeling capabilities has achieved more than the performance of shallow learning technology. Therefore, this paper takes English speech as the research object, and proposes a deep learning speech recognition algorithm that combines speech features and speech attributes. Firstly, the deep neural network supervised learning method is used to extract the high-level features of the speech, select the output of the fixed hidden layer as the new speech feature for the newly generated network, and train the GMM–HMM acoustic model with the new speech features; secondly, the speech attribute extractor based on deep neural network is trained for multiple speech attributes, and the extracted speech attributes are classified into phoneme by deep neural network; finally, speech features and speech attribute features are merged into the same CNN framework by the neural network based on the linear feature fusion algorithm. The experimental results show that the proposed English speech recognition algorithm based on deep neural network with multiple features can directly and effectively combine the two methods by combining the speech features and the speech attributes of the speaker in the input layer of the deep neural network, and it can improve the performance of the English speech recognition system significantly.
引用
收藏
页码:663 / 682
页数:19
相关论文
共 50 条
  • [1] English speech recognition based on deep learning with multiple features
    Song, Zhaojuan
    COMPUTING, 2020, 102 (03) : 663 - 682
  • [2] An Acoustic Model For English Speech Recognition Based On Deep Learning
    Ling, Zhang
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 610 - 614
  • [3] Research on English Vocabulary and Speech Corpus Recognition Based on Deep Learning
    Zhen, Wang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [4] Recognition of English speech - using a deep learning algorithm
    Wang, Shuyan
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [5] Deep Learning-Based Detection of Articulatory Features in Arabic and English Speech
    Algabri, Mohammed
    Mathkour, Hassan
    Alsulaiman, Mansour M.
    Bencherif, Mohamed A.
    SENSORS, 2021, 21 (04) : 1 - 23
  • [6] Deep Learning of Speech Features for Improved Phonetic Recognition
    Lee, Jaehyung
    Lee, Soo-Young
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1256 - 1259
  • [7] Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network
    Bhangale, Kishor
    Kothandaraman, Mohanaprasad
    ELECTRONICS, 2023, 12 (04)
  • [8] Deep Learning-Based Emotion Recognition by Fusion of Facial Expressions and Speech Features
    Vardhan, Jasthi Vivek
    Chakravarti, Yelavarti Kalyan
    Chand, Annam Jitin
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
  • [9] Multi-Features Audio Extraction for Speech Emotion Recognition Based on Deep Learning
    Gondohanindijo, Jutono
    Muljono
    Noersasongko, Edi
    Pujiono
    Setiadi, De Rosal Moses
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 198 - 206
  • [10] On the relevance of auditory-based Gabor features for deep learning in robust speech recognition
    Castro Martinez A.M.
    Mallidi S.H.
    Meyer B.T.
    Castro Martinez, Angel Mario (angel.castro@uni-oldenburg.de), 1600, Academic Press (45): : 21 - 38