Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)

被引:0
|
作者
Chen, Xun [1 ]
Wang, Chengqi [1 ]
Hu, Chao [1 ]
Wang, Qin [1 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
基金
中国国家自然科学基金;
关键词
self-attention mechanism; CTC; gated circulation units;
D O I
10.3390/bdcc8120195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition technology is an important branch in the field of artificial intelligence, aiming to transform human speech into computer-readable text information. However, speech recognition technology still faces many challenges, such as noise interference, and accent and speech rate differences. An aim of this paper is to explore a deep learning-based speech recognition method to improve the accuracy and robustness of speech recognition. Firstly, this paper introduces the basic principles of speech recognition and existing mainstream technologies, and then focuses on the deep learning-based speech recognition method. Through comparative experiments, it is found that the self-attention mechanism performs best in speech recognition tasks. In order to further improve speech recognition performance, this paper proposes a deep learning model based on the self-attention mechanism with DCNN-GRU. The model realizes the dynamic attention to an input speech by introducing the self-attention mechanism in a neural network model instead of an RNN and with a deep convolutional neural network, which improves the robustness and recognition accuracy of this model. This experiment uses 170 h of Chinese dataset AISHELL-1. Compared with the deep convolutional neural network, the deep learning model based on the self-attention mechanism with DCNN-GRU accomplishes a reduction of at least 6% in CER. Compared with a bidirectional gated recurrent neural network, the deep learning model based on the self-attention mechanism with DCNN-GRU accomplishes a reduction of 0.7% in CER. And finally, this experiment is performed on a test set analyzed the influencing factors affecting the CER. The experimental results show that this model exhibits good performance in various noise environments and accent conditions.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Self-attention based GRU neural network for deep knowledge tracing
    Jin, Shangzhu
    Zhao, Yan
    Peng, Jun
    Chen, Ning
    Xue, Run
    Liang, Minghui
    Jiang, Yunfeng
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1436 - 1440
  • [2] New GRU from Convolutional Neural Network and Gated Recurrent Unit
    Atassi, A.
    El Azami, I.
    Sadiq, A.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE, E-LEARNING AND INFORMATION SYSTEMS 2018 (DATA'18), 2018,
  • [3] Deep Learning Wind Power Prediction Model Based on Attention Mechanism-Based Convolutional Neural Network and Gated Recurrent Unit Neural Network
    Hou, Zai-Hong
    Bai, Yu-Long
    Ding, Lin
    Yue, Xiao-Xin
    Huang, Yu-Ting
    Song, Wei
    Bi, Qi
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (16)
  • [4] Attention-Based Convolutional Neural Network and Bidirectional Gated Recurrent Unit for Human Activity Recognition
    Tao, Shuai
    Zhao, Zhiqiang
    Qin, Jing
    Ji, Changqing
    Wang, Zumin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1128 - 1134
  • [5] Prediction of Network Security Situation Based on Attention Mechanism and Convolutional Neural Network-Gated Recurrent Unit
    Feng, Yuan
    Zhao, Hongying
    Zhang, Jianwei
    Cai, Zengyu
    Zhu, Liang
    Zhang, Ran
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [6] Emotional Stress Recognition Using Electroencephalogram Signals Based on a Three-Dimensional Convolutional Gated Self-Attention Deep Neural Network
    Kim, Hyoung-Gook
    Jeong, Dong-Ki
    Kim, Jin-Young
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [7] Automatic Food Recognition Using Deep Convolutional Neural Networks with Self-attention Mechanism
    Rahib Abiyev
    Joseph Adepoju
    Human-Centric Intelligent Systems, 2024, 4 (1): : 171 - 186
  • [8] Aerial Target Threat Assessment Based on Gated Recurrent Unit and Self-Attention Mechanism
    Chen, Chen
    Quan, Wei
    Shao, Zhuang
    Journal of Systems Engineering and Electronics, 2024, 35 (02) : 361 - 373
  • [9] Aerial target threat assessment based on gated recurrent unit and self-attention mechanism
    Chen, Chen
    Quan, Wei
    Shao, Zhuang
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (02) : 361 - 373
  • [10] Aerial target threat assessment based on gated recurrent unit and self-attention mechanism
    CHEN Chen
    QUAN Wei
    SHAO Zhuang
    JournalofSystemsEngineeringandElectronics, 2024, 35 (02) : 361 - 373