Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)

Authors
Chen, Xun [1]
Wang, Chengqi [1]
Hu, Chao [1]
Wang, Qin [1]
Affiliation
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
self-attention mechanism; CTC; gated circulation units;
DOI
10.3390/bdcc8120195
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speech recognition is an important branch of artificial intelligence that aims to transform human speech into computer-readable text. It still faces many challenges, such as noise interference and differences in accent and speech rate. This paper explores a deep learning-based speech recognition method to improve the accuracy and robustness of speech recognition. It first introduces the basic principles of speech recognition and the existing mainstream technologies, and then focuses on deep learning-based methods. Comparative experiments show that the self-attention mechanism performs best in speech recognition tasks. To further improve recognition performance, the paper proposes a deep learning model based on the self-attention mechanism with DCNN-GRU. The model attends dynamically to the input speech by introducing the self-attention mechanism into the neural network in place of an RNN, combined with a deep convolutional neural network, which improves the robustness and recognition accuracy of the model. The experiments use the 170 h Chinese dataset AISHELL-1. Compared with the deep convolutional neural network, the proposed model achieves a reduction of at least 6% in CER. Compared with a bidirectional gated recurrent neural network, it achieves a reduction of 0.7% in CER. Finally, the factors that influence the CER are analyzed on the test set. The experimental results show that the model performs well in various noise environments and accent conditions.
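The results above are reported as CER (character error rate). The abstract does not spell out how it is computed; the following is a minimal sketch assuming the common definition, the character-level Levenshtein distance between reference and hypothesis divided by the reference length. The function names and the example sentences are hypothetical and are not taken from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (row-by-row dynamic programming)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: edit distance divided by reference length (assumed definition)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical example: one substituted and one missing character out of six.
reference = "今天天气很好"
hypothesis = "今天天汽很"
print(f"CER = {cer(reference, hypothesis):.3f}")
```

For Mandarin, scoring at the character level avoids depending on a word segmenter, which is one reason CER rather than word error rate is commonly reported on datasets such as AISHELL-1.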
Pages: 13