Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

被引:0
|
作者
Liang, Chengdong [1 ]
Xu, Menglong [1 ]
Zhang, Xiao-Lei [1 ]
机构
[1] CIAIC, School of Marine Science and Technology, Northwestern Polytechnical University, China
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Electric transformer testing - Speech communication - Gaussian distribution
引用
收藏
页码:1495 / 1499
相关论文
共 50 条
  • [31] AN END-TO-END SPEECH ACCENT RECOGNITION METHOD BASED ON HYBRID CTC/ATTENTION TRANSFORMER ASR
    Gao, Qiang
    Wu, Haiwei
    Sun, Yanqing
    Duan, Yitao
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7253 - 7257
  • [32] An End-to-end Speech Recognition Algorithm based on Attention Mechanism
    Chen, Jia-nan
    Gao, Shuang
    Sun, Han-zhe
    Liu, Xiao-hui
    Wang, Zi-ning
    Zheng, Yan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2935 - 2940
  • [33] Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition
    Lohrenz, Timo
    Li, Zhengyang
    Fingscheidt, Tim
    INTERSPEECH 2021, 2021, : 2846 - 2850
  • [34] Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration
    Karita, Shigeki
    Soplin, Nelson Enrique Yalta
    Watanabe, Shinji
    Delcroix, Marc
    Ogawa, Atsunori
    Nakatani, Tomohiro
    INTERSPEECH 2019, 2019, : 1408 - 1412
  • [35] Fast offline transformer-based end-to-end automatic speech recognition for real-world applications
    Oh, Yoo Rhee
    Park, Kiyoung
    Park, Jeon Gue
    ETRI JOURNAL, 2022, 44 (03) : 476 - 490
  • [36] Transformer-based end-to-end attack on text CAPTCHAs with triplet deep attention
    Zhang, Bo
    Xiong, Yu-Jie
    Xia, Chunming
    Gao, Yongbin
    COMPUTERS & SECURITY, 2024, 146
  • [37] Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition
    Moritz, Niko
    Hori, Takaaki
    Le Roux, Jonathan
    INTERSPEECH 2021, 2021, : 1822 - 1826
  • [38] EXPLORATION OF LANGUAGE-SPECIFIC SELF-ATTENTION PARAMETERS FOR MULTILINGUAL END-TO-END SPEECH RECOGNITION
    Houston, Brady
    Kirchhoff, Katrin
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 755 - 762
  • [39] Residual Energy-Based Models for End-to-End Speech Recognition
    Li, Qiujia
    Zhang, Yu
    Li, Bo
    Cao, Liangliang
    Woodland, Philip C.
    INTERSPEECH 2021, 2021, : 4069 - 4073
  • [40] END-TO-END ATTENTION-BASED LARGE VOCABULARY SPEECH RECOGNITION
    Bandanau, Dzmitry
    Chorowski, Jan
    Serdyuk, Dmitriy
    Brakel, Philemon
    Bengio, Yoshua
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4945 - 4949