Research on Tibetan Speech Recognition Based on CNN-DFSMN-CTC

被引:0
|
作者
Northwest Normal University, Engineering Research Center of Gansu Province for Intelligent Information Technology and Application, College of Physics and Electronic Engineering, LanZhou, China [1 ]
机构
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Acoustic Modeling - Character recognition - Frequency domain analysis - Long short-term memory - Modeling languages - Time domain analysis
引用
收藏
页码:215 / 219
相关论文
共 50 条
  • [1] 一种基于CNN-DFSMN-CTC的语音识别模型
    梁宏涛
    刘家旭
    计算机与数字工程, 2024, 52 (10) : 2984 - 2990
  • [2] INVESTIGATION OF MODELING UNITS FOR MANDARIN SPEECH RECOGNITION USING DFSMN-CTC-SMBR
    Zhang, Shiliang
    Lei, Ming
    Liu, Yuan
    Li, Wei
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7085 - 7089
  • [3] Research on the Algorithm of Tibetan Speech Recognition based on DBN
    Pan, Xiuqin
    Xu, Xiaona
    Zhang, Hong
    Zhao, Yue
    Cao, Yongcun
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 412 - 415
  • [4] End-to-end Tibetan Ando dialect speech recognition based on hybrid CTC/attention architecture
    Sun, Jingwen
    Zhou, Gang
    Yang, Hongwu
    Wang, Man
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 628 - 632
  • [5] Research on Tibetan Speech Recognition Based on the Am-do Dialect
    Khysru, Kuntharrgyal
    Wei, Jianguo
    Dang, Jianwu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 4897 - 4907
  • [6] Tibetan speech recognition based on wenet
    Zhe, Runyu
    Li, Guanyu
    Ma, Like
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 554 - 557
  • [7] Joint Decoding of CTC Based Systems for Speech Recognition
    Guo, Jiaqi
    You, Yongbin
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 2205 - 2209
  • [8] Research on the Algorithm of Noisy-Robust Tibetan Speech Recognition Based on RBF
    Pan, Xiuqin
    Lu, Yong
    Cao, Yongcun
    Zhang, Hong
    Zhao, Yue
    Xu, Xiaona
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 416 - 419
  • [9] INTERMEDIATE LOSS REGULARIZATION FOR CTC-BASED SPEECH RECOGNITION
    Lee, Jaesong
    Watanabe, Shinji
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6224 - 6228
  • [10] Mandarin Electrolaryngeal Speech Recognition Based on WaveNet-CTC
    Qian, Zhaopeng
    Wang, Li
    Zhang, Shaochuan
    Liu, Chan
    Niu, Haijun
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (07): : 2203 - 2212