Research on Tibetan Speech Recognition Based on CNN-DFSMN-CTC

被引:0
|
作者
Northwest Normal University, Engineering Research Center of Gansu Province for Intelligent Information Technology and Application, College of Physics and Electronic Engineering, LanZhou, China [1 ]
机构
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Acoustic Modeling - Character recognition - Frequency domain analysis - Long short-term memory - Modeling languages - Time domain analysis
引用
收藏
页码:215 / 219
相关论文
共 50 条
  • [41] Robust CNN-based Speech Recognition With Gabor Filter Kernels
    Chang, Shuo-Yiin
    Morgan, Nelson
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 905 - 909
  • [42] Impact of different voting strategies in CNN based speech emotion recognition
    Simic, Nikola
    Suzic, Sinisa
    Nosek, Tijana
    Vujovic, Mia
    Secujski, Milan
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1174 - 1177
  • [43] Research on speech synthesis technology based on Tibetan rhythmic features
    Khysru, Kuntharrgyal
    Yangzom
    Tang, Wenjie
    Wei, Jianguo
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [44] ATTENTION-BASED GATED SCALING ADAPTIVE ACOUSTIC MODEL FOR CTC-BASED SPEECH RECOGNITION
    Ding, Fenglin
    Guo, Wu
    Dai, Lirong
    Du, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7404 - 7408
  • [45] Information-Preserving Multilayer CTC Loss for Speech Recognition
    Chen, Xianhong
    Luo, Deyu
    Xiong, Wenmeng
    Wang, Qi
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
  • [46] Seamless equal accuracy ratio for inclusive CTC speech recognition
    Gao, Heting
    Wang, Xiaoxuan
    Kang, Sunghun
    Mina, Rusty
    Issa, Dias
    Harvill, John
    Sari, Leda
    Hasegawa-Johnson, Mark
    Yoo, Chang D.
    SPEECH COMMUNICATION, 2022, 136 : 76 - 83
  • [47] Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition
    Zhang, Xulong
    Wang, Jianzong
    Cheng, Ning
    Zhao, Mengyuan
    Zhang, Zhiyong
    Xiao, Jing
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 915 - 920
  • [48] Deep Group Residual Convolutional CTC Networks for Speech Recognition
    Wang, Kai
    Guan, Donghai
    Li, Bohan
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 318 - 328
  • [49] Research on Mongolian Speech Recognition Based on FSMN
    Wang, Yonghe
    Bao, Feilong
    Zhang, Hongwei
    Gao, Guanglai
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 243 - 254
  • [50] Research on Speech Recognition Based on Embedded Platform
    Lv, Xiao-Min
    Qiu, Xiao-Mei
    Fang, Xu-Qi
    Ma, An-Jun
    Cai, Yi-Jie
    International Conference on Mechanics, Building Material and Civil Engineering (MBMCE 2015), 2015, : 698 - 703