Research on Tibetan Speech Recognition Based on CNN-DFSMN-CTC

Affiliation
Northwest Normal University, Engineering Research Center of Gansu Province for Intelligent Information Technology and Application, College of Physics and Electronic Engineering, Lanzhou, China
Keywords (Compilation and indexing terms; Copyright 2024 Elsevier Inc.)
Acoustic modeling; Character recognition; Frequency domain analysis; Long short-term memory; Modeling languages; Time domain analysis
DOI
Not available
Pages: 215-219
Related papers
50 in total
  • [21] Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
    Hori, Takaaki
    Watanabe, Shinji
    Zhang, Yu
    Chan, William
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 949 - 953
  • [22] End-to-End-Based Tibetan Multitask Speech Recognition
    Zhao, Yue
    Yue, Jianjian
    Xu, Xiaona
    Wu, Licheng
    Li, Xiali
    IEEE ACCESS, 2019, 7 : 162519 - 162529
  • [23] Efficient Conformer-Based CTC Model for Intelligent Cockpit Speech Recognition
    Guo, Hanzhi
    Chen, Yunshu
    Xie, Xukang
    Xu, Gaopeng
    Guo, Wei
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 522 - 526
  • [24] A CNN Based Speech Recognition Approach for Voice Controlled Elevator
    Shinde, Ashwini S.
    Jamdar, Abhishek S.
    Joshi, Kajal D.
    Sarode, Sourav T.
    2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 728 - 733
  • [25] Concatenated Frame Image Based CNN for Visual Speech Recognition
    Saitoh, Takeshi
    Zhou, Ziheng
    Zhao, Guoying
    Pietikainen, Matti
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117 : 277 - 289
  • [26] Multitask Learning with CTC and Segmental CRF for Speech Recognition
    Lu, Liang
    Kong, Lingpeng
    Dyer, Chris
    Smith, Noah A.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 954 - 958
  • [27] Research on Emergency Parking Instruction Recognition Based on Speech Recognition and Speech Emotion Recognition
    Tian Kexin
    Huang Yongming
    Zhang Guobao
    Zhang Lin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2933 - 2937
  • [28] Speech Emotion Recognition Using CNN
    Huang, Zhengwei
    Dong, Ming
    Mao, Qirong
    Zhan, Yongzhao
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 801 - 804
  • [29] Analysis of CNN-based Speech Recognition System using Raw Speech as Input
    Palaz, Dimitri
    Magimai-Doss, Mathew
    Collobert, Ronan
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 11 - 15
  • [30] MULTIRESOLUTION CNN FOR REVERBERANT SPEECH RECOGNITION
    Park, Sunchan
    Jeong, Yongwon
    Kim, Hyung Soon
    2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 214 - 217