Development of the CUHK Dysarthric Speech Recognition System for the UASpeech Corpus

被引:0
|
作者
Yu, Jianwei [1 ]
Xie, Xurong [2 ]
Liu, Shansong [1 ]
Hu, Shoukang [1 ]
Lam, Max W. Y. [1 ]
Wu, Xixin [1 ]
Wong, Ka Ho [1 ]
Liu, Xunying [1 ]
Meng, Helen [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
关键词
dysarthric speech; speech recognition; cross domain adaptation; system combination; auto-encoder;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dysarthric speech recognition is a highly challenging task. The articulatory motor control problems associated with neuromotor conditions produce large mismatch against normal speech, In addition, such data is difficult to collect in large quantities. This paper presents the development of the Chinese University of Hong Kong automatic speech recognition (ASR) system for the Universal Access Speech (UASpeech) [1]. A range of deep neural network (DNN) acoustic models and their more advanced variants based on time delayed neural networks (TDNNs) and long short-term memory recurrent neural networks (LSTM-RNNs) were developed. Speaker adaptation by learning hidden unit contributions (LHUC) was used. A semi-supervised complementary auto-encoder system was further constructed to improve the bottleneck feature extraction. Two out-of-domain (OOD) ASR systems separately trained on broadcast news and switchboard data were cross domain adapted towards the UASpeech data and adopted in system combination. The final combined system gave an overall word accuracy of 69.4% on the 16-speaker test set.
引用
收藏
页码:2938 / 2942
页数:5
相关论文
共 50 条
  • [21] DEVELOPMENT OF NEW SPEECH CORPUS FOR ELDERLY JAPANESE SPEECH RECOGNITION
    Iribe, Yurie
    Kitaoka, Norihide
    Segawa, Shuhei
    2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 27 - 31
  • [22] Chhattisgarhi speech corpus for research and development in automatic speech recognition
    Londhe, Narendra D.
    Kshirsagar, Ghanahshyam B.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 193 - 210
  • [23] Development of CNN-based robust dysarthric isolated digit recognition system by enhancing speech intelligibility
    Revathi A.
    Sasikaladevi N.
    Arunprasanth D.
    Research on Biomedical Engineering, 2022, 38 (04) : 1067 - 1079
  • [24] An automatic speech recognition system for spontaneous Punjabi speech corpus
    Kumar Y.
    Singh N.
    International Journal of Speech Technology, 2017, 20 (2) : 297 - 303
  • [25] Optimizing Vocabulary Modeling for Dysarthric Speech Recognition
    Na, Minsoo
    Chung, Minhwa
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT II (ICCHP 2016), 2016, 9759 : 507 - 510
  • [26] A survey of technologies for automatic Dysarthric speech recognition
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [27] Using articulatory likelihoods in the recognition of dysarthric speech
    Rudzicz, Frank
    SPEECH COMMUNICATION, 2012, 54 (03) : 430 - 444
  • [28] Using speech rhythm knowledge to improve dysarthric speech recognition
    Selouani, S. -A.
    Dahmani, H.
    Amami, R.
    Hamam, H.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (01) : 57 - 64
  • [29] A survey of technologies for automatic Dysarthric speech recognition
    Zhaopeng Qian
    Kejing Xiao
    Chongchong Yu
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [30] A Strategic Approach for Robust Dysarthric Speech Recognition
    Revathi, A.
    Sasikaladevi, N.
    Arunprasanth, D.
    Amirtharajan, Rengarajan
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 134 (04) : 2315 - 2346