Japanese speech databases for robust speech recognition

Authors
Nakamura, A
Matsunaga, S
Shimizu, T
Tonomura, M
Sagisaka, Y
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
At ATR, a next-generation speech translation system is under development towards natural trans-language communication. To meet the various requirements that the new system places on speech recognition technology, further research efforts should emphasize robustness to large vocabularies, to the speaking variations often found in fast spontaneous speech, and to speaker variance. These are key problems to be solved not only for speech translation but also for the general use of speech recognition in real environments. In this paper, three large speech databases are designed to cope with these problems in speech recognition, and the current status of data collection is reported.
Pages: 2199 - 2202
Page count: 4