Japanese speech databases for robust speech recognition

Cited by: 0

Authors:

Nakamura, A

Matsunaga, S

Shimizu, T

Tonomura, M

Sagisaka, Y

Institution:

Source:

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206; 082403

Abstract:

At ATR, a next-generation speech translation system is under development towards natural trans-language communication. To cope with the various requirements placed on speech recognition technology by the new system, further research efforts should emphasize robustness to large vocabulary, to the speaking variations often found in fast spontaneous speech, and to speaker variance. These are key problems to be solved not only for speech translation but also for the general use of speech recognition in real environments. In this paper, three large speech databases are designed to cope with these problems in speech recognition, and the current status of data collection is reported.

Pages: 2199-2202

Page count: 4
