Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments - Newest Part of the CENSREC Series -

被引：0

作者：

Nishiura, Takanobu

Nakayama, Masato

Denda, Yuki

Kitaoka, Norihide

Yamamoto, Kazumasa

Yamada, Takeshi

Tsuge, Satoru

Miyajima, Chiyomi

Fujimoto, Masakiyo

Takiguchi, Tetsuya

Tamura, Satoshi

Kuroiwa, Shingo

Takeda, Kazuya

Nakamura, Satoshi

机构：

来源：

SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | 2008年

关键词：

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Recently, speech recognition performance has been drastically improved by statistical methods and huge speech databases. Now performance improvement under such realistic environments as noisy conditions is being focused on. Since October 2001, we from the working group of the Information Processing Society in Japan have been working on evaluation methodologies and frameworks for Japanese noisy speech recognition. We have released frameworks including databases and evaluation tools called CENSREC-1 (Corpus and Environment for Noisy Speech RECognition 1; formerly AURORA-2J), CENSREC-2 (in-car connected digits recognition), CENSREC-3 (in-car isolated word recognition), and CENSREC-1-C (voice activity detection under noisy conditions). In this paper, we newly introduce a collection of databases and evaluation tools named CENSREC-4, which is an evaluation framework for distant-talking speech under hands-free conditions. Distant-talking speech recognition is crucial for a hands-free speech interface. Therefore, we measured room impulse responses to investigate reverberant speech recognition. The results of evaluation experiments proved that CENSREC-4 is an effective database suitable for evaluating the new dereverberation method because the traditional dereverberation process had difficulty sufficiently improving the recognition performance. The framework was released in March 2008, and many studies are being conducted with it in Japan.

引用

页码：1828 / 1834

页数：7

共 47 条

[21] Multi-party Human-Robot Interaction with Distant-Talking Speech Recognition
Gomez, Randy
Kawahara, Tatsuya
Nakamura, Keisuke
Nakadai, Kazuhiro
HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, : 439 - 446
[22] Distant-talking Continuous Speech Recognition based on a novel Reverberation Model in the Feature Domain
Sehr, Armin
Zeller, Marcus
Kellermann, Walter
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 769 - 772
[23] CENSREC-3: An evaluation framework for Japanese speech recognition in real car-driving environments
Fujimoto, Masakiyo
Takeda, Kazuya
Nakamura, Satoshi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (11) : 2783 - 2793
[24] Investigations into Early and Late Reflections on Distant-Talking Speech Recognition Toward Suitable Reverberation Criteria
Nishiura, Takanobu
Hirano, Yoshiki
Denda, Yuki
Nakayama, Masato
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1369 - 1372
[25] Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm
Wang, Longbiao
Kitaoka, Norihide
Nakagawa, Seiichi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (03): : 659 - 667
[26] Robust Speaker Recognition from Distant Speech under Real Reverberant Environments Using Speaker Embeddings
Nandwana, Mahesh Kumar
van Hout, Julien
McLaren, Mitchell
Stauffer, Allen
Richey, Colleen
Lawson, Aaron
Graciarena, Martin
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1106 - 1110
[27] Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition
Sehr, Armin
Maas, Roland
Kellermann, Walter
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1676 - 1691
[28] JOINT SPARSE REPRESENTATION BASED CEPSTRAL-DOMAIN DEREVERBERATION FOR DISTANT-TALKING SPEECH RECOGNITION
Li, Weifeng
Wang, Longbiao
Zhou, Fei
Liao, Qingmin
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7117 - 7120
[29] Distant-talking robust speech recognition using late reflection components of room impulse response
Gomez, Randy
Even, Jani
Saruwatari, Hiroshi
Shikano, Kiyohiro
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4581 - 4584
[30] Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array
Yamada, T
Nakamura, S
Shikano, K
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 48 - 56

← 1 2 3 4 5 →