Efficient Data Selection for Speech Recognition Based on Prior Confidence Estimation Using Speech and Context Independent Models

被引：0

作者：

Kobashikawa, Satoshi ^{[1
]}

Asami, Taichi ^{[1
]}

Yamaguchi, Yoshikazu ^{[1
]}

Masataki, Hirokazu ^{[1
]}

Takahashi, Satoshi ^{[1
]}

机构：

[1] NTT Corp, NTT Cyber Space Labs, Tokyo, Japan

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年

关键词：

speech recognition; confidence measure; data selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an efficient data selection technique to identify well recognized texts in massive volumes of speech data. Conventional confidence measure techniques can be used to obtain this accurate data, but they require speech recognition results to estimate confidence. Without a significant level of confidence, considerable computer resources are wasted since inaccurate recognition results are generated only to be rejected later. The technique proposed herein rapidly estimates the prior confidence based on just an acoustic likelihood calculation by using speech and context independent models before speech recognition processing; it then recognizes data with high confidence selectively. Simulations show that it matches the data selection performance of the conventional posterior confidence measure with less than 2 % of the computation time.

引用

页码：238 / 241

页数：4

共 50 条

[1] Efficient data selection for speech recognition based on prior confidence estimation using speech and monophone models
Kobashikawa, Satoshi
Asami, Taichi
Yamaguchi, Yoshikazu
Masataki, Hirokazu
Takahashi, Satoshi
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (06): : 1287 - 1297
[2] Efficient data selection for speech recognition based on prior confidence estimation
Kobashikawa, Satoshi
Asami, Taichi
Yamaguchi, Yoshikazu
Masataki, Hirokazu
Takahashi, Satoshi
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (04) : 151 - 153
[3] Meta-models for confidence estimation in speech recognition
Dasmahapatra, S
Cox, S
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1815 - 1818
[4] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
[5] ROBUST SPEECH RECOGNITION USING MULTIPLE PRIOR MODELS FOR SPEECH RECONSTRUCTION
Narayanan, Arun
Zhao, Xiaojia
Wang, DeLiang
Fosler-Lussier, Eric
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4800 - 4803
[6] Context-independent acoustic models for Thai speech recognition
Kasuriya, S
Kanokphara, S
Thatphithakkul, N
Cotsomrong, P
Sunpethniyom, T
IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 991 - 994
[7] CONFIDENCE ESTIMATION FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS FOR SPEECH RECOGNITION
Li, Qiujia
Qiu, David
Zhang, Yu
Li, Bo
He, Yanzhang
Woodland, Philip C.
Cao, Liangliang
Strohman, Trevor
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6388 - 6392
[8] CONTEXT-AWARE NEURAL CONFIDENCE ESTIMATION FOR RARE WORD SPEECH RECOGNITION
Qiu, David
Munkhdalai, Tsendsuren
He, Yanzhang
Sim, Khe Chai
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 31 - 37
[9] Maximum entropy confidence estimation for speech recognition
White, Christopher
Droppo, Jasha
Acero, Alex
Odell, Julian
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 809 - +
[10] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624

← 1 2 3 4 5 →