Efficient Data Selection for Speech Recognition Based on Prior Confidence Estimation Using Speech and Context Independent Models

被引:0
|
作者
Kobashikawa, Satoshi [1 ]
Asami, Taichi [1 ]
Yamaguchi, Yoshikazu [1 ]
Masataki, Hirokazu [1 ]
Takahashi, Satoshi [1 ]
机构
[1] NTT Corp, NTT Cyber Space Labs, Tokyo, Japan
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
speech recognition; confidence measure; data selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient data selection technique to identify well recognized texts in massive volumes of speech data. Conventional confidence measure techniques can be used to obtain this accurate data, but they require speech recognition results to estimate confidence. Without a significant level of confidence, considerable computer resources are wasted since inaccurate recognition results are generated only to be rejected later. The technique proposed herein rapidly estimates the prior confidence based on just an acoustic likelihood calculation by using speech and context independent models before speech recognition processing; it then recognizes data with high confidence selectively. Simulations show that it matches the data selection performance of the conventional posterior confidence measure with less than 2 % of the computation time.
引用
收藏
页码:238 / 241
页数:4
相关论文
共 50 条
  • [1] Efficient data selection for speech recognition based on prior confidence estimation using speech and monophone models
    Kobashikawa, Satoshi
    Asami, Taichi
    Yamaguchi, Yoshikazu
    Masataki, Hirokazu
    Takahashi, Satoshi
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (06): : 1287 - 1297
  • [2] Efficient data selection for speech recognition based on prior confidence estimation
    Kobashikawa, Satoshi
    Asami, Taichi
    Yamaguchi, Yoshikazu
    Masataki, Hirokazu
    Takahashi, Satoshi
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (04) : 151 - 153
  • [3] Meta-models for confidence estimation in speech recognition
    Dasmahapatra, S
    Cox, S
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1815 - 1818
  • [4] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
  • [5] ROBUST SPEECH RECOGNITION USING MULTIPLE PRIOR MODELS FOR SPEECH RECONSTRUCTION
    Narayanan, Arun
    Zhao, Xiaojia
    Wang, DeLiang
    Fosler-Lussier, Eric
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4800 - 4803
  • [6] Context-independent acoustic models for Thai speech recognition
    Kasuriya, S
    Kanokphara, S
    Thatphithakkul, N
    Cotsomrong, P
    Sunpethniyom, T
    IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 991 - 994
  • [7] CONFIDENCE ESTIMATION FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS FOR SPEECH RECOGNITION
    Li, Qiujia
    Qiu, David
    Zhang, Yu
    Li, Bo
    He, Yanzhang
    Woodland, Philip C.
    Cao, Liangliang
    Strohman, Trevor
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6388 - 6392
  • [8] CONTEXT-AWARE NEURAL CONFIDENCE ESTIMATION FOR RARE WORD SPEECH RECOGNITION
    Qiu, David
    Munkhdalai, Tsendsuren
    He, Yanzhang
    Sim, Khe Chai
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 31 - 37
  • [9] Maximum entropy confidence estimation for speech recognition
    White, Christopher
    Droppo, Jasha
    Acero, Alex
    Odell, Julian
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 809 - +
  • [10] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624