Verifying and correcting recognition string hypotheses using discriminative utterance verification

被引:20
|
作者
Sukkar, RA
Setlur, AR
Lee, CH
Jacob, J
机构
[1] AT&T Bell Labs, Lucent Technol, Naperville, IL 60566 USA
[2] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
关键词
D O I
10.1016/S0167-6393(97)00031-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Utterance verification (UV) is a process by which the output of a speech recognizer is verified to determine if the input speech actually includes the recognized keyword(s). The output of the speech verifier is a binary decision to accept or reject the recognized utterance based on a UV confidence score. In this paper, we extend the notion of utterance verification by presenting an utterance verification method that will be utilized to perform three tasks. (1) detect non-keyword strings (false alarms), (2) detect keyword substitution errors, and (3) selectively correct substitution errors when N-best string hypotheses are available. The utterance verification method presented here employs a set of verification-specific models that are independent of the models used in the recognition process. The verification models are trained using a discriminative training procedure that seeks to minimize the verification error by simultaneously maximizing the rejection of non-keywords and misrecognized keywords while minimizing the rejection of correctly recognized keywords. The error correction is performed by reordering the hypotheses produced by an N-best recognizer based on a UV confidence score. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:333 / 342
页数:10
相关论文
共 50 条
  • [1] Correcting recognition errors via discriminative utterance verification
    Setlur, AR
    Sukkar, RA
    Jacob, J
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 602 - 605
  • [2] Discriminative utterance verification using minimum string verification error (MSVE) training
    Rahim, MG
    Lee, CH
    Juang, BH
    Chou, W
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3585 - 3588
  • [3] Discriminative utterance verification for connected digits recognition
    Rahim, MG
    Lee, CH
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 266 - 277
  • [4] Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition
    Sukkar, RA
    Lee, CH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (06): : 420 - 429
  • [5] Study on robust utterance verification for connected digits recognition
    Rahim, Mazin G.
    Lee, Chin-Hui
    Juang, Biing-Hwang
    Journal of the Acoustical Society of America, 1997, 101 (5 pt 1):
  • [6] A study on robust utterance verification for connected digits recognition
    Rahim, MG
    Lee, CH
    Juang, BH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (05): : 2892 - 2902
  • [7] Improving utterance verification using hierarchical confidence measures in continuous natural numbers recognition
    Caminero, J
    Hernandez, L
    delaTorre, C
    Martin, C
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 891 - 894
  • [8] VERIFICATION OF RECOGNITION AND ALIGNMENT HYPOTHESES BY MEANS OF EDGE VERIFICATION STATISTICS
    HELLER, AJ
    STENSTROM, JR
    IMAGE UNDERSTANDING WORKSHOP /, 1989, : 957 - 966
  • [9] Utterance verification in continuous speech recognition: Decoding and training procedures
    Lleida, E
    Rose, RC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 126 - 139
  • [10] A new hybrid decoding algorithm for speech recognition and utterance verification
    Koo, MW
    Lee, CH
    Juang, BH
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 303 - 310