Verifying and correcting recognition string hypotheses using discriminative utterance verification

被引:20
|
作者
Sukkar, RA
Setlur, AR
Lee, CH
Jacob, J
机构
[1] AT&T Bell Labs, Lucent Technol, Naperville, IL 60566 USA
[2] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
关键词
D O I
10.1016/S0167-6393(97)00031-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Utterance verification (UV) is a process by which the output of a speech recognizer is verified to determine if the input speech actually includes the recognized keyword(s). The output of the speech verifier is a binary decision to accept or reject the recognized utterance based on a UV confidence score. In this paper, we extend the notion of utterance verification by presenting an utterance verification method that will be utilized to perform three tasks. (1) detect non-keyword strings (false alarms), (2) detect keyword substitution errors, and (3) selectively correct substitution errors when N-best string hypotheses are available. The utterance verification method presented here employs a set of verification-specific models that are independent of the models used in the recognition process. The verification models are trained using a discriminative training procedure that seeks to minimize the verification error by simultaneously maximizing the rejection of non-keywords and misrecognized keywords while minimizing the rejection of correctly recognized keywords. The error correction is performed by reordering the hypotheses produced by an N-best recognizer based on a UV confidence score. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:333 / 342
页数:10
相关论文
共 50 条
  • [21] UTTERANCE VERIFICATION USING IMPROVED CONFIDENCE MEASURES BASED ON ALIGNMENT CONFUSION RATE IN CHINESE DIGITS RECOGNITION
    Zhang, Shilei
    Jiang, Danning
    Qin, Yong
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1309 - 1312
  • [22] On-line garbage modeling for word and utterance verification in natural numbers recognition
    delaTorre, C
    HernandezGomez, L
    CamineroGil, FJ
    delAlamo, CM
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 845 - 848
  • [23] Discriminative HMM stream model for Mandarin digit string speech recognition
    Shi, YY
    Liu, J
    Liu, RS
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 528 - 531
  • [24] Verifying visual properties in sentence verification facilitates picture recognition memory
    Pecher, Diane
    Zanolie, Kiki
    Zeelenberg, Rene
    EXPERIMENTAL PSYCHOLOGY, 2007, 54 (03) : 173 - 179
  • [25] Speaker verification using frame and utterance level likelihood normalization
    Nakagawa, S
    Markov, KP
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1087 - 1090
  • [26] Improving utterance verification using a smoothed naive Bayes model
    Sanchis, A
    Juan, A
    Vidal, E
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 592 - 595
  • [27] Dynamic signature verification using discriminative training
    Russell, GF
    Hu, JY
    Biem, A
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1260 - 1264
  • [28] SHORT UTTERANCE RECOGNITION USING A NETWORK WITH MINIMUM TRAINING
    TOM, MD
    TENORIO, MF
    NEURAL NETWORKS, 1991, 4 (06) : 711 - 722
  • [29] Multistage Utterance Verification for Keyword Recognition-based Online Spoken Content Retrieval
    Park, Jeong-Sik
    Jang, Gil-Jin
    Kim, Ji-Hwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (03) : 1000 - 1005
  • [30] DETECTING MISMATCH BETWEEN TEXT SCRIPT AND VOICE-OVER USING UTTERANCE VERIFICATION BASED ON PHONEME RECOGNITION RANKING
    Jeong, Yoonjae
    Cho, Hoon-Young
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8264 - 8268