Exploiting the ASR N-Best by tracking multiple dialog state hypotheses

被引:0
|
作者
Williams, Jason D. [1 ]
机构
[1] AT&T Labs Res, Shannon Lab, Florham Pk, NJ 07932 USA
关键词
dialogue modelling; dialogue management; spoken dialogue systems; confidence score; N-Best list;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When the top ASR hypothesis is incorrect, often the correct hypothesis is listed as an alternative in the ASR N-Best list. Whereas traditional spoken dialog systems have struggled to exploit this information, this paper argues that a dialog model that tracks a distribution over multiple dialog states can improve dialog accuracy by making use of the entire N-Best list. The key clement of the approach is a generative model of the N-Best list given the user's true hidden action. An evaluation on real dialog data verifies that dialog accuracy rates arc improved by making use of the entire N-Best list.
引用
收藏
页码:191 / 194
页数:4
相关论文
共 38 条
  • [21] Multirate ASR models for phone-class dependent N-best list rescoring
    Gadde, VR
    Sönmez, K
    Franco, H
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 157 - 161
  • [22] A COMPARATIVE STUDY ON METHODS OF WEIGHTED LANGUAGE MODEL TRAINING FOR RERANKING LVCSR N-BEST HYPOTHESES
    Oba, Takanobu
    Hori, Takaaki
    Nakamura, Atsushi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5126 - 5129
  • [23] A comparative study of discriminative methods for reranking LVCSR N-best hypotheses in domain adaption and generalization
    Zhou, Zhengyu
    Gao, Jianfeng
    Soong, Frank K.
    Meng, Helen
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 141 - 144
  • [24] Efficient Multiple Hypotheses Tracking Scheme using Adaptive number of 'K' best Hypotheses for Target Tracking in Clutter
    Vudumu, Sarojini
    Paul, Viji P.
    Ramakrishnan, L.
    2018 22ND INTERNATIONAL MICROWAVE AND RADAR CONFERENCE (MIKON 2018), 2018, : 390 - 394
  • [25] Rescoring of N-Best Hypotheses Using Top-Down Selective Attention for Automatic Speech Recognition
    Kim, Ho-Gyeong
    Lee, Hwaran
    Kim, Geonmin
    Oh, Sang-Hoon
    Lee, Soo-Young
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (02) : 199 - 203
  • [26] Exploiting More Associations Between Slots for Multi-Domain Dialog State Tracking
    Hui Bai
    Yan Yang
    Jie Wang
    Big Data Mining and Analytics, 2022, 5 (01) : 41 - 52
  • [27] Exploiting More Associations Between Slots for Multi-Domain Dialog State Tracking
    Bai, Hui
    Yang, Yan
    Wang, Jie
    BIG DATA MINING AND ANALYTICS, 2022, 5 (01) : 41 - 52
  • [28] Implementation of a Chomsky-Schutzenberger n-Best Parser for Weighted Multiple Context-Free Grammars
    Ruprecht, Thomas
    Denkinger, Tobias
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 178 - 191
  • [29] IMPROVING NOISE ROBUSTNESS FOR SPOKEN CONTENT RETRIEVAL USING SEMI-SUPERVISED ASR AND N-BEST TRANSCRIPTS FOR BERT-BASED RANKING MODELS
    Moriya, Yasufumi
    Jones, Gareth. J. F.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 398 - 405
  • [30] Compression of CTC-Trained Acoustic Models by Dynamic Frame-Wise Distillation Or Segment-Wise N-Best Hypotheses Imitation
    Ding, Haisong
    Chen, Kai
    Huo, Qiang
    INTERSPEECH 2019, 2019, : 3218 - 3222