ZERO RESOURCE GRAPH-BASED CONFIDENCE ESTIMATION FOR OPEN VOCABULARY SPOKEN TERM DETECTION

Cited by: 0
Authors
Norouzian, Atta [1]
Rose, Richard [1]
Ghalehjegh, Sina Hamidi [1]
Jansen, Aren [2, 3]
Affiliations
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Human Knowledge Technol Ctr Excellence, Montreal, PQ, Canada
[3] Johns Hopkins Univ, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
Open vocabulary spoken term detection; Dotplot; Random walk on directional graphs
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper investigates the use of acoustic similarity between speech intervals to generate improved confidence scores for spoken term detection (STD). Similar speech intervals are discovered with a procedure based on acoustic dotplots, which requires no training data. A graph-based random walk algorithm then uses the acoustic similarity of hypothesized term occurrences to update the corresponding confidence scores. The approach is evaluated on an open vocabulary STD task defined on a lecture domain corpus. Updating the confidence scores in this fashion significantly increases term detection performance for out-of-vocabulary search terms: the figure of merit improves by 12.9% relative to a baseline lattice-based STD system.
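The abstract does not give the update rule, so the following is only a generic sketch of how a random walk over a similarity graph can propagate confidence between hypothesized term occurrences. It is not the authors' algorithm. The function name `random_walk_rescore`, the interpolation weight `alpha`, the iteration count, and the toy similarity values are all assumptions made for illustration; in the paper the similarities would come from the dotplot-based procedure.

```python
def random_walk_rescore(initial_scores, similarity, alpha=0.5, iterations=50):
    """Blend each hypothesis score with the scores of similar hypotheses.

    initial_scores: baseline confidence per hypothesized occurrence.
    similarity: square matrix of non-negative acoustic similarities
                (hypothetical values here, not the paper's dotplot output).
    alpha: assumed weight given to the neighbours' scores.
    """
    n = len(initial_scores)
    transition = []
    for i in range(n):
        # Ignore self-similarity, then normalize the row into probabilities.
        row = [similarity[i][j] if i != j else 0.0 for j in range(n)]
        total = sum(row)
        if total > 0:
            transition.append([w / total for w in row])
        else:
            # An isolated hypothesis gets a self-loop, so its score stays put.
            transition.append([1.0 if j == i else 0.0 for j in range(n)])

    scores = list(initial_scores)
    for _ in range(iterations):
        scores = [
            (1 - alpha) * initial_scores[i]
            + alpha * sum(transition[i][j] * scores[j] for j in range(n))
            for i in range(n)
        ]
    return scores


# Toy example: hypotheses 0 and 1 are acoustically similar, 2 is isolated.
baseline = [0.9, 0.3, 0.3]
sim = [
    [1.0, 0.8, 0.0],
    [0.8, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
print(random_walk_rescore(baseline, sim))
```

In this toy run, hypothesis 1 starts with the same score as hypothesis 2 but ends higher, because it resembles the confident hypothesis 0. That is the intuition behind using acoustic similarity to rescore hypothesized occurrences of out-of-vocabulary terms.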
Pages: 8292 - 8296
Page count: 5
Related papers
50 in total
  • [21] Self-Paced Pattern Augmentation for Spoken Term Detection in Zero-Resource
    Sudhakar, P.
    Rao, Sreenivasa K.
    Mitra, Pabitra
    INTERSPEECH 2023, 2023, : 1618 - 1622
  • [22] FACILITATING OPEN VOCABULARY SPOKEN TERM DETECTION USING A MULTIPLE PASS HYBRID SEARCH ALGORITHM
    Norouzian, Atta
    Rose, Richard
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5169 - 5172
  • [23] Zero-resource audio-only spoken term detection based on a combination of template matching techniques
    Muscariello, Armando
    Gravier, Guillaume
    Bimbot, Frederic
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 928 - 931
  • [24] A Semantic Graph-Based Japanese Vocabulary Learning Game
    Wita, Ratsameetip
    Oly, Sahussarin
    Choomok, Sununta
    Treeratsakulchai, Thanabhorn
    Wita, Surarat
    ADVANCES IN WEB-BASED LEARNING - ICWL 2018, 2018, 11007 : 140 - 145
  • [25] Graph-based APT detection
    Debatty, Thibault
    Mees, Wim
    Gilon, Thomas
    2018 INTERNATIONAL CONFERENCE ON MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS (ICMCIS), 2018,
  • [26] Empirical analysis of score fusion application to combined neural networks for open vocabulary spoken term detection
    Lee, Shi-wook
    Tanaka, Kazuyo
    Itoh, Yoshiaki
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2062 - 2066
  • [27] Graph-Based Resource Sharing in Vehicular Communication
    Liang, Le
    Xie, Shijie
    Li, Geoffrey Ye
    Ding, Zhi
    Yu, Xingxing
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (07) : 4579 - 4592
  • [28] Term-Dependent Confidence for Out-of-Vocabulary Term Detection
    Wang, Dong
    King, Simon
    Frankel, Joe
    Bell, Peter
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2103 - 2106
  • [29] Fusing Multiple Confidence Measures for Chinese Spoken Term Detection
    Ma, Zejun
    Wang, Xiaorui
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1936 - 1939
  • [30] Neural keyword confidence estimation for open-vocabulary keyword spotting
    Liu, Zuozhen
    Li, Ta
    Zhang, Pengyuan
    ELECTRONICS LETTERS, 2022, 58 (03) : 133 - 135