Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection

被引:0
|
作者
Wintrode, Jonathan [1 ]
Khudanpur, Sanjeev [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
来源
PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1 | 2014年
关键词
LANGUAGE MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We aim to improve spoken term detection performance by incorporating contextual information beyond traditional N-gram language models. Instead of taking a broad view of topic context in spoken documents, variability of word co-occurrence statistics across corpora leads us to focus instead the on phenomenon of word repetition within single documents. We show that given the detection of one instance of a term we are more likely to find additional instances of that term in the same document. We leverage this burstiness of keywords by taking the most confident keyword hypothesis in each document and interpolating with lower scoring hits. We then develop a principled approach to select interpolation weights using only the ASR training data. Using this re-weighting approach we demonstrate consistent improvement in the term detection performance across all five languages in the BABEL program.
引用
收藏
页码:1316 / 1325
页数:10
相关论文
共 50 条
  • [21] Evaluation of Fast Spoken Term Detection Using a Suffix Array
    Katsurada, Kouichi
    Sawada, Shinta
    Teshima, Shigeki
    Iribe, Yurie
    Nitta, Tsuneo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 916 - 919
  • [22] On Using Composite Word Embeddings To Improve Biomedical Term Similarity
    Singh, Abhishek
    Jin, Wei
    2020 IEEE 20TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2020), 2020, : 281 - 287
  • [23] SPOKEN TERM DETECTION USING DYNAMIC MATCH SUBWORD CONFUSION NETWORK
    Gao, Jie
    Shao, Jian
    Zhang, Qingqing
    Zhao, Qingwei
    Yan, Yonghong
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 250 - 254
  • [24] Query-by-Example Spoken Term Detection Using Bessel Features
    Vasudev, Drisya
    Gangashetty, Suryakanth V.
    Babu, Anish K. K.
    Riyas, K. S.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [25] Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection
    Ma, Murong
    Wu, Haiwei
    Wang, Xuyang
    Yang, Lin
    Wang, Junjie
    Li, Ming
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [26] Enhancing spoken term detection with deep acoustic word embeddings and cross-modal matching techniques
    Kasikorn Labs, Kasikorn Business-Technology Group, Pop Pu La Road, Nonthaburi
    11120, Thailand
    Int J Speech Technol, 2024, 4 (875-886): : 875 - 886
  • [27] AN INITIAL ATTEMPT TO IMPROVE SPOKEN TERM DETECTION BY LEARNING OPTIMAL WEIGHTS FOR DIFFERENT INDEXING FEATURES
    Chen, Yu-Hui
    Chou, Chia-Chen
    Lee, Hung-Yi
    Lee, Lin-Shan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5278 - 5281
  • [28] Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification
    Toyohashi University of Technology, Japan
    IEEE Workshop Autom. Speech Recognit. Underst., ASRU - Proc., 2015, (200-206):
  • [29] COMBINATION OF SYLLABLE BASED N-GRAM SEARCH AND WORD SEARCH FOR SPOKEN TERM DETECTION THROUGH SPOKEN QUERIES AND IV/OOV CLASSIFICATION
    Sakamoto, Nagisa
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 200 - 206
  • [30] When half a word is enough: Infants can recognize spoken words using partial phonetic information
    Fernald, A
    Swingley, D
    Pinto, JP
    CHILD DEVELOPMENT, 2001, 72 (04) : 1003 - 1015