Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection

被引：0

作者：

Wintrode, Jonathan ^{[1
]}

Khudanpur, Sanjeev ^{[1
]}

机构：

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

来源：

PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1 | 2014年

关键词：

LANGUAGE MODEL;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We aim to improve spoken term detection performance by incorporating contextual information beyond traditional N-gram language models. Instead of taking a broad view of topic context in spoken documents, variability of word co-occurrence statistics across corpora leads us to focus instead the on phenomenon of word repetition within single documents. We show that given the detection of one instance of a term we are more likely to find additional instances of that term in the same document. We leverage this burstiness of keywords by taking the most confident keyword hypothesis in each document and interpolating with lower scoring hits. We then develop a principled approach to select interpolation weights using only the ASR training data. Using this re-weighting approach we demonstrate consistent improvement in the term detection performance across all five languages in the BABEL program.

引用

页码：1316 / 1325

页数：10

共 50 条

[21] Evaluation of Fast Spoken Term Detection Using a Suffix Array
Katsurada, Kouichi
Sawada, Shinta
Teshima, Shigeki
Iribe, Yurie
Nitta, Tsuneo
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 916 - 919
[22] On Using Composite Word Embeddings To Improve Biomedical Term Similarity
Singh, Abhishek
Jin, Wei
2020 IEEE 20TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2020), 2020, : 281 - 287
[23] SPOKEN TERM DETECTION USING DYNAMIC MATCH SUBWORD CONFUSION NETWORK
Gao, Jie
Shao, Jian
Zhang, Qingqing
Zhao, Qingwei
Yan, Yonghong
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 250 - 254
[24] Query-by-Example Spoken Term Detection Using Bessel Features
Vasudev, Drisya
Gangashetty, Suryakanth V.
Babu, Anish K. K.
Riyas, K. S.
2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
[25] Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection
Ma, Murong
Wu, Haiwei
Wang, Xuyang
Yang, Lin
Wang, Junjie
Li, Ming
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[26] Enhancing spoken term detection with deep acoustic word embeddings and cross-modal matching techniques
Kasikorn Labs, Kasikorn Business-Technology Group, Pop Pu La Road, Nonthaburi
11120, Thailand
Int J Speech Technol, 2024, 4 (875-886): : 875 - 886
[27] AN INITIAL ATTEMPT TO IMPROVE SPOKEN TERM DETECTION BY LEARNING OPTIMAL WEIGHTS FOR DIFFERENT INDEXING FEATURES
Chen, Yu-Hui
Chou, Chia-Chen
Lee, Hung-Yi
Lee, Lin-Shan
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5278 - 5281
[28] Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification
Toyohashi University of Technology, Japan
IEEE Workshop Autom. Speech Recognit. Underst., ASRU - Proc., 2015, (200-206):
[29] COMBINATION OF SYLLABLE BASED N-GRAM SEARCH AND WORD SEARCH FOR SPOKEN TERM DETECTION THROUGH SPOKEN QUERIES AND IV/OOV CLASSIFICATION
Sakamoto, Nagisa
Yamamoto, Kazumasa
Nakagawa, Seiichi
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 200 - 206
[30] When half a word is enough: Infants can recognize spoken words using partial phonetic information
Fernald, A
Swingley, D
Pinto, JP
CHILD DEVELOPMENT, 2001, 72 (04) : 1003 - 1015

← 1 2 3 4 5 →