FAST LATTICE-FREE KEYWORD FILTERING FOR ACCELERATED SPOKEN TERM DETECTION

被引:0
|
作者
Wintrode, Jonathan [1 ]
Wilkes, Jenny [1 ]
机构
[1] Raytheon Appl Signal Technol, Annapolis Jct, MD 20701 USA
关键词
speech recognition; keyword spotting; term detection;
D O I
10.1109/icassp40776.2020.9054221
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a novel set of keyword detection techniques to accelerate spoken term detection for known queries with minimal loss in accuracy. Using only ASR frame-level acoustic posteriors we can train multiple models to effectively detect non-target segments for which we need not perform full lattice decoding. We estimate phone n-gram soft counts for each segment in a single pass over the frame-level output. From this we can efficiently detect a fixed set of keywords with both linear and DNN-based classifiers. Furthermore we can train the linear classifiers on a small number of labeled examples. Experiments on the PSC and VAST English subset of NIST's 2019 OpenSAT evaluation demonstrate we can filter out half of the test audio segments while only increasing the keyword miss rate by under 3%.
引用
收藏
页码:7469 / 7473
页数:5
相关论文
共 31 条
  • [21] Fast Spoken Term Detection Using Pre-retrieval Results of Syllable Bigrams
    Saito, Hiroyuki
    Itoh, Yoshiaki
    Kojima, Kazunori
    Ishigame, Masaaki
    Tanaka, Kazuyo
    Lee, Shi-Wook
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [22] IMPROVED SPOKEN TERM DETECTION USING SUPPORT VECTOR MACHINES BASED ON LATTICE CONTEXT CONSISTENCY
    Lee, Hung-yi
    Tu, Tsung-wei
    Chen, Chia-ping
    Huang, Chao-yu
    Lee, Lin-shan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5648 - 5651
  • [23] Improved dynamic match phone lattice search for Persian spoken term detection system in online and offline applications
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 205 - 217
  • [24] Improved dynamic match phone lattice search for Persian spoken term detection system in online and offline applications
    Shima Tabibian
    Ahmad Akbari
    Babak Nasersharif
    International Journal of Speech Technology, 2019, 22 : 205 - 217
  • [25] A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric
    Nakagawa, Seiichi
    Iwami, Keisuke
    Fujii, Yasuhisa
    Yamamoto, Kazumasa
    SPEECH COMMUNICATION, 2013, 55 (03) : 470 - 485
  • [26] Modification in Sequential Dynamic Time Warping for Fast Computation of Query-by-Example Spoken Term Detection Task
    Madhavi, Maulik C.
    Patil, Hemant A.
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [27] Water chicken swarm optimization-based deep segmental neural network for spoken term detection using bayesian filtering
    Kulkarni, Sushil Venkatesh
    Pal, Sukomal
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74711 - 74737
  • [28] DOUBLE-LAYER NEIGHBORHOOD GRAPH BASED SIMILARITY SEARCH FOR FAST QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
    Aoyama, Kazuo
    Ogawa, Atsunori
    Hattori, Takashi
    Hori, Takaaki
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5216 - 5220
  • [30] Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units
    Lee, Hung-Yi
    Tang, Yueh-Lien
    Tang, Hao
    Lee, Lin-Shan
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 410 - +