FAST LATTICE-FREE KEYWORD FILTERING FOR ACCELERATED SPOKEN TERM DETECTION

Cited by: 0
Authors
Wintrode, Jonathan [1 ]
Wilkes, Jenny [1 ]
Affiliations
[1] Raytheon Appl Signal Technol, Annapolis Jct, MD 20701 USA
Keywords
speech recognition; keyword spotting; term detection
DOI
10.1109/icassp40776.2020.9054221
Chinese Library Classification (CLC)
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
We present a novel set of keyword detection techniques to accelerate spoken term detection for known queries with minimal loss in accuracy. Using only ASR frame-level acoustic posteriors, we can train multiple models to effectively detect non-target segments, for which we need not perform full lattice decoding. We estimate phone n-gram soft counts for each segment in a single pass over the frame-level output. From these counts, we can efficiently detect a fixed set of keywords with both linear and DNN-based classifiers. Furthermore, we can train the linear classifiers on a small number of labeled examples. Experiments on the PSC and VAST English subset of NIST's 2019 OpenSAT evaluation demonstrate that we can filter out half of the test audio segments while increasing the keyword miss rate by under 3%.
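
The idea of scoring a segment from phone n-gram soft counts can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes bigrams only, approximates the expected count of a phone pair (a, b) as the sum over adjacent frames of post[t, a] * post[t+1, b], and uses hypothetical names (`soft_bigram_counts`, `segment_score`, `weights`, `bias`) for a linear scorer whose parameters would be trained separately.

```python
import numpy as np

def soft_bigram_counts(post):
    """Approximate expected phone-bigram counts for one segment.

    post: array of shape (T, P), frame-level phone posteriors (rows sum to 1).
    Returns a (P, P) matrix whose entry (a, b) is
    sum_t post[t, a] * post[t + 1, b], computed in one pass over the frames.
    Assumption: this frame-adjacent product is a simplification and also
    counts a phone followed by itself across consecutive frames.
    """
    post = np.asarray(post, dtype=float)
    return post[:-1].T @ post[1:]

def segment_score(post, weights, bias):
    """Linear score over length-normalized soft bigram counts.

    weights: array of shape (P * P,), bias: float (hypothetical, trained elsewhere).
    A segment scoring below a chosen threshold could be skipped
    instead of being sent to full lattice decoding.
    """
    feats = soft_bigram_counts(post).ravel()
    feats = feats / max(len(post) - 1, 1)  # normalize by number of frame pairs
    return float(feats @ weights + bias)
```

Because each posterior row sums to one, the entries of the count matrix sum to T - 1, so dividing by the number of frame pairs makes the features comparable across segments of different lengths.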
Pages: 7469-7473
Page count: 5