A Lightweight Architecture for Query-by-Example Keyword Spotting on Low-Power IoT Devices

被引:5
|
作者
Li, Meirong [1 ]
机构
[1] Xian Aeronaut Univ, Sch Comp Sci, Xian 710077, Peoples R China
关键词
Feature extraction; Internet of Things; Computer architecture; Neural networks; Keyword search; Task analysis; Recurrent neural networks; Keyword spotting; convolutional recurrent neural network; model compression; segmental local normalized DTW algorithm; SMALL-FOOTPRINT; NEURAL-NETWORK;
D O I
10.1109/TCE.2022.3213075
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Keyword spotting (KWS) is a task to recognize a keyword or a particular command in a continuous audio stream, which can be effectively applied to a voice trigger system that automatically monitors and processes speech signals. This paper focuses on the problem of user-defined keyword spotting in low-resource settings. A lightweight neural network architecture is developed for tackling the keyword detection task using query-by-example (QbyE) techniques. The architecture uses a convolutional recurrent neural network (CRNN) to extract the frame-level features of input audio signals. A customized model compression method is proposed to compress the network, making it suitable for low power settings. In the keyword enrollment, all enrolled keyword examples are merged to generate a single keyword template, which is responsible for detecting a target keyword in keyword search. To improve the efficiency of keyword searching, a segmental local normalized DTW algorithm is introduced. Experiments on the real-world collected datasets show that our approach consistently outperforms the state-of-the-art methods, and the proposed system can run on an ARM Cortex-A7 processor and achieve real-time keyword detection.
引用
收藏
页码:65 / 75
页数:11
相关论文
共 50 条
  • [1] QUERY-BY-EXAMPLE ON-DEVICE KEYWORD SPOTTING
    Kim, Byeonggeun
    Lee, Mingu
    Lee, Jinkyu
    Kim, Yeonseok
    Hwang, Kyuwoong
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 532 - 538
  • [2] Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting
    Ma, Haoxin
    Bai, Ye
    Yi, Jiangyan
    Tao, Jianhua
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 868 - 872
  • [3] Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting
    Shao, Qijie
    Hou, Jingyong
    Hu, Yanxin
    Wang, Qing
    Xie, Lei
    Lei, Xin
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 672 - 678
  • [4] High Performance Query-by-Example Keyword Spotting Using Query-by-String Techniques
    Vidal, Enrique
    Toselli, Alejandro H.
    Puigcerver, Joan
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 741 - 745
  • [5] QUERY-BY-EXAMPLE KEYWORD SPOTTING USING LONG SHORT-TERM MEMORY NETWORKS
    Chen, Guoguo
    Parada, Carolina
    Sainath, Tara N.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5236 - 5240
  • [6] QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS
    Huang, Jinmiao
    Gharbieh, Waseem
    Shim, Han Suk
    Kim, Eugene
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6858 - 6862
  • [7] QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
    Huang, Jinmiao
    Gharbieh, Waseem
    Wan, Qianhui
    Shim, Han Suk
    Lee, Hyun Chul
    INTERSPEECH 2022, 2022, : 5200 - 5204
  • [8] Query by Example Keyword Spotting in Streams of Audio
    Camarena-Ibarrola, Antonio
    Ruiz-Perez, Martin
    2015 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2015,
  • [9] Grouping Historical Postcards Using Query-by-Example Word Spotting
    Fink, Gernot A.
    Rothacker, Leonard
    Grzeszick, Rene
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 470 - 475
  • [10] DEMO: Mobile Relay Architecture for Low-Power IoT Devices
    Manzoor, Ahsan
    Porambage, Pawani
    Liyanage, Madhsanka
    Ylianttila, Mika
    Gurtov, Andrei
    2018 IEEE 19TH INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM), 2018,