A Binary Keyword Spotting System with Error-Diffusion Based Feature Binarization

被引:0
|
作者
Wang, Dingyi [1 ]
Luo, Mengjie [1 ,2 ]
Li, Lin [1 ,2 ]
Wang, Xiaoqin [1 ,2 ]
Qiao, Shushan [1 ,2 ]
Zhou, Yumei
机构
[1] Chinese Acad Sci, Inst Microelect, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
关键词
Keyword spotting; binary neural network; error diffusion; convolutional neural networks;
D O I
10.21437/Interspeech.2023-258
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Binary-neural-network based keyword spotting (KWS) for resource-constrained devices has gained much attention in recent years. Although several works proved their success, a fully binary KWS system is yet to come, considering high-precision speech feature maps are still required for satisfying accuracy. Such precision mismatch results in non-binary activation layers, thus leading to extra computational costs. In this paper, we present an extremely compact KWS system using a binary neural network and error-diffusion binarized speech features. The system eliminates all high-precision multiplications and requires only hardware-friendly bit-wise operations and additions for inference. Experiments on the Google speech commands show that our binary KWS system yields 98.54% accuracy on a 1-keyword task and 95.05% on a 2-keyword task, outperforming 8-bit KWS systems of bigger size. The result proves the feasibility of a fully binary KWS system and can be inspiring for hardware implementations.
引用
收藏
页码:1424 / 1428
页数:5
相关论文
共 50 条
  • [1] Error-Diffusion Based Speech Feature Quantization for Small-Footprint Keyword Spotting
    Luo, Mengjie
    Wang, Dingyi
    Wang, Xiaoqin
    Qiao, Shushan
    Zhou, Yumei
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1357 - 1361
  • [2] ERROR-DIFFUSION BINARIZATION FOR JOINT TRANSFORM CORRELATORS
    INBAR, H
    MENDLOVIC, D
    MAROM, E
    APPLIED OPTICS, 1993, 32 (05) : 707 - 714
  • [3] Error-diffusion binarization for joint transform correlators
    Inbar, H., 1600, Optical Society of America (OSA) (32):
  • [4] Error-diffusion fitting of direct-binary-search halftones
    Ho, Ken-Chung
    Liao, Jing-Cheng
    ISM WORKSHOPS 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA - WORKSHOPS, PROCEEDINGS, 2007, : 104 - 109
  • [5] ONE-DIMENSIONAL ERROR-DIFFUSION TECHNIQUE ADAPTED FOR BINARIZATION OF ROTATIONALLY SYMMETRICAL PUPIL FILTERS
    KOWALCZYK, M
    MARTINEZCORRAL, M
    CICHOCKI, T
    ANDRES, P
    OPTICS COMMUNICATIONS, 1995, 114 (3-4) : 211 - 218
  • [6] Simple gradient-based error-diffusion method
    Hu, Xiangyu Y.
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (04)
  • [7] A dialogue system based on keyword spotting technology
    Ru, HB
    Chen, YN
    Liu, J
    Liu, RS
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT SYSTEMS AND SIGNAL PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2003, : 38 - 42
  • [8] Sub-mW Keyword Spotting on an MCU: Analog Binary Feature Extraction and Binary Neural Networks
    Cerutti, Gianmarco
    Cavigelli, Lukas
    Andri, Renzo
    Magno, Michele
    Farella, Elisabetta
    Benini, Luca
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (05) : 2002 - 2012
  • [9] A keyword spotting method based on speech feature space trace matching
    Wu, XH
    Wu, YD
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4193 - 4193
  • [10] Keyword spotting method based on speech feature space trace matching
    Wu, YD
    Liu, BL
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3188 - 3192