A Binary Keyword Spotting System with Error-Diffusion Based Feature Binarization

被引:0
|
作者
Wang, Dingyi [1 ]
Luo, Mengjie [1 ,2 ]
Li, Lin [1 ,2 ]
Wang, Xiaoqin [1 ,2 ]
Qiao, Shushan [1 ,2 ]
Zhou, Yumei
机构
[1] Chinese Acad Sci, Inst Microelect, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
关键词
Keyword spotting; binary neural network; error diffusion; convolutional neural networks;
D O I
10.21437/Interspeech.2023-258
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Binary-neural-network based keyword spotting (KWS) for resource-constrained devices has gained much attention in recent years. Although several works proved their success, a fully binary KWS system is yet to come, considering high-precision speech feature maps are still required for satisfying accuracy. Such precision mismatch results in non-binary activation layers, thus leading to extra computational costs. In this paper, we present an extremely compact KWS system using a binary neural network and error-diffusion binarized speech features. The system eliminates all high-precision multiplications and requires only hardware-friendly bit-wise operations and additions for inference. Experiments on the Google speech commands show that our binary KWS system yields 98.54% accuracy on a 1-keyword task and 95.05% on a 2-keyword task, outperforming 8-bit KWS systems of bigger size. The result proves the feasibility of a fully binary KWS system and can be inspiring for hardware implementations.
引用
收藏
页码:1424 / 1428
页数:5
相关论文
共 50 条
  • [31] Database Design for Error Searching System Based on Keyword Priority
    Yang, Fan
    Dong, Zhenghong
    Liu, Zhiwei
    2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 526 - 530
  • [32] HMM Based Keyword Spotting System in Printed/Handwritten Arabic/Latin Documents with Identification Stage
    Rouhou, Ahmed Cheikh
    Kessentini, Yousri
    Kanoun, Slim
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 309 - 320
  • [33] A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge
    Smirnov, Valentin
    Ignatov, Dmitry
    Gusev, Michael
    Farkhadov, Mais
    Rumyantseva, Natalia
    Farkhadova, Mukhabbat
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
  • [34] Adaptive Digital Hologram Binarization Method Based on Local Thresholding, Block Division and Error Diffusion
    Cheremkhin, Pavel A.
    Kurbatova, Ekaterina A.
    Evtikhiev, Nikolay N.
    Krasnov, Vitaly V.
    Rodin, Vladislav G.
    Starikov, Rostislav S.
    JOURNAL OF IMAGING, 2022, 8 (02)
  • [35] Duration Model-Based Post-processing for the Performance Improvement of a Keyword Spotting System
    Lee, Min Ji
    Yoon, Jae Sam
    Oh, Yoo Rhee
    Kim, Hong Kook
    Choi, Song Ha
    Kim, Ji Woon
    Kim, Myeong Bo
    COMMUNICATION AND NETWORKING, PT II, 2010, 120 : 148 - +
  • [36] A 23-μW Keyword Spotting IC With Ring-Oscillator-Based Time-Domain Feature Extraction
    Kim, Kwantae
    Gao, Chang
    Graca, Rui
    Kiselev, Ilya
    Yoo, Hoi-Jun
    Delbruck, Tobi
    Liu, Shih-Chii
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2022, 57 (11) : 3298 - 3311
  • [37] Time-Delay-Neural-Network-Based Audio Feature Extractor for Ultra-Low Power Keyword Spotting
    Fuketa, Hiroshi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 334 - 338
  • [38] A novel spoken document retrieval system using Auto Associative Neural Network based keyword spotting
    Sangeetha, J.
    Jothilakshmi, S.
    PROCEEDINGS OF 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO), 2015,
  • [39] Thermometer Code of Log Mel-Frequency Spectral Coefficient for BNN-based Keyword Spotting System
    Jiao, Yuzhong
    Li, Yiu Kei
    Chan, Chi Hong
    Li, Yun
    Ai, Zhilin
    2022 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS, 2022, : 414 - 418
  • [40] DLiGRU-X: Efficient X-Vector-Based Embeddings for Small-Footprint Keyword Spotting System
    Wu, Zong-En
    Chan, Shao-Jung
    Wubet, Yeshanew Ale
    Lian, Kuang-Yow
    IEEE ACCESS, 2025, 13 : 23498 - 23507