Energy-friendly keyword spotting system using add-based convolution

被引:1
|
作者
Zhou, Hang [1 ]
Hu, Wenchao [1 ]
Yeung, Yu Ting [1 ]
Chen, Xiao [1 ]
机构
[1] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
来源
关键词
keyword spotting; energy-friendly; human-computer interaction;
D O I
10.21437/Interspeech.2021-458
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Wake-up keyword of a keyword spotting (KWS) system represents brand name of a smart device. Performance of KWS is also crucial for modern speech based human-device interaction. An on-device KWS with both high accuracy and low power consumption is desired. We propose a KWS with add-based convolution layers, namely Add TC-ResNet. Add-based convolution paves a new way to reduce power consumption of KWS system, as addition is more energy efficient than multiplication at hardware level. On Google Speech Commands dataset V2, Add TC-ResNet achieves an accuracy of 97.1%, with 99% of multiplication operations are replaced by addition operations. The result is competitive to a state-of-the-art fully multiplication-based TC-ResNet KWS. We also investigate knowledge distillation and a mixed addition-multiplication design for the proposed KWS, which leads to further performance improvement.
引用
收藏
页码:4234 / 4238
页数:5
相关论文
共 50 条
  • [31] Speech Recognition for Keyword Spotting using a Set of Modulation Based Features - Preliminary Results
    Gopalan, Kaliappan
    Chu, Tao
    IMCIC 2010: INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL II, 2010, : 32 - 36
  • [32] Improving Keyword Detection Rate Using a Set of Rules to Merge HMM-based and SVM-based Keyword Spotting Results
    Shokri, Akram
    Davarpour, Mohammad Hossein
    Akbari, Ahmad
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 1715 - 1718
  • [33] Improvement of Audio-Visual Keyword Spotting System Accuracy Using Excitation Source Feature
    Nandakishor, Salam
    Pati, Debadatta
    SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 344 - 356
  • [34] FPGA Implementation of Keyword Spotting System Using Depthwise Separable Binarized and Ternarized Neural Networks
    Bae, Seongwoo
    Kim, Haechan
    Lee, Seongjoo
    Jung, Yunho
    SENSORS, 2023, 23 (12)
  • [35] An improved Mandarin keyword spotting system using mce training and context-enhanced verification
    Liang, JiaEn
    Meng, Meng
    Wang, XiaoRui
    Ding, Peng
    Xu, Bo
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1145 - 1148
  • [36] KEYWORD SPOTTING SYSTEM WITH NANO 33 BLE SENSE USING EMBEDDED MACHINE LEARNING APPROACH
    Abbas, Nurul Atikah
    Ahmad, Mohd Ridzuan
    JURNAL TEKNOLOGI-SCIENCES & ENGINEERING, 2023, 85 (03): : 175 - 182
  • [37] HMM Based Keyword Spotting System in Printed/Handwritten Arabic/Latin Documents with Identification Stage
    Rouhou, Ahmed Cheikh
    Kessentini, Yousri
    Kanoun, Slim
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 309 - 320
  • [38] A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge
    Smirnov, Valentin
    Ignatov, Dmitry
    Gusev, Michael
    Farkhadov, Mais
    Rumyantseva, Natalia
    Farkhadova, Mukhabbat
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
  • [39] Duration Model-Based Post-processing for the Performance Improvement of a Keyword Spotting System
    Lee, Min Ji
    Yoon, Jae Sam
    Oh, Yoo Rhee
    Kim, Hong Kook
    Choi, Song Ha
    Kim, Ji Woon
    Kim, Myeong Bo
    COMMUNICATION AND NETWORKING, PT II, 2010, 120 : 148 - +
  • [40] QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS
    Huang, Jinmiao
    Gharbieh, Waseem
    Shim, Han Suk
    Kim, Eugene
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6858 - 6862