Memory-efficient modeling and search techniques for hardware ASR decoders

被引:7
|
作者
Price, Michael [1 ,2 ]
Chandrakasan, Anantha [2 ]
Glass, James [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Microsyst Technol Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
speech recognition; neural networks; fixed-point arithmetic; embedded systems; SPEECH RECOGNITION; MW;
D O I
10.21437/Interspeech.2016-287
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper gives an overview of acoustic modeling and search techniques for low-power embedded ASR decoders. Our design decisions prioritize memory bandwidth, which is the main driver in system power consumption. We evaluate three acoustic modeling approaches Gaussian mixture model (GMM), subspace GMM (SGMM) and deep neural network (DNN) and identify tradeoffs between memory bandwidth and recognition accuracy. We also present an HMM search scheme with WFST compression and caching, predictive beam width control, and a word lattice. Our results apply to embedded system implementations using microcontrollers, DSPs, FPGAs, or ASICs.
引用
收藏
页码:1893 / 1897
页数:5
相关论文
共 50 条
  • [21] Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration
    Haokui Zhang
    Ying Li
    Hao Chen
    Chengrong Gong
    Zongwen Bai
    Chunhua Shen
    International Journal of Computer Vision, 2022, 130 : 157 - 178
  • [22] Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration
    Zhang, Haokui
    Li, Ying
    Chen, Hao
    Gong, Chengrong
    Bai, Zongwen
    Shen, Chunhua
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (01) : 157 - 178
  • [23] A Memory-Efficient Search Strategy for Multiobjective Shortest Path Problems
    Mandow, L.
    Perez de la Cruz, J. L.
    KI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5803 : 25 - 32
  • [24] Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising
    Zhang, Haokui
    Li, Ying
    Chen, Hao
    Shen, Chunhua
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3654 - 3663
  • [25] Fast and memory-efficient NN search in wireless data broadcast
    Lee, Myong-Soo
    Lee, SangKeun
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2006, 4208 : 662 - 671
  • [26] An memory-efficient variable length decoding scheme for embedded MPEG-4 video decoders
    Guo, Hongxing
    Xia, Xiaojian
    Sun, Weiping
    Zbou, Jingli
    Yu, Shengsheng
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1694 - +
  • [27] Hardware- and Memory-Efficient Architecture for Disparity Estimation of Large Label Counts
    Wu, Sih-Sian
    Chen, Hon-Hui
    Chen, Liang-Gee
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (09) : 3679 - 3693
  • [28] A Memory-Efficient True-RMS Estimator in a Limited-Resources Hardware
    Flores-Arias, Jose-Maria
    Ortiz-Lopez, Manuel
    Quiles Latorre, Francisco J.
    Jose Bellido-Outeirino, Francisco
    Moreno-Munoz, Antonio
    ENERGIES, 2019, 12 (09)
  • [29] A High-Throughput and Memory-Efficient Deblocking Filter Hardware Architecture for VVC
    Hou, Bingjing
    Huang, Leilei
    Jing, Minge
    Fan, Yibo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13569 - 13583
  • [30] A Memory-efficient Multi-dimensional Hardware-specific Algorithm for Packet Classification
    Huo Hongwei
    Ye Mangu
    Gao Dongpei
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (04): : 634 - 636