FPGA-based Low-power Speech Recognition with Recurrent Neural Networks

被引:38
|
作者
Lee, Minjae [1 ]
Hwang, Kyuyeon [1 ]
Park, Jinhwan [1 ]
Choi, Sungwook [1 ]
Shin, Sungho [1 ]
Sung, Wonyong [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, 1 Gwanak Ro, Seoul 08826, South Korea
关键词
D O I
10.1109/SiPS.2016.48
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a neural network based real-time speech recognition (SR) system is developed using an FPGA for very low-power operation. The implemented system employs two recurrent neural networks (RNNs); one is a speech-to-character RNN for acoustic modeling (AM) and the other is for character-level language modeling (LM). The system also employs a statistical word-level LM to improve the recognition accuracy. The results of the AM, the character-level LM, and the word-level LM are combined using a fairly simple N-best search algorithm instead of the hidden Markov model (HMM) based network. The RNNs are implemented using massively parallel processing elements (PEs) for low latency and high throughput. The weights are quantized to 6 bits to store all of them in the on-chip memory of an FPGA. The proposed algorithm is implemented on a Xilinx XC7Z045, and the system can operate much faster than real-time.
引用
收藏
页码:230 / 235
页数:6
相关论文
共 50 条
  • [21] Visual speech recognition by recurrent neural networks
    Rabi, G
    Lu, SW
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 55 - 58
  • [22] SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS
    Graves, Alex
    Mohamed, Abdel-rahman
    Hinton, Geoffrey
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6645 - 6649
  • [23] Unfolded Recurrent Neural Networks for Speech Recognition
    Saon, George
    Soltau, Hagen
    Emami, Ahmad
    Picheny, Michael
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 343 - 347
  • [24] Speech recognition with hierarchical recurrent neural networks
    Natl Chiao Tung Univ, Hsinchu, Taiwan
    Pattern Recognit, 6 (795-805):
  • [25] FPGA-based Accelerator for Long Short-Term Memory Recurrent Neural Networks
    Guan, Yijin
    Yuan, Zhihang
    Sun, Guangyu
    Cong, Jason
    2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, : 629 - 634
  • [26] An FPGA Implementation of Deep Spiking Neural Networks for Low-Power and Fast Classification
    Ju, Xiping
    Fang, Biao
    Yan, Rui
    Xu, Xiaoliang
    Tang, Huajin
    NEURAL COMPUTATION, 2020, 32 (01) : 182 - 204
  • [27] Monolithic 3D IC Designs for Low-Power Deep Neural Networks Targeting Speech Recognition
    Chang, Kyungwook
    Kadetotad, Deepak
    Cao, Yu
    Seo, Jae-sun
    Lim, Sung Kyu
    2017 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2017,
  • [28] An FPGA-Based Low-Latency Accelerator for Randomly Wired Neural Networks
    Kuramochi, Ryosuke
    Nakahara, Hiroki
    2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 298 - 303
  • [29] Low-Power FPGA-Based Display Processing Module for Head-Mounted Displays
    Sengupta, Dipanjan
    Hoskinson, Reynald
    Mirabbasi, Shahriar
    Ivanov, Milen
    Abdollahi, Hamid
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 661 - +
  • [30] A FPGA-based HMM for a discrete arabic speech recognition system
    Elmisery, FA
    Khalil, AH
    Salama, AE
    Hammed, HF
    ICM 2003: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS, 2003, : 322 - 325