FPGA-based Low-power Speech Recognition with Recurrent Neural Networks

被引:38
|
作者
Lee, Minjae [1 ]
Hwang, Kyuyeon [1 ]
Park, Jinhwan [1 ]
Choi, Sungwook [1 ]
Shin, Sungho [1 ]
Sung, Wonyong [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, 1 Gwanak Ro, Seoul 08826, South Korea
关键词
D O I
10.1109/SiPS.2016.48
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a neural network based real-time speech recognition (SR) system is developed using an FPGA for very low-power operation. The implemented system employs two recurrent neural networks (RNNs); one is a speech-to-character RNN for acoustic modeling (AM) and the other is for character-level language modeling (LM). The system also employs a statistical word-level LM to improve the recognition accuracy. The results of the AM, the character-level LM, and the word-level LM are combined using a fairly simple N-best search algorithm instead of the hidden Markov model (HMM) based network. The RNNs are implemented using massively parallel processing elements (PEs) for low latency and high throughput. The weights are quantized to 6 bits to store all of them in the on-chip memory of an FPGA. The proposed algorithm is implemented on a Xilinx XC7Z045, and the system can operate much faster than real-time.
引用
收藏
页码:230 / 235
页数:6
相关论文
共 50 条
  • [41] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547
  • [42] A new model of recurrent neural networks for speech recognition
    Xu, W
    Zhu, XY
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 1134 - 1137
  • [43] Chaotic recurrent neural networks and their application to speech recognition
    Ryeu, JK
    Chung, HS
    NEUROCOMPUTING, 1996, 13 (2-4) : 281 - 294
  • [44] Implementing High-Performance, Low-Power FPGA-Based Optical Flow Accelerators in C
    Monson, Josh
    Wirthlin, Mike
    Hutchings, Brad L.
    PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 363 - 369
  • [45] Discrete Event System Specification, Synthesis, and Optimization of Low-Power FPGA-based Embedded Systems
    Pifer, Tim
    Schwartz, David
    Lysecky, Roman
    Seo, Chungman
    Zeigler, Bernard P.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2013, : 98 - 105
  • [46] Small-Area and Low-Power FPGA-Based Multipliers using Approximate Elementary Modules
    Guo, Yi
    Sun, Heming
    Kimura, Shinji
    2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020, : 599 - 604
  • [47] Sensor Fusion Neural Networks for Gesture Recognition on Low-power Edge Devices
    Balazs, Gabor
    Chmurski, Mateusz
    Stechele, Walter
    Zubert, Mariusz
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 141 - 150
  • [48] An FPGA-Based On-the-Fly Reconfigurable Low-Power SHEPWM Inverter With a Compact SiP Implementation
    Aimaier, Nueraimaiti
    Blaquiere, Yves
    Constantin, Nicolas G.
    Cowan, Glenn E. R.
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2024, 39 (05) : 5942 - 5953
  • [49] A low-power integrated circuit for remote speech recognition
    Borgatti, M
    Felici, M
    Ferrari, A
    Guerrieri, R
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1998, 33 (07) : 1082 - 1089
  • [50] Modulation recognition using an FPGA-based convolutional neural network
    Liu, Xueyuan
    Shang, Jing
    Leong, Philip H. W.
    Liu, Cheng
    2019 22ND INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS 2019), 2019, : 3165 - 3170