Bangla Speech Recognition System using LPC and ANN

被引:36
|
作者
Paul, Anup Kumar [1 ]
Das, Dipankar [2 ]
Kamal, Md. Mustafa [1 ]
机构
[1] Dhaka City Coll, Dhaka, Bangladesh
[2] Rajshahi Univ, Dept Informat & Commun Engn, Rajshahi 6205, Bangladesh
关键词
D O I
10.1109/ICAPR.2009.80
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the Bangla speech recognition system. Bangla speech recognition system is divided mainly into two major parts. The first part is speech signal processing and the second part is speech pattern recognition technique. The speech processing stage consists of speech starting and end point detection, windowing, filtering, calculating the Linear Predictive Coding(LPC) and Cepstral Coefficients and finally constructing the codebook by vector quantization. The second part consists of pattern recognition system using Artificial Neural Network(ANN). Speech signals are recorded using an audio wave recorder in the normal room environment. The recorded speech signal is passed through the speech starting and end-point detection algorithm to detect the presence of the speech signal and remove the silence and pauses portions of the signals. The resulting signal is then filtered for the removal of unwanted background noise from the speech signals. The filtered signal is then windowed ensuring half frame overlap. After windowing, the speech signal is then subjected to calculate the LPC coefficient and Cepstral coefficient. The feature extractor uses a standard LPC Cepstrum coder, which converts the incoming speech signal into LPC Cepstrurn feature space. The Self Organizing Map(SOM) Neural Network makes each variable length LPC trajectory of an isolated word into a fixed length LPC trajectory and thereby making the fixed length feature vector, to be fed into to the recognizer. The structures of the neural network is designed with Multi Layer Perceptron approach and tested with 3, 4, 5 hidden layers using the Transfer functions of Tanh Sigmoid for the Bangla speech recognition system. Comparison among different structures of Neural Networks conducted here for a better understanding of the problem and its possible solutions.
引用
收藏
页码:171 / 174
页数:4
相关论文
共 50 条
  • [1] Analysis on Handwritten Bangla Character Recognition Using ANN
    Rahaman, Arifur
    Hasan, Md Mehedi
    Shuvo, Md Faisal
    Ovi, Md Abu Saleh
    Rahman, Md Mostafizur
    2014 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2014,
  • [2] Pitch and Formant Estimation of Bangla Speech Signal Using Autocorrelation, Cepstrum and LPC Algorithm
    Aadit, Muhammad Navid Anjum
    Kirtania, Sharadindu Gopal
    Mahin, Mehnaz Tabassum
    PROCEEDINGS OF THE 2016 19TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2016, : 371 - 376
  • [3] BANGLA ISOLATED WORD SPEECH RECOGNITION
    Firoze, Adnan
    Arifin, M. Shamsul
    Quadir, Ryana
    Rahman, Rashedur M.
    ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 2, 2011, : 73 - 82
  • [4] Bangla Speech Recognition for Voice Search
    Saurav, Jillur Rahman
    Amin, Shakhawat
    Kibria, Shafkat
    Rahman, M. Shahidur
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [5] A DC Motor Speed Control Using The LPC-ANFIS Speech Recognition System
    Akil, Muhammad
    Nurtanio, Ingrid
    Sadjad, Rhiza Samsoe'oed
    2017 15TH INTERNATIONAL CONFERENCE ON QUALITY IN RESEARCH (QIR) - INTERNATIONAL SYMPOSIUM ON ELECTRICAL AND COMPUTER ENGINEERING, 2017, : 215 - 220
  • [6] LPC AND LPCC METHOD OF FEATURE EXTRACTION IN SPEECH RECOGNITION SYSTEM
    Gupta, Harshita
    Gupta, Divya
    2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 498 - 502
  • [7] An LPC cepstrum processor for speech recognition
    Hwang, IC
    Kim, SN
    Kim, YW
    Kim, SW
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : C233 - C236
  • [8] Bangla Short Speech Commands Recognition Using Convolutional Neural Networks
    Sumon, Shakil Ahmed
    Chowdhury, Joydip
    Debnath, Sujit
    Mohammed, Nabeel
    Momen, Sifat
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [9] Recent Advancement in Speech Recognition for Bangla: A Survey
    Sultana, Sadia
    Rahman, M. Shahidur
    Iqbal, M. Zafar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 546 - 552
  • [10] Acoustic Modeling using Deep Belief Network for Bangla Speech Recognition
    Ahmed, Mahtab
    Shill, Pintu Chandra
    Islam, Kaidul
    Mollah, Md. Abdus Salim
    Akhand, M. A. H.
    2015 18TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2015, : 306 - 311