Continuous Speech Recognition of Kannada Language using Triphone Modeling

被引:0
|
作者
Sajjan, Sharada C. [1 ]
Vijaya, C. [1 ]
机构
[1] SDM Coll Engn & Technol, Dept Elect & Commun Engn, Dharwad, Karnataka, India
关键词
Acoustic Model; GMM; HMM; Hidden Markov Model Tool Kit(HTK); Kannada language; Language model; MFCC; Monophones; Pronunciation dictionary; Tied State Triphones;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper aims to build continuous speech recognition of regional language Kannada using phoneme modeling wherein each phoneme is represented by tristate Hidden Markov Model(HMM) with each state being represented by Gaussian Mixture Model (GMM). The recordings were sampled at the rate of 16 kHz, blocked into overlapped frames of 25msec duration with 10 msec frame overlap using hamming window, converted into 39 dimensional Mel Frequency Cepstral Coefficients(MFCC), then modeled using continuous density HMM. Kannada language has 46 phonemes, out of which 12 phonemes represent vowels (swaragalu) and 34 phonemes represent consonants (vyanjanagalu). The recognition performance is tested for monophone modeling, word internal triphone modeling and tied state triphone modeling for different gaussian mixtures and results have been presented.
引用
收藏
页码:451 / 455
页数:5
相关论文
共 50 条
  • [1] Application of Triphone Clustering in Acoustic Modeling for Continuous Speech Recognition in Bengali
    Banerjee, Pratyush
    Garg, Gaurav
    Mitra, Pabitra
    Basu, Anupam
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1682 - 1685
  • [2] Continuous speech recognition based on the triphone DDBHMM
    You, Zhan
    Xiao, Xi
    Wang, Zuoying
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2009, 49 (04): : 506 - 509
  • [3] Kannada Continuous Speech Recognition Using Deep Learning
    Paul, Shubhojeet
    Bhattacharjee, Vandana
    Saha, Sujan Kumar
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT IV, 2024, 2093 : 258 - 269
  • [4] Continuous Kannada Noisy Speech Recognition
    Pasha, Nadeem
    Roopa, S.
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 857 - 861
  • [5] Amazigh speech recognition using triphone modeling and clustering tree decision
    El Ouahabi, Safaa
    Atounti, Mohamed
    Bellouki, Mohamed
    ANNALS OF THE UNIVERSITY OF CRAIOVA-MATHEMATICS AND COMPUTER SCIENCE SERIES, 2019, 46 (01): : 56 - 65
  • [6] Speaker Dependent Continuous Kannada Speech Recognition Using HMM
    Hemakumar, G.
    Punitha, P.
    2014 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING APPLICATIONS (ICICA 2014), 2014, : 402 - 405
  • [7] Effective Triphone Mapping for Acoustic Modeling in Speech Recognition
    Darjaa, Sakhia
    Cernak, Milos
    Trnka, Marian
    Rusko, Milan
    Sabo, Robert
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1728 - 1731
  • [8] A Robust Bengali Continuous Speech Recognizer Using Triphone and Trigram Language Model
    Mandal, Sandipan
    Das, Biswajit
    Mitra, Pabitra
    Basu, Anupam
    CONTEMPORARY COMPUTING, 2011, 168 : 556 - 558
  • [9] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196
  • [10] A Bayesian approach for building triphone models for continuous speech recognition
    Ming, J
    O'Boyle, P
    Owens, M
    Smith, FJ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (06): : 678 - 684