Throat Microphone Speech Recognition using MFCC

被引:0
|
作者
Vijayan, Amritha [1 ]
Mathai, Bipil Mary [1 ]
Valsalan, Karthik [1 ]
Johnson, Riyanka Raji [1 ]
Mathew, Lani Rachel [1 ]
Gopakumar, K. [2 ]
机构
[1] Mar Baselios Coll Engn & Technol, Dept Elect & Commun Engn, Trivandrum, Kerala, India
[2] TKM Coll Engn & Technol, Dept Elect & Commun Engn, Kollam, Kerala, India
关键词
Throat Microphone; MFCC; vocal fold vibrations; Minimum Mean Square Analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Throat Microphone (TM) is a non-acoustic device, relying on the vibrations of vocal folds rather than the audible sound produced. Correctly capturing vocal fold vibrations is difficult due to poor signal representation capabilities. The system recognizes the TM vibrations and produces the corresponding speech sound. This is done by extracting features from the spectrum of the TM vibrations and comparing the obtained features with the values stored in a database. The extracted features include characteristic features of the speech waveform called Mel-Frequency Cepstral Coefficients (MFCC). The selection of the closest speech signal is chosen by the minimum mean square error estimation method, where the signal in the database whose corresponding MFCC values show the least difference from the input speech MFCCs is selected. This system has the potential of having applications for giving voice to those with defective speech and in military communications.
引用
收藏
页码:392 / 395
页数:4
相关论文
共 50 条
  • [1] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
  • [2] Knowledge Distillation for Throat Microphone Speech Recognition
    Suzuki, Takahito
    Ogata, Jun
    Tsunakawa, Takashi
    Nishida, Masafumi
    Nishimura, Masafumi
    INTERSPEECH 2019, 2019, : 461 - 465
  • [3] Improving Throat Microphone Speech Recognition by Joint Analysis of Throat and Acoustic Microphone Recordings
    Erzin, Engin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (07): : 1316 - 1324
  • [4] Effects of microphone mounting location and gender on accuracy in speech recognition using a throat microphone
    Konuma, Y.
    Asakura, T.
    JASA EXPRESS LETTERS, 2023, 3 (09):
  • [5] Improving phoneme recognition of throat microphone speech recordings using transfer learning
    Turan, M. A. Tugtekin
    Erzin, Engin
    SPEECH COMMUNICATION, 2021, 129 : 25 - 32
  • [6] DNN-based Feature Transformation for Speech Recognition Using Throat Microphone
    Lin, Shengke
    Tsunakawa, Takashi
    Nishida, Masafumi
    Nishimura, Masafumi
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 596 - 599
  • [7] Speech Disorder Recognition using MFCC
    Jhawar, Gunjan
    Nagraj, Prajacta
    Mahalakshmi, P.
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 246 - 250
  • [8] Speech Recognition using MFCC and DTW
    Mohan, Bhadragiri Jagan
    Babu, Ramesh N.
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2014,
  • [9] Improving Recognition of Syallabic Units of Hindi Languagae Using Combined Features of Throat Microphone and Normal Microphone Speech
    Radha, N.
    Shahina, A.
    Vinoth, G.
    Khan, A. Nayeemulla
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1343 - 1348
  • [10] IMPROVED SPEECH RECOGNITION IN NOISY ENVIRONMENTS BY USING A THROAT MICROPHONE FOR ACCURATE VOICING DETECTION
    Dekens, Tomas
    Verhelst, Werner
    Capman, Francois
    Beaugendre, Frederic
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1978 - 1982