Deep Learning Bidirectional LSTM based Detection of Prolongation and Repetition in Stuttered Speech using Weighted MFCC

被引:0
|
作者
Gupta, Sakshi [1 ]
Shukla, Ravi S. [2 ]
Shukla, Rajesh K. [1 ]
Verma, Rajesh [3 ]
机构
[1] Invertis Univ, Dept Comp Sci & Engn, Bareilly, Uttar Pradesh, India
[2] Saudi Elect Univ, Dept Comp Sci, Riyadh, Saudi Arabia
[3] King Khalid Univ, Dept Elect Engn, Abha, Saudi Arabia
关键词
Speech; stuttering; deep learning; WMFCC; Bi-LSTM; CLASSIFICATION; DYSFLUENCIES; RECOGNITION;
D O I
10.14569/IJACSA.2020.0110941
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stuttering is a neuro-development disorder during which normal speech flow is not fluent. Traditionally Speech-Language Pathologists used to assess the extent of stuttering by counting the speech disfluencies manually. Such sorts of stuttering assessments are arbitrary, incoherent, lengthy, and error-prone. The present study focused on objective assessment to speech disfluencies such as prolongation and syllable, word, and phrase repetition. The proposed method is based on the Weighted Mel Frequency Cepstral Coefficient feature extraction algorithm and deep-learning Bidirectional Long-Short term Memory neural network for classification of stuttered events. The work has utilized the UCLASS stuttering dataset for analysis. The speech samples of the database are initially preprocessed, manually segmented, and labeled as a type of disfluency. The labeled speech samples are parameterized to Weighted MFCC feature vectors. Then extracted features are inputted to the Bidirectional-LSTM network for training and testing of the model. The effect of different hyper-parameters on classification results is examined. The test results show that the proposed method reaches the best accuracy of 96.67%, as compared to the LSTM model. The promising recognition accuracy of 97.33%, 98.67%, 97.5%, 97.19%, and 97.67% was achieved for the detection of fluent, prolongation, syllable, word, and phrase repetition, respectively.
引用
收藏
页码:345 / 356
页数:12
相关论文
共 50 条
  • [41] A weighted network community detection algorithm based on deep learning
    Li, Shudong
    Jiang, Laiyuan
    Wu, Xiaobo
    Han, Weihong
    Zhao, Dawei
    Wang, Zhen
    APPLIED MATHEMATICS AND COMPUTATION, 2021, 401
  • [42] Improving Sinhala Hate Speech Detection Using Deep Learning
    Gamage, Kavishka
    Welgama, Viraj
    Weerasinghe, Ruvan
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [43] Detection of hate speech in Arabic tweets using deep learning
    Al-Hassan, Areej
    Al-Dossari, Hmood
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1963 - 1974
  • [44] Detection of hate speech in Arabic tweets using deep learning
    Areej Al-Hassan
    Hmood Al-Dossari
    Multimedia Systems, 2022, 28 : 1963 - 1974
  • [45] Hybrid Deep Learning-Based Model for Wind Speed Forecasting Based on DWPT and Bidirectional LSTM Network
    Dolatabadi, Amirhossein
    Abdeltawab, Hussein
    Mohamed, Yasser Abdel-Rady, I
    IEEE ACCESS, 2020, 8 : 229219 - 229232
  • [46] IoT security using deep learning algorithm: intrusion detection model using LSTM
    Lija, Abitha V. K.
    Shobana, R.
    Misbha, J. Caroline
    Chandrakala, S.
    INTERNATIONAL JOURNAL OF ELECTRONIC SECURITY AND DIGITAL FORENSICS, 2025, 17 (1-2)
  • [47] A deep learning based model using RNN-LSTM for the Detection of Schizophrenia from EEG data
    Supakar, Rinku
    Satvaya, Parthasarathi
    Chakrabarti, Prasun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [48] Deep Learning Based Fusion Approach for Hate Speech Detection
    Zhou, Yanling
    Yang, Yanyan
    Liu, Han
    Liu, Xiufeng
    Savage, Nick
    IEEE ACCESS, 2020, 8 : 128923 - 128929
  • [49] Sarcasm detection using enhanced glove and bi-LSTM model based on deep learning techniques
    Anusha, M.
    Leelavathi, R.
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2025, 13 (01)
  • [50] Dementia Detection from Speech Using Machine Learning and Deep Learning Architectures
    Kumar, M. Rupesh
    Vekkot, Susmitha
    Lalitha, S.
    Gupta, Deepa
    Govindraj, Varasiddhi Jayasuryaa
    Shaukat, Kamran
    Alotaibi, Yousef Ajami
    Zakariah, Mohammed
    SENSORS, 2022, 22 (23)