Diagnosis of pathological speech with streamlined features for long short-term memory learning

被引:4
|
作者
Pham, Tuan D. [1 ]
Holmes, Simon B. [1 ]
Zou, Lifong [1 ]
Patel, Mangala [1 ]
Coulthard, Paul [1 ]
机构
[1] Queen Mary Univ London, Barts & London Fac Med & Dent, Turner St, London E1 2AD, England
关键词
Pathological voice; Diagnosis; Feature extraction; Deep learning; Artificial intelligence; PARKINSONS-DISEASE; WAVE-PROPAGATION; SAMPLING THEORY; CLASSIFICATION; SCATTERING;
D O I
10.1016/j.compbiomed.2024.107976
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Pathological speech diagnosis is crucial for identifying and treating various speech disorders. Accurate diagnosis aids in developing targeted intervention strategies, improving patients' communication abilities, and enhancing their overall quality of life. With the rising incidence of speech -related conditions globally, including oral health, the need for efficient and reliable diagnostic tools has become paramount, emphasizing the significance of advanced research in this field. Methods: This paper introduces novel features for deep learning in the analysis of short voice signals. It proposes the incorporation of time -space and time-frequency features to accurately discern between two distinct groups: Individuals exhibiting normal vocal patterns and those manifesting pathological voice conditions. These advancements aim to enhance the precision and reliability of diagnostic procedures, paving the way for more targeted treatment approaches. Results: Utilizing a publicly available voice database, this study carried out training and validation using long short-term memory (LSTM) networks learning on the combined features, along with a data balancing strategy. The proposed approach yielded promising performance metrics: 90% accuracy, 93% sensitivity, 87% specificity, 88% precision, an F1 score of 0.90, and an area under the receiver operating characteristic curve of 0.96. The results surpassed those obtained by the networks trained using wavelet -time scattering coefficients, as well as several algorithms trained with alternative feature types. Conclusions: The incorporation of time-frequency and time -space features extracted from short segments of voice signals for LSTM learning demonstrates significant promise as an AI tool for the diagnosis of speech pathology. The proposed approach has the potential to enhance the accuracy and allow for real-time pathological speech assessment, thereby facilitating more targeted and effective therapeutic interventions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Learning Term Weight with Long Short-Term Memory for Question Retrieval
    Huang, Xifeng
    Dai, Xiang
    CHINESE LEXICAL SEMANTICS, CLSW 2018, 2018, 11173 : 615 - 622
  • [22] Learning Sparse Hidden States in Long Short-Term Memory
    Yu, Niange
    Weber, Cornelius
    Hu, Xiaolin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 288 - 298
  • [23] Speaker-Aware Long Short-Term Memory Multi-Task Learning for Speech Recognition
    Pironkov, Gueorgui
    Dupont, Stephane
    Dutoit, Thierry
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1911 - 1915
  • [24] ROLE OF SPEECH RESPONSES IN SHORT-TERM MEMORY
    MURRAY, DJ
    CANADIAN JOURNAL OF PSYCHOLOGY, 1967, 21 (03): : 263 - 263
  • [25] Dissociable components of short-term memory and their relation to long-term learning
    Freedman, ML
    Martin, RC
    COGNITIVE NEUROPSYCHOLOGY, 2001, 18 (03) : 193 - 226
  • [26] Emotion Recognition From Speech and Text using Long Short-Term Memory
    Venkateswarlu, Sonagiri China
    Jeevakala, Siva Ramakrishna
    Kumar, Naluguru Udaya
    Munaswamy, Pidugu
    Pendyala, Dhanalaxmi
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (04) : 11166 - 11169
  • [27] Speech Emotion Recognition for Indonesian Language Using Long Short-Term Memory
    Lasiman, Jeremia Jason
    Lestari, Dessi Puji
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 40 - 43
  • [28] Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition
    Oruh, Jane
    Viriri, Serestina
    Adegun, Adekanmi
    IEEE ACCESS, 2022, 10 : 30069 - 30079
  • [29] Detecting Overlapping Speech with Long Short-Term Memory Recurrent Neural Networks
    Geiger, Juergen T.
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1667 - 1671
  • [30] Short-term Load Forecasting with Distributed Long Short-Term Memory
    Dong, Yi
    Chen, Yang
    Zhao, Xingyu
    Huang, Xiaowei
    2023 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE, ISGT, 2023,