Diagnosis of pathological speech with streamlined features for long short-term memory learning

被引：4

作者：

Pham, Tuan D. ^{[1
]}

Holmes, Simon B. ^{[1
]}

Zou, Lifong ^{[1
]}

Patel, Mangala ^{[1
]}

Coulthard, Paul ^{[1
]}

机构：

[1] Queen Mary Univ London, Barts & London Fac Med & Dent, Turner St, London E1 2AD, England

来源：

COMPUTERS IN BIOLOGY AND MEDICINE | 2024年 / 170卷

关键词：

Pathological voice; Diagnosis; Feature extraction; Deep learning; Artificial intelligence; PARKINSONS-DISEASE; WAVE-PROPAGATION; SAMPLING THEORY; CLASSIFICATION; SCATTERING;

D O I：

10.1016/j.compbiomed.2024.107976

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Background: Pathological speech diagnosis is crucial for identifying and treating various speech disorders. Accurate diagnosis aids in developing targeted intervention strategies, improving patients' communication abilities, and enhancing their overall quality of life. With the rising incidence of speech -related conditions globally, including oral health, the need for efficient and reliable diagnostic tools has become paramount, emphasizing the significance of advanced research in this field. Methods: This paper introduces novel features for deep learning in the analysis of short voice signals. It proposes the incorporation of time -space and time-frequency features to accurately discern between two distinct groups: Individuals exhibiting normal vocal patterns and those manifesting pathological voice conditions. These advancements aim to enhance the precision and reliability of diagnostic procedures, paving the way for more targeted treatment approaches. Results: Utilizing a publicly available voice database, this study carried out training and validation using long short-term memory (LSTM) networks learning on the combined features, along with a data balancing strategy. The proposed approach yielded promising performance metrics: 90% accuracy, 93% sensitivity, 87% specificity, 88% precision, an F1 score of 0.90, and an area under the receiver operating characteristic curve of 0.96. The results surpassed those obtained by the networks trained using wavelet -time scattering coefficients, as well as several algorithms trained with alternative feature types. Conclusions: The incorporation of time-frequency and time -space features extracted from short segments of voice signals for LSTM learning demonstrates significant promise as an AI tool for the diagnosis of speech pathology. The proposed approach has the potential to enhance the accuracy and allow for real-time pathological speech assessment, thereby facilitating more targeted and effective therapeutic interventions.

引用

页数：14

共 50 条

[31] APPLICATION RESEARCH ON LONG SHORT-TERM MEMORY NETWORK IN FAULT DIAGNOSIS
Wang, Wei-Feng
Qiu, Xue-Huan
Chen, Cai-Sen
Lin, Bo
Zhang, Hui-Min
PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 360 - 365
[32] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
Peng Chen
Rong Wang
Yibin Yao
Hao Chen
Zhihao Wang
Zhiyuan An
Journal of Geodesy, 2023, 97
[33] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
Chen, Peng
Wang, Rong
Yao, Yibin
Chen, Hao
Wang, Zhihao
An, Zhiyuan
JOURNAL OF GEODESY, 2023, 97 (05)
[34] QUANTUM LONG SHORT-TERM MEMORY
Chen, Samuel Yen-Chi
Yoo, Shinjae
Fang, Yao-Lung L.
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8622 - 8626
[35] LIPREADING WITH LONG SHORT-TERM MEMORY
Wand, Michael
Koutnik, Jan
Schmidhuber, Jurgen
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6115 - 6119
[36] Associative Long Short-Term Memory
Danihelka, Ivo
Wayne, Greg
Uria, Benigno
Kalchbrenner, Nal
Graves, Alex
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[37] L2 SPEECH LEARNING IN ADULTHOOD AND PHONOLOGICAL SHORT-TERM MEMORY
Aliaga-Garcia, Cristina
Mora, Joan C.
Cervino-Povedano, Eva
POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2011, 47 (01): : 1 - 14
[38] SHORT-TERM VERBAL MEMORY AND LEARNING
PETERSON, LR
PSYCHOLOGICAL REVIEW, 1966, 73 (03) : 193 - &
[39] Deep Chronnectome Learning via Full Bidirectional Long Short-Term Memory Networks for MCI Diagnosis
Yan, Weizheng
Zhang, Han
Sui, Jing
Shen, Dinggang
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, PT III, 2018, 11072 : 249 - 257
[40] The contribution of short-term memory for sound features to speech-in-noise perception and cognition
Lad, Meher
Taylor, John -Paul
Griffiths, Timothy D.
HEARING RESEARCH, 2024, 451

← 1 2 3 4 5 →