Unraveling the complexities of pathological voice through saliency analysis

被引:1
|
作者
Shaikh, Abdullah Abdul Sattar [1 ]
Bhargavi, M. S. [1 ]
Naik, Ganesh R. [2 ]
机构
[1] Bangalore Inst Technol, Dept Comp Sci & Engn, Bangalore 560004, Karnataka, India
[2] Flinders Univ S Australia, Adelaide Inst Sleep Hlth, Adelaide, SA 5042, Australia
关键词
Pathological voice; Saliency analysis; Autoencoders; Multi-class classification; UNet plus plus; AUTOMATIC DETECTION; CLASSIFICATION; SPEECH; IMPAIRMENTS; FEATURES; HEALTHY;
D O I
10.1016/j.compbiomed.2023.107566
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The human voice is an essential communication tool, but various disorders and habits can disrupt it. Diagnosis of pathological and abnormal voices is very important. Conventional diagnosis of these voice pathologies can be invasive and costly. Voice pathology disorders can be effectively detected using Artificial Intelligence and computer-aided voice pathology classification tools. Previous studies focused primarily on binary classification, leaving limited attention to multi-class classification. This study proposes three different neural network architectures to investigate the feature characteristics of three voice pathologies-Hyperkinetic Dysphonia, Hypokinetic Dysphonia, Reflux Laryngitis, and healthy voices using multi-class classification and the Voice ICar fEDerico II (VOICED) dataset. The study proposes UNet++ autoencoder-based denoiser techniques for accurate feature extraction to overcome noisy data. The architectures include a Multi-Layer Perceptron (MLP) trained on structured feature sets, a Short-Time Fourier Transform (STFT) model, and a Mel-Frequency Cepstral Coefficients (MFCC) model. The MLP model on 143 features achieved 97.1% accuracy, while the STFT model showed similar performance with increased sensitivity of 99.8%. The MFCC model maintained 97.1% accuracy but with a smaller model size and improved accuracy on the Reflux Laryngitis class. The study identifies crucial features through saliency analysis and reveals that detecting voice abnormalities requires the identification of regions of inaudible high-pitch sounds. Additionally, the study highlights the challenges posed by limited and disjointed pathological voice databases and proposes solutions for enhancing the performance of voice abnormality classification. Overall, the study's findings have potential applications in clinical applications and specialized audio-capturing tools.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Pathological Voice Analysis via Digital Signal Processing
    Lo Bue, Francesco
    Galioto, Natale
    Giaconia, Costantino
    APPLICATIONS IN ELECTRONICS PERVADING INDUSTRY, ENVIRONMENT AND SOCIETY, APPLEPIES 2014, 2016, 351 : 185 - 193
  • [32] Pitch deviation analysis of pathological voice in connected speech
    Laflen, J. Brandon
    Lazarus, Cathy L.
    Amin, Milan R.
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2008, 117 (02): : 90 - 97
  • [33] A Case of Complexities in the Diagnosis of Pathological Crying
    Christenson, Eric
    Bobrin, Brad
    JOURNAL OF NEUROPSYCHIATRY AND CLINICAL NEUROSCIENCES, 2012, 24 (02) : 20 - 20
  • [34] Unraveling the complexities of drought stress in cotton: a multifaceted analysis of selection criteria and breeding approaches
    Goren, Hatice Kubra
    Tan, Ugur
    PEERJ, 2024, 12
  • [35] Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics
    Henriquez, Patricia
    Alonso, Jesus B.
    Ferrer, Miguel A.
    Travieso, Carlos M.
    Godino-Llorente, Juan I.
    Diaz-de-Maria, Fernando
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06): : 1186 - 1195
  • [36] Pathological voice assessment
    Dibazar, Alireza A.
    Berger, Theodore W.
    Narayanan, Shrikanth S.
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 3144 - +
  • [37] Passing through the Past to Future of Placenta Accreta Spectrum (PAS): Unraveling the Complexities and Hidden Facets of PAS
    Shamshirsaz, Alireza A.
    Silver, Robert
    AMERICAN JOURNAL OF PERINATOLOGY, 2023, 40 (09) : 960 - 961
  • [38] Unraveling the Complexities Leading to Health Inequities: A Critical Ethnography
    Makie, Kawabata
    Gastaldo, Denise
    INTERNATIONAL JOURNAL OF QUALITATIVE METHODS, 2010, 9 (04): : 367 - 368
  • [39] Nature as stereochemist: Unraveling the complexities of polyketide antibiotic biosynthesis
    Cane, David E.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
  • [40] Unraveling the Complexities of Tuberculosis and Sarcoidosis: Insights into Diagnosis and Differentiation
    Kiani, Arda
    Soltani, Pegah
    Moradkhani, Azadeh
    Abedini, Atefeh
    BIOMEDICAL AND BIOTECHNOLOGY RESEARCH JOURNAL, 2024, 8 : S7 - S7