Using SincNet for Learning Pathological Voice Disorders

被引:9
|
作者
Hung, Chao-Hsiang [1 ]
Wang, Syu-Siang [1 ]
Wang, Chi-Te [2 ]
Fang, Shih-Hau [1 ]
机构
[1] Yuan Ze Univ, Dept Elect Engn, Taoyuan 320, Taiwan
[2] Far Eastern Mem Hosp, Dept Otolaryngol Head & Neck Surg, New Taipei 220, Taiwan
关键词
pathological voice; classification; sinc functions; convolutional neural network; SincNet;
D O I
10.3390/s22176634
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Deep learning techniques such as convolutional neural networks (CNN) have been successfully applied to identify pathological voices. However, the major disadvantage of using these advanced models is the lack of interpretability in explaining the predicted outcomes. This drawback further introduces a bottleneck for promoting the classification or detection of voice-disorder systems, especially in this pandemic period. In this paper, we proposed using a series of learnable sinc functions to replace the very first layer of a commonly used CNN to develop an explainable SincNet system for classifying or detecting pathological voices. The applied sinc filters, a front-end signal processor in SincNet, are critical for constructing the meaningful layer and are directly used to extract the acoustic features for following networks to generate high-level voice information. We conducted our tests on three different Far Eastern Memorial Hospital voice datasets. From our evaluations, the proposed approach achieves the highest 7%-accuracy and 9%-sensitivity improvements from conventional methods and thus demonstrates superior performance in predicting input pathological waveforms of the SincNet system. More importantly, we intended to give possible explanations between the system output and the first-layer extracted speech features based on our evaluated results.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Analysis and Detection of Pathological Voice Using Glottal Source Features
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 367 - 379
  • [22] Pathological voice quality assessment using artificial neural networks
    Ritchings, RT
    McGillion, M
    Moore, CJ
    MEDICAL ENGINEERING & PHYSICS, 2002, 24 (7-8) : 561 - 564
  • [23] Improving the recognition of pathological voice using the discriminant HLDA transformation
    Lachhab, Othman
    Di Martino, Joseph
    Ibn Elhaj, El Hassane
    Hammouch, Ahmed
    2014 THIRD IEEE INTERNATIONAL COLLOQUIUM IN INFORMATION SCIENCE AND TECHNOLOGY (CIST'14), 2014, : 370 - 373
  • [24] Discrimination between pathological voice categories using Matching Pursuit
    Kumar, Ashwini Jaya
    Daoudi, Khalid
    2015 4TH INTERNATIONAL WORK CONFERENCE ON BIOINSPIRED INTELLIGENCE (IWOBI), 2015, : 215 - 218
  • [25] Pathological voice detection using efficient combination of heterogeneous features
    Lee, Ji-Yeoun
    Jeong, Sangbae
    Hahn, Minsoo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (02): : 367 - 370
  • [26] Unravelling Voice Tremor in Movement Disorders: A Machine Learning Study
    Asci, F.
    Di Leo, P.
    Ruoppolo, G.
    Saggio, G.
    Costantini, G.
    Berardelli, A.
    Suppa, A.
    MOVEMENT DISORDERS, 2021, 36 : S128 - S128
  • [27] Identification of Voice Disorders: A Comparative Study of Machine Learning Algorithms
    Coelho, Sharal
    Shashirekha, Hosahalli Lakshmaiah
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 565 - 578
  • [28] A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders
    Hegde, Sarika
    Shetty, Surendra
    Rai, Smitha
    Dodderi, Thejaswi
    JOURNAL OF VOICE, 2019, 33 (06) : 947.e11 - 947.e33
  • [29] Using ambulatory voice monitoring to investigate common voice disorders: research update
    Mehta, Daryush D.
    Van Stan, Jarrad H.
    Zanartu, Matias
    Ghassemi, Marzyeh
    Gttuag, John, V
    Espinoza, Victor M.
    Cortes, Juan P.
    Cheyne, Harold A., II
    Hillman, Robert E.
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2015, 3
  • [30] Clinical Usefulness of Voice Recordings using a Smartphone as a Screening Tool for Voice Disorders
    Lee, Seung Jin
    Lee, Kwang Yong
    Choi, Hong-Shik
    COMMUNICATION SCIENCES AND DISORDERS-CSD, 2018, 23 (04): : 1065 - 1077