Using SincNet for Learning Pathological Voice Disorders

Cited by: 9

Authors:

Hung, Chao-Hsiang [1]

Wang, Syu-Siang [1]

Wang, Chi-Te [2]

Fang, Shih-Hau [1]

Affiliations:

[1] Yuan Ze Univ, Dept Elect Engn, Taoyuan 320, Taiwan

[2] Far Eastern Mem Hosp, Dept Otolaryngol Head & Neck Surg, New Taipei 220, Taiwan

Source:

SENSORS | 2022 / Volume 22 / Issue 17

Keywords:

pathological voice; classification; sinc functions; convolutional neural network; SincNet;

DOI:

10.3390/s22176634

Chinese Library Classification number:

O65 [Analytical Chemistry];

Discipline classification code:

070302 ; 081704 ;

Abstract:

Deep learning techniques such as convolutional neural networks (CNNs) have been successfully applied to identify pathological voices. However, a major disadvantage of these models is their lack of interpretability: they offer little explanation of the predicted outcomes. This drawback creates a bottleneck for promoting voice-disorder classification and detection systems, especially during the pandemic. In this paper, we propose replacing the first layer of a commonly used CNN with a series of learnable sinc functions to develop an explainable SincNet system for classifying or detecting pathological voices. The sinc filters, which act as the front-end signal processor in SincNet, are critical for constructing a meaningful first layer, and they directly extract the acoustic features from which the following network layers generate high-level voice information. We conducted our tests on three different Far Eastern Memorial Hospital voice datasets. In our evaluations, the proposed approach improves accuracy by up to 7% and sensitivity by up to 9% over conventional methods, demonstrating the superior performance of the SincNet system in predicting pathological voices from input waveforms. More importantly, we offer possible explanations of the relationship between the system output and the speech features extracted by the first layer, based on our evaluation results.
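As a rough illustration of what a "learnable sinc function" in the first layer can look like, the sketch below builds one band-pass kernel as the difference of two sinc low-pass filters, so that the whole kernel is determined by just two numbers (a low cutoff and a bandwidth). This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the Hamming window, the kernel length of 101 taps, and the 16 kHz sample rate are all illustrative choices that do not come from this record.

```python
import math


def _ideal_lowpass(fc, n):
    """Impulse response 2*fc*sinc(2*pi*fc*n) of an ideal low-pass filter.

    fc is the cutoff in cycles per sample; n is the integer tap offset.
    """
    if n == 0:
        return 2.0 * fc
    return math.sin(2.0 * math.pi * fc * n) / (math.pi * n)


def sinc_bandpass(f_low_hz, bandwidth_hz, kernel_size=101, sample_rate=16000):
    """Windowed band-pass FIR kernel defined only by two cutoff parameters.

    In a trainable layer, f_low_hz and bandwidth_hz would be the learned
    parameters; here they are plain numbers (assumption for illustration).
    """
    f1 = f_low_hz / sample_rate                   # normalized low cutoff
    f2 = (f_low_hz + bandwidth_hz) / sample_rate  # normalized high cutoff
    half = (kernel_size - 1) // 2
    kernel = []
    for i in range(kernel_size):
        n = i - half
        # Hamming window (an assumed choice) to smooth the truncated sinc
        window = 0.54 - 0.46 * math.cos(2.0 * math.pi * i / (kernel_size - 1))
        kernel.append((_ideal_lowpass(f2, n) - _ideal_lowpass(f1, n)) * window)
    return kernel


def gain(kernel, freq_hz, sample_rate=16000):
    """Magnitude response of the kernel at a single frequency (direct DTFT)."""
    w = 2.0 * math.pi * freq_hz / sample_rate
    re = sum(h * math.cos(w * k) for k, h in enumerate(kernel))
    im = sum(h * math.sin(w * k) for k, h in enumerate(kernel))
    return math.hypot(re, im)


# Example: a filter passing roughly 1-3 kHz
band = sinc_bandpass(f_low_hz=1000, bandwidth_hz=2000)
in_band = gain(band, 2000)    # close to 1 inside the pass band
out_band = gain(band, 6000)   # close to 0 well outside it
```

Because each filter is described by its two cutoffs alone, inspecting the learned values shows directly which frequency bands the first layer attends to, which is the kind of interpretability the abstract refers to.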


Pages: 18

