Class-Imbalanced Voice Pathology Detection and Classification Using Fuzzy Cluster Oversampling Method

被引:18
|
作者
Fan, Ziqi [1 ]
Wu, Yuanbo [1 ]
Zhou, Changwei [1 ]
Zhang, Xiaojun [1 ]
Tao, Zhi [1 ]
机构
[1] Soochow Univ, Sch Optoelect Sci & Engn, Suzhou 215000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 08期
基金
中国国家自然科学基金;
关键词
imbalanced learning; voice pathology detection and classification; SMOTE; intelligence medical diagnosis system; SMOTE;
D O I
10.3390/app11083450
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The Massachusetts Eye and Ear Infirmary (MEEI) database is an international-standard training database for voice pathology detection (VPD) systems. However, there is a class-imbalanced distribution in normal and pathological voice samples and different types of pathological voice samples in the MEEI database. This study aimed to develop a VPD system that uses the fuzzy clustering synthetic minority oversampling technique algorithm (FC-SMOTE) to automatically detect and classify four types of pathological voices in a multi-class imbalanced database. The proposed FC-SMOTE algorithm processes the initial class-imbalanced dataset. A set of machine learning models was evaluated and validated using the resulting class-balanced dataset as an input. The effectiveness of the VPD system with FC-SMOTE was further verified by an external validation set and another pathological voice database (Saarbruecken Voice Database (SVD)). The experimental results show that, in the multi-classification of pathological voice for the class-imbalanced dataset, the method we propose can significantly improve the diagnostic accuracy. Meanwhile, FC-SMOTE outperforms the traditional imbalanced data oversampling algorithms, and it is preferred for imbalanced voice diagnosis in practical applications.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] A Cluster-Based Under-Sampling Algorithm for Class-Imbalanced Data
    Guzman-Ponce, A.
    Valdovinos, R. M.
    Sanchez, J. S.
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 299 - 311
  • [42] A Cost-Sensitive Ensemble Method for Class-Imbalanced Datasets
    Zhang, Yong
    Wang, Dapeng
    ABSTRACT AND APPLIED ANALYSIS, 2013,
  • [43] Oversampling for Imbalanced Data Classification Using Adversarial Network
    Lee, Sang-Kwang
    Hong, Seung-Jin
    Yang, Seong-Il
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1255 - 1257
  • [44] MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network
    Ali-Gombe, Adamu
    Elyan, Eyad
    NEUROCOMPUTING, 2019, 361 : 212 - 221
  • [45] Fuzzy voice segment classifier for voice pathology classification
    School of Mechatronic Engineering, Universiti Malaysia Perlis, Perlis, Malaysia
    不详
    Proc. - CSPA: Int. Colloq. Signal Process. Appl., (190-195):
  • [46] A novel oversampling method based on Wasserstein CGAN for imbalanced classification
    Zhou, Hongfang
    Pan, Heng
    Zheng, Kangyun
    Wu, Zongling
    Xiang, Qingyu
    CYBERSECURITY, 2025, 8 (01):
  • [47] Using Modulation Spectra for Voice Pathology Detection and Classification
    Markaki, Maria
    Stylianou, Yannis
    2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 2514 - 2517
  • [48] A novel oversampling method based on SeqGAN for imbalanced text classification
    Luo, Yin
    Weng, Xuanlong
    Zheng, Huang
    Feng, Haishan
    Luang, Ke
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2891 - 2894
  • [49] DETECTION AND CLASSIFICATION OF VOICE PATHOLOGY USING FEATURE SELECTION
    Al Mojaly, Malak
    Muhammad, Ghulam
    Alsulaiman, Mansour
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 571 - 577
  • [50] Feature selection and classification by minimizing overlap degree for class-imbalanced data in metabolomics
    Fu, Guang-Hui
    Wu, Yuan-Jiao
    Zong, Min-Jie
    Yi, Lun-Zhao
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 196