Instance-based entropy fuzzy support vector machine for imbalanced data

被引:0
|
作者
Poongjin Cho
Minhyuk Lee
Woojin Chang
机构
[1] Seoul National University,Department of Industrial Engineering
[2] Samsung Electronics,Big Data Analytics Group, Mobile Communications Business
来源
关键词
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Imbalanced classification has been a major challenge for machine learning because many standard classifiers mainly focus on balanced datasets and tend to have biased results toward the majority class. We modify entropy fuzzy support vector machine (EFSVM) and introduce instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of k-nearest neighbors to determine the fuzzy membership value for each sample which prioritizes the importance of each sample. IEFSVM considers the diversity of entropy patterns for each sample when increasing the size of neighbors, k, while EFSVM uses single entropy information of the fixed size of neighbors for all samples. By varying k, we can reflect the component change of sample’s neighbors from near to far distance in the determination of fuzzy value membership. Numerical experiments on 35 public and 12 real-world imbalanced datasets are performed to validate IEFSVM, and area under the receiver operating characteristic curve (AUC) is used to compare its performance with other SVMs and machine learning methods. IEFSVM shows a much higher AUC value for datasets with high imbalance ratio, implying that IEFSVM is effective in dealing with the class imbalance problem.
引用
收藏
页码:1183 / 1202
页数:19
相关论文
共 50 条
  • [31] Intuitionistic fuzzy twin support vector machines for imbalanced data
    Rezvani, Salim
    Wang, Xizhao
    NEUROCOMPUTING, 2022, 507 : 16 - 25
  • [32] Combine Sampling Support Vector Machine for Imbalanced Data Classification
    Sain, Hartayuni
    Purnami, Santi Wulan
    THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 59 - 66
  • [33] Integration of feature vector selection and support vector machine for classification of imbalanced data
    Liu, Jie
    Zio, Enrico
    APPLIED SOFT COMPUTING, 2019, 75 : 702 - 711
  • [34] Kernel local outlier factor-based fuzzy support vector machine for imbalanced classification
    Wang, Kefan
    An, Jing
    Yu, Zibo
    Yin, Xingshu
    Ma, Chao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13):
  • [35] Kernel local outlier factor-based fuzzy support vector machine for imbalanced classification
    Wang, Kefan
    An, Jing
    Yu, Zibo
    Yin, Xingshu
    Ma, Chao
    Concurrency and Computation: Practice and Experience, 2021, 33 (13)
  • [36] A new sampling method for classifying imbalanced data based on support vector machine ensemble
    Jian, Chuanxia
    Gao, Jian
    Ao, Yinhui
    NEUROCOMPUTING, 2016, 193 : 115 - 122
  • [37] Instance categorization by support vector machines to adjust weights in AdaBoost for imbalanced data classification
    Lee, Wonji
    Jun, Chi-Hyuck
    Lee, Jong-Seok
    INFORMATION SCIENCES, 2017, 381 : 92 - 103
  • [38] Fuzzy Support Vector Machine with Imbalanced regulator and its Application in stroke Classification
    Zhang, Xueying
    Wei, Xin
    Li, Fenglian
    Hu, Fengyun
    Jia, Wenhui
    Wang, Chao
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 290 - 295
  • [39] Fuzzy classifier based on fuzzy support vector machine
    Ji, Ai-bing
    Chen, Songcan
    Hua, Qiang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (01) : 421 - 430
  • [40] Bearing fault diagnosis based on adaptive mutiscale fuzzy entropy and support vector machine
    Li, Yongbo
    Xu, Minqiang
    Wei, Yu
    Huang, Wenhu
    JOURNAL OF VIBROENGINEERING, 2015, 17 (03) : 1188 - 1202