Instance-based entropy fuzzy support vector machine for imbalanced data

被引:0
|
作者
Poongjin Cho
Minhyuk Lee
Woojin Chang
机构
[1] Seoul National University,Department of Industrial Engineering
[2] Samsung Electronics,Big Data Analytics Group, Mobile Communications Business
来源
关键词
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Imbalanced classification has been a major challenge for machine learning because many standard classifiers mainly focus on balanced datasets and tend to have biased results toward the majority class. We modify entropy fuzzy support vector machine (EFSVM) and introduce instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of k-nearest neighbors to determine the fuzzy membership value for each sample which prioritizes the importance of each sample. IEFSVM considers the diversity of entropy patterns for each sample when increasing the size of neighbors, k, while EFSVM uses single entropy information of the fixed size of neighbors for all samples. By varying k, we can reflect the component change of sample’s neighbors from near to far distance in the determination of fuzzy value membership. Numerical experiments on 35 public and 12 real-world imbalanced datasets are performed to validate IEFSVM, and area under the receiver operating characteristic curve (AUC) is used to compare its performance with other SVMs and machine learning methods. IEFSVM shows a much higher AUC value for datasets with high imbalance ratio, implying that IEFSVM is effective in dealing with the class imbalance problem.
引用
收藏
页码:1183 / 1202
页数:19
相关论文
共 50 条
  • [1] Instance-based entropy fuzzy support vector machine for imbalanced data
    Cho, Poongjin
    Lee, Minhyuk
    Chang, Woojin
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1183 - 1202
  • [2] Application of Instance-Based Entropy Fuzzy Support Vector Machine in Peer-To-Peer Lending Investment Decision
    Cho, Poongjin
    Chang, Woojin
    Song, Jae Wook
    IEEE ACCESS, 2019, 7 : 16925 - 16939
  • [3] Entropy-based fuzzy support vector machine for imbalanced datasets
    Fan, Qi
    Wang, Zhe
    Li, Dongdong
    Gao, Daqi
    Zha, Hongyuan
    KNOWLEDGE-BASED SYSTEMS, 2017, 115 : 87 - 99
  • [4] Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification
    Ladayya, Faroh
    Purnami, Santi Wulan
    Irhamah
    13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017), 2017, 1905
  • [5] Fuzzy support vector machine for imbalanced data with borderline noise
    Liu, Jie
    Fuzzy Sets and Systems, 2021, 413 : 64 - 73
  • [6] Fuzzy support vector machine for imbalanced data with borderline noise
    Liu, Jie
    FUZZY SETS AND SYSTEMS, 2021, 413 : 64 - 73
  • [7] Affective detection based on an imbalanced fuzzy support vector machine
    Cheng, Jing
    Liu, Guang-Yuan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 18 : 118 - 126
  • [8] Dense fuzzy support vector machine to binary classification for imbalanced data
    Wang, Qingling
    Zheng, Jian
    Zhang, Wenjing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9643 - 9653
  • [9] Affinity and class probability-based fuzzy support vector machine for imbalanced data sets
    Tao, Xinmin
    Li, Qing
    Ren, Chao
    Guo, Wenjie
    He, Qing
    Liu, Rui
    Zou, Junrong
    NEURAL NETWORKS, 2020, 122 (122) : 289 - 307
  • [10] Fuzzy Support Vector Machine With Relative Density Information for Classifying Imbalanced Data
    Yu, Hualong
    Sun, Changyin
    Yang, Xibei
    Zheng, Shang
    Zou, Haitao
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (12) : 2353 - 2367