Fuzzy support vector machine with graph for classifying imbalanced datasets

Cited by: 9
Authors
Chen, Baihua [1]
Fan, Yuling [1]
Lan, Weiyao [1]
Liu, Jinghua [2]
Cao, Chao [3, 4]
Gao, Yunlong [1]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361102, Peoples R China
[2] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 361021, Peoples R China
[3] Minist Nat Resources, Inst Oceanog 3, Xiamen 361005, Peoples R China
[4] Fujian Prov Key Lab Marine Ecol Conservat & Restor, Xiamen 361005, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Fuzzy support vector machines; Class imbalance; The curse of dimensionality; Kernel space; Graph; CLASSIFICATION; RECOGNITION; ROBUST; MODELS;
DOI
10.1016/j.neucom.2022.09.139
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Because the support vector machine (SVM) treats all training samples equally, it is sensitive to noise/outliers and class imbalance. Although many fuzzy support vector machines (FSVMs) have been proposed to suppress the effect of noise/outliers and class imbalance, most of them ignore the impact of the curse of dimensionality on the discriminative performance of the fuzzy membership function and do not give the fuzzy membership function corresponding to the kernel space, which seriously reduces the performance of FSVM. To solve these problems, we propose the fuzzy support vector machine with graph (GraphFSVM) in this paper. Specifically, we first design a graph-based fuzzy membership function to accurately assess the importance of samples in the original feature space and prove that the function can mine discriminative information between samples in high-dimensional data. Additionally, since the data distribution in the kernel space differs from that in the original feature space, a method is provided to calculate the fuzzy membership function in the kernel space. Finally, the GraphFSVM model analyzes the samples of each class independently, which suppresses the effect of class imbalance. Following these principles, we design the graph-based fuzzy support vector machine and propose a detailed optimization method. Experimental results on UCI, gene expression, and image datasets show that GraphFSVM has better generalization and robustness than other state-of-the-art methods. © 2022 Elsevier B.V. All rights reserved.
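The abstract does not state the GraphFSVM membership formula, so the following is only a minimal sketch of the general idea it describes: a neighbour-graph-based fuzzy membership, computed separately for each class, with distances measured in the kernel space. The kernel-space distance uses the standard identity ||phi(a) - phi(b)||^2 = k(a,a) - 2k(a,b) + k(b,b). The function names, the RBF kernel, the value of `gamma`, the choice of `k`, and the median-ratio weighting rule are all assumptions made for illustration and are not taken from the paper.

```python
import math
from statistics import median

def rbf_kernel(a, b, gamma=0.5):
    """RBF kernel k(a, b) = exp(-gamma * ||a - b||^2); gamma is an assumed value."""
    sq = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * sq)

def kernel_distance(a, b, kernel):
    """Distance in the kernel-induced feature space, computed via the kernel trick."""
    sq = kernel(a, a) - 2.0 * kernel(a, b) + kernel(b, b)
    return math.sqrt(max(sq, 0.0))  # clamp tiny negative rounding errors

def knn_memberships(X, y, k=2, kernel=rbf_kernel):
    """Hypothetical per-class fuzzy membership in (0, 1].

    For each sample, take the mean kernel-space distance to its k nearest
    neighbours of the SAME class. Samples at or below the class median get
    membership 1; more isolated samples get median / own_distance.
    """
    memberships = [1.0] * len(X)
    for label in set(y):
        idx = [i for i, lab in enumerate(y) if lab == label]
        mean_d = {}
        for i in idx:
            dists = sorted(kernel_distance(X[i], X[j], kernel) for j in idx if j != i)
            nearest = dists[:k]
            mean_d[i] = sum(nearest) / len(nearest) if nearest else 0.0
        class_median = median(mean_d.values())
        for i in idx:
            if mean_d[i] > 0.0:
                memberships[i] = min(1.0, class_median / mean_d[i])
    return memberships

# Toy imbalanced data: five majority samples (the last one is an outlier)
# and two minority samples.
X = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1], [3.0, 3.0],
     [1.0, 1.0], [1.1, 1.0]]
y = [0, 0, 0, 0, 0, 1, 1]

weights = knn_memberships(X, y, k=2)
for xi, yi, wi in zip(X, y, weights):
    print(yi, xi, round(wi, 3))
```

Because each class is normalised by its own median, the minority samples are not penalised merely for having few neighbours, while the majority-class outlier receives a small weight. Weights of this kind could then be supplied as per-sample weights to an SVM solver, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`.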
Pages: 296-312
Number of pages: 17