Fuzzy support vector machine with graph for classifying imbalanced datasets

被引:9
|
作者
Chen, Baihua [1 ]
Fan, Yuling [1 ]
Lan, Weiyao [1 ]
Liu, Jinghua [2 ]
Cao, Chao [3 ,4 ]
Gao, Yunlong [1 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361102, Peoples R China
[2] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 361021, Peoples R China
[3] Minist Nat Resources, Inst Oceanog 3, Xiamen 361005, Peoples R China
[4] Fujian Prov Key Lab Marine Ecol Conservat & Restor, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Fuzzy support vector machines; Class imbalance; The curse of dimensionality; Kernel space; Graph; CLASSIFICATION; RECOGNITION; ROBUST; MODELS;
D O I
10.1016/j.neucom.2022.09.139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since support vector machine (SVM) considers all the training samples equally, it suffers from the problems of noise/outliers and class imbalance. Although many fuzzy support vector machines (FSVMs) have been proposed to suppress the effect of noise/outliers and class imbalance, most of them ignore the impact of the curse of dimensionality on the discriminative performance of fuzzy membership function and do not give the fuzzy membership function corresponding to the kernel space, which seriously reduces the performance of FSVM. To solve these problems, we propose the fuzzy support vector machine with graph (GraphFSVM) in this paper. Specifically, we first design a graph-based fuzzy membership function to accurately assess the importance of samples in original feature space and prove that the function can mine discriminative information between samples in high-dimensional data. Additionally, since the data distribution in kernel space is different from those in the original feature space, a method is provided to calculate the fuzzy membership function in the kernel space. Finally, the GraphFSVM model analyzes samples of each class independently, this suppresses the effect of class imbalance. Following the above principles, we design the graph-based fuzzy support vector machine and propose a detailed optimization method. Experimental results on UCI, gene expression, and image datasets show that the GraphFSVM has better generalization and robustness than other state-of-the-art methods.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:296 / 312
页数:17
相关论文
共 50 条
  • [1] Entropy-based fuzzy support vector machine for imbalanced datasets
    Fan, Qi
    Wang, Zhe
    Li, Dongdong
    Gao, Daqi
    Zha, Hongyuan
    KNOWLEDGE-BASED SYSTEMS, 2017, 115 : 87 - 99
  • [2] Classifying Unbalanced Datasets Using Iterative Fuzzy Support Vector Machine
    Kumari, P. Aruna
    Suma, G. Jaya
    HELIX, 2019, 9 (01): : 4802 - 4807
  • [3] Fuzzy Support Vector Machine With Relative Density Information for Classifying Imbalanced Data
    Yu, Hualong
    Sun, Changyin
    Yang, Xibei
    Zheng, Shang
    Zou, Haitao
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (12) : 2353 - 2367
  • [4] Support Vector Machine Failure in Imbalanced Datasets
    Illan, I. A.
    Gorriz, J. M.
    Ramirez, J.
    Martinez-Murcia, F. J.
    Castillo-Barnes, D.
    Segovia, F.
    Salas-Gonzalez, D.
    UNDERSTANDING THE BRAIN FUNCTION AND EMOTIONS, PT I, 2019, 11486 : 412 - 419
  • [5] Combine Vector Quantization and Support Vector Machine for imbalanced datasets
    Yu, Ting
    Debenham, John
    Jan, Tony
    Simoff, Simeon
    ARTIFICIAL INTELLIGENCE IN THEORY AND PRACTICE, 2006, 217 : 81 - +
  • [6] Fuzzy support vector machine using local outlier factor and intuitionistic fuzzy sets for imbalanced datasets
    Hu, Mengya
    Lu, Shaowu
    JOURNAL OF CONTROL AND DECISION, 2024,
  • [7] An improved Support Vector Machine for the classification of imbalanced biological datasets
    Wang, Haiying
    Zheng, Huiru
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 63 - +
  • [8] Constructing support vector machine ensemble with segmentation for imbalanced datasets
    Li, Qian
    Yang, Bing
    Li, Yi
    Deng, Naiyang
    Jing, Ling
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 : S249 - S256
  • [9] Constructing support vector machine ensemble with segmentation for imbalanced datasets
    Qian Li
    Bing Yang
    Yi Li
    Naiyang Deng
    Ling Jing
    Neural Computing and Applications, 2013, 22 : 249 - 256
  • [10] Classification of Imbalanced Datasets using Partition Method and Support Vector Machine
    Awasare, Vinod Kumar
    Gupta, Surendra
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,