Krein twin support vector machines for imbalanced data classification

被引:4
|
作者
Jimenez-Castano, C. [1 ]
Alvarez-Meza, A. [2 ]
Cardenas-Pena, D. [1 ]
Orozco-Gutierrez, A. [1 ]
Guerrero-Erazo, J. [3 ]
机构
[1] Univ Tecnol Pereira, Automat Res Grp, Pereira, Colombia
[2] Univ Nacl Colombia, Signal Proc & Recognit Grp, Manizales, Colombia
[3] Univ Tecnol Pereira, Water & Sanitat Res Grp, Pereira, Colombia
关键词
Imbalanced classification; Krein spaces; Kernel methods; Support vector machines;
D O I
10.1016/j.patrec.2024.03.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional classification assumes a balanced sample distribution among classes. However, such a premise leads to biased performance over the majority class (with the highest number of instances). The Twin Support Vector Machines (TWSVM) obtained great prominence due to their low computational burden compared to the standard SVM. Besides, traditional machine learning seeks methods whose solution depends on a convex problem or semi-positive definite similarity matrices. Yet, this kind of matrix cannot adequately represent many real-world applications. The above defines the need to use non-negative measures as an indefinite function in a Reproducing Kernel Krein Space (RKKS). This paper proposes a novel approach called Krein Twin Support Vector Machines (KTSVM), which appropriately incorporates indefinite kernels within a TWSVM-based gradient optimization. To code pertinent input patterns within an imbalanced data discrimination, our KTSVM employs an implicit mapping to a RKKS. Also, our approach takes advantage of the TWSVM scheme's benefits by creating two parallel hyperplanes. This promotes the KTSVM optimization in a gradient-descent framework. Results obtained on synthetic and real-world datasets demonstrate that our solution performs better in terms of imbalanced data classification than state-of-the-art techniques.
引用
收藏
页码:39 / 45
页数:7
相关论文
共 50 条
  • [31] Near-Bayesian Support Vector Machines for imbalanced data classification with equal or unequal misclassification costs
    Datta, Shounak
    Das, Swagatam
    NEURAL NETWORKS, 2015, 70 : 39 - 52
  • [32] Classifying Remote Sensing Data with Support Vector Machines and Imbalanced Training Data
    Waske, Bjorn
    Benediktsson, Jon Atli
    Sveinsson, Johannes R.
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2009, 5519 : 375 - 384
  • [33] Combine Sampling Support Vector Machine for Imbalanced Data Classification
    Sain, Hartayuni
    Purnami, Santi Wulan
    THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 59 - 66
  • [34] Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification
    Ladayya, Faroh
    Purnami, Santi Wulan
    Irhamah
    13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017), 2017, 1905
  • [35] Integration of feature vector selection and support vector machine for classification of imbalanced data
    Liu, Jie
    Zio, Enrico
    APPLIED SOFT COMPUTING, 2019, 75 : 702 - 711
  • [36] Balance method for imbalanced support vector machines
    Department of Applied Mathematics, Xidian University, Xi'an 710071, China
    不详
    不详
    Moshi Shibie yu Rengong Zhineng, 2008, 2 (136-141):
  • [37] Weighted Least Squares Twin Support Vector Machines for Pattern Classification
    Chen, Jing
    Ji, Guangrong
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 2, 2010, : 242 - 246
  • [38] Applying support vector machines to imbalanced datasets
    Akbani, R
    Kwek, S
    Japkowicz, N
    MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 : 39 - 50
  • [39] ν-twin support vector machine with Universum data for classification
    Yitian Xu
    Mei Chen
    Zhiji Yang
    Guohui Li
    Applied Intelligence, 2016, 44 : 956 - 968
  • [40] An effective Weighted Multi-class Least Squares Twin Support Vector Machine for Imbalanced data classification
    Tomar, Divya
    Agarwal, Sonali
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2015, 8 (04) : 761 - 778