Multiple Kernel Learning With Minority Oversampling for Classifying Imbalanced Data

Cited by: 6
Authors
Wang, Ling [1]
Wang, Hongqiao [1]
Fu, Guangyuan [1]
Affiliations
[1] Rocket Force Univ Engn, Dept Informat Engn, Xian 710025, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; Sensitivity; Shape; Classification algorithms; Kernel; Task analysis; Standards; Class imbalanced learning; multiple kernel learning; nonlinear oversampling; cost-sensitive
DOI
10.1109/ACCESS.2020.3046604
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Class imbalance problems, which arise from sampling bias or measurement error, occur frequently in real-world pattern classification tasks. Traditional classifiers focus on overall classification accuracy and ignore the minority class, which may degrade classification performance. Existing oversampling algorithms, however, generally make specific assumptions to balance the class sizes and do not sufficiently consider the irregularities present in imbalanced data. As a result, these methods perform well only on certain benchmarks. In this paper, by incorporating minority oversampling and cost-sensitive learning, we propose multiple kernel learning with minority oversampling (MKLMO) for efficiently handling class imbalance problems with small disjuncts, overlapping, and nonlinear shapes. Unlike existing methods, in which the minority class is oversampled first and a standard classifier is then trained on the rebalanced data, the proposed MKLMO generates synthetic instances and trains the classifier simultaneously in the same feature space. Specifically, we define a distance metric in the optimal feature space by multiple kernel learning and use the kernel trick to expand the original Gram matrix. Moreover, we assign different weights to instances, based on the imbalance ratio, to reduce the bias of the classifier towards the majority class. To evaluate the proposed MKLMO method, several experiments are performed on nine artificial and twenty-one real-world datasets. The experimental results show that our algorithm significantly outperforms the baseline algorithms in terms of the geometric mean (G-mean) metric, especially in the presence of data irregularities.
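The abstract names three ingredients without giving formulas: expanding the Gram matrix with synthetic minority instances, weighting instances by the imbalance ratio, and scoring with G-mean. The following is a minimal Python sketch of how each could look, not the authors' implementation. It assumes a single precomputed kernel (in MKLMO this would be the learned combination of kernels), a synthetic point formed as a convex combination of two minority points in feature space, and a weighting scheme in which minority instances receive the majority-to-minority ratio. The function names `expand_gram`, `imbalance_weights`, and `g_mean` and the parameter `delta` are hypothetical.

```python
import numpy as np


def expand_gram(K, i, j, delta):
    """Append one synthetic instance to a Gram matrix using only kernel values.

    Assumption (not stated in the abstract): the synthetic point is
    phi(z) = (1 - delta) * phi(x_i) + delta * phi(x_j), with 0 <= delta <= 1.
    Inner products with phi(z) then follow from linearity of the inner product.
    """
    K = np.asarray(K, dtype=float)
    n = K.shape[0]
    # <phi(z), phi(x_k)> for every existing instance k
    row = (1.0 - delta) * K[i] + delta * K[j]
    # <phi(z), phi(z)>
    self_sim = ((1.0 - delta) ** 2 * K[i, i]
                + 2.0 * delta * (1.0 - delta) * K[i, j]
                + delta ** 2 * K[j, j])
    K_new = np.empty((n + 1, n + 1))
    K_new[:n, :n] = K
    K_new[n, :n] = row
    K_new[:n, n] = row
    K_new[n, n] = self_sim
    return K_new


def imbalance_weights(y, positive=1):
    """Per-instance weights from the imbalance ratio.

    Assumed scheme: minority (positive) instances get n_majority / n_minority,
    majority instances get 1.
    """
    y = np.asarray(y)
    n_pos = np.sum(y == positive)
    n_neg = y.size - n_pos
    return np.where(y == positive, n_neg / n_pos, 1.0)


def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of sensitivity and specificity for a binary problem."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    pos = y_true == positive
    sensitivity = np.mean(y_pred[pos] == positive)   # recall on the minority class
    specificity = np.mean(y_pred[~pos] != positive)  # recall on the majority class
    return float(np.sqrt(sensitivity * specificity))
```

With a linear kernel, the matrix returned by `expand_gram` matches the Gram matrix of the data augmented with the interpolated point, which is a convenient sanity check. The expanded matrix and the weights could then be passed to any classifier that accepts a precomputed kernel and per-sample weights.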
Pages: 565-580
Page count: 16
Related papers
50 in total
  • [41] Gravitation balanced multiple kernel learning for imbalanced classification
    Yang, Mengping
    Wang, Zhe
    Li, Yanqiong
    Zhou, Yangming
    Li, Dongdong
    Du, Wenli
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): 13807-13823
  • [42] Global Data Distribution Weighted Synthetic Oversampling Technique for Imbalanced Learning
    Wang, Zhenfei
    Wang, Hongju
    IEEE ACCESS, 2021, 9: 44770-44783
  • [43] Oversampling imbalanced data in the string space
    Castellanos, Francisco J.
    Valero-Mas, Jose J.
    Calvo-Zaragoza, Jorge
    Rico-Juan, Juan R.
    PATTERN RECOGNITION LETTERS, 2018, 103: 32-38
  • [44] Oversampling techniques for imbalanced data in regression
    Belhaouari, Samir Brahim
    Islam, Ashhadul
    Kassoul, Khelil
    Al-Fuqaha, Ala
    Bouzerdoum, Abdesselam
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [45] Adaptive Oversampling for Imbalanced Data Classification
    Ertekin, Seyda
    INFORMATION SCIENCES AND SYSTEMS 2013, 2013, 264: 261-269
  • [46] Learning Minority Class prior to Minority Oversampling
    Sadhukhan, Payel
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019
  • [47] An oversampling method for imbalanced data based on spatial distribution of minority samples SD-KMSMOTE
    Wensheng Yang
    Chengsheng Pan
    Yanyan Zhang
    Scientific Reports, 12
  • [48] CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification
    Elyan, Eyad
    Moreno-Garcia, Carlos Francisco
    Jayne, Chrisina
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07): 2839-2851
  • [49] Iterative minority oversampling and its ensemble for ordinal imbalanced datasets
    Wang, Ning
    Zhang, Zhong-Liang
    Luo, Xing-Gang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [50] Clustering-based improved adaptive synthetic minority oversampling technique for imbalanced data classification
    Jin, Dian
    Xie, Dehong
    Liu, Di
    Gong, Murong
    INTELLIGENT DATA ANALYSIS, 2023, 27 (03): 635-652