Multiple Kernel Learning With Minority Oversampling for Classifying Imbalanced Data

被引:6
|
作者
Wang, Ling [1 ]
Wang, Hongqiao [1 ]
Fu, Guangyuan [1 ]
机构
[1] Rocket Force Univ Engn, Dept Informat Engn, Xian 710025, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Sensitivity; Shape; Classification algorithms; Kernel; Task analysis; Standards; Class imbalanced learning; multiple kernel learning; nonlinear oversampling; cost-sensitive;
D O I
10.1109/ACCESS.2020.3046604
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance problems, developed due to the sampling bias or measurement error, occur frequently in real-world pattern classification tasks. The traditional classifiers focus on the overall classification accuracy and ignore the minority class, which may degrade the classification performance. However, existing oversampling algorithms generally make specific assumptions to balance the class size and do not sufficiently consider irregularities present in imbalanced data. As a result, these methods can perform well only on certain benchmarks. In this paper, by incorporating minority oversampling and cost-sensitive learning, we propose multiple kernel learning with minority oversampling (MKLMO), for efficiently handling the class imbalance problem with small disjuncts, overlapping, and nonlinear shape. Unlike existing methods where oversampling of the minority class is performed first and then a standard classifier is deployed on the rebalanced data, the proposed MKLMO generates synthetic instances and trains classifier synchronously in the same feature space. Specially, we define a distance metric in the optimal feature space by multiple kernel learning and use kernel trick to expand the original Gram matrix. Moreover, we assign different weights to instances, based on the imbalance ratio, for reducing the bias of the classifier towards the majority class. In order to evaluate the proposed MKLMO method, several experiments are performed with nine artificial and twenty-one real-world datasets. The experimental results show that our algorithm outperforms other baseline algorithms significantly in terms of the assessment metric geometric mean (G-mean), especially in the presence of data irregularities.
引用
收藏
页码:565 / 580
页数:16
相关论文
共 50 条
  • [31] Improving interpolation-based oversampling for imbalanced data learning
    Zhu, Tuanfei
    Lin, Yaping
    Liu, Yonghe
    KNOWLEDGE-BASED SYSTEMS, 2020, 187
  • [32] Anomaly detection and oversampling approach for classifying imbalanced data using CLUBS technique in IoT healthcare data
    Subha, S.
    Sathiaseelan, J. G. R.
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2023, 11 (03) : 255 - 271
  • [33] Counterfactual-based minority oversampling for imbalanced classification
    Wang, Shu
    Luo, Hao
    Huang, Shanshan
    Li, Qingsong
    Liu, Li
    Su, Guoxin
    Liu, Ming
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [34] Imbalanced Twitter Sentiment Analysis using Minority Oversampling
    Ghosh, Kushankur
    Banerjee, Arghasree
    Chatterjee, Sankhadeep
    Sen, Soumya
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 384 - 388
  • [35] Oversampling the minority class in a multi-linear feature space for imbalanced data classification
    Liang, Peifeng
    Li, Weite
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2018, 13 (10) : 1483 - 1491
  • [36] SPAW-SMOTE: Space Partitioning Adaptive Weighted Synthetic Minority Oversampling Technique For Imbalanced Data Set Learning
    Zhang, Qiang
    He, Junjiang
    Li, Tao
    Lan, Xiaolong
    Fang, Wenbo
    Li, Yihong
    COMPUTER JOURNAL, 2023, 67 (05): : 1747 - 1762
  • [37] Radius-SMOTE: A New Oversampling Technique of Minority Samples Based on Radius Distance for Learning From Imbalanced Data
    Pradipta, Gede Angga
    Wardoyo, Retantyo
    Musdholifah, Aina
    Sanjaya, I. Nyoman Hariyasa
    IEEE ACCESS, 2021, 9 : 74763 - 74777
  • [38] Classifying imbalanced dataset based on minority detection
    Liu, Tong
    Liang, Yongquan
    Ni, Weijian
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 3236 - 3241
  • [39] Iterative Nearest Neighborhood Oversampling in Semisupervised Learning from Imbalanced Data
    Li, Fengqi
    Yu, Chuang
    Yang, Nanhai
    Xia, Feng
    Li, Guangming
    Kaveh-Yazdy, Fatemeh
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [40] Gravitation balanced multiple kernel learning for imbalanced classification
    Mengping Yang
    Zhe Wang
    Yanqiong Li
    Yangming Zhou
    Dongdong Li
    Wenli Du
    Neural Computing and Applications, 2022, 34 : 13807 - 13823