MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

被引:0
|
作者
Rayhan, Farshid [1 ]
Ahmed, Sajid [1 ]
Mahbub, Asif [1 ]
Jani, Md. Rafsan [1 ]
Shatabda, Swakkhar [1 ]
Farid, Dewan Md. [1 ]
Rahman, Chowdhury Mofizur [1 ]
机构
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
关键词
Boosting; Class imbalance; Ensemble; Binary classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance problem has been a challenging research problem in the fields of machine learning and data mining as most real life datasets are imbalanced. Several existing machine learning algorithms try to maximise the accuracy classification by correctly identifying majority class samples while ignoring the minority class. However, the concept of the minority class instances usually represents a higher interest than the majority class. Recently, several cost sensitive methods, ensemble models and sampling techniques have been used in literature in order to classify imbalance datasets. In this paper, we propose MEBoost, a new boosting algorithm for imbalanced datasets. MEBoost mixes two different weak learners with boosting to improve the performance on imbalanced datasets. MEBoost is an alternative to the existing techniques such as SMOTEBoost, RUSBoost, AdaBoost etc. The performance of MEBoost has been evaluated on 12 benchmark imbalanced datasets with state of the art ensemble methods like SMOTEBoost, RUSBoost, Easy Ensemble, EUSBoost, DataBoost. Experimental results show significant improvement over the other methods and it can be concluded that MEBoost is an effective and promising algorithm to deal with imbalance datasets.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A review of boosting methods for imbalanced data classification
    Li, Qiujie
    Mao, Yaobin
    PATTERN ANALYSIS AND APPLICATIONS, 2014, 17 (04) : 679 - 693
  • [2] An Imbalanced Data Classification Algorithm Based on Boosting
    Li Qiu-Jie
    Mao Yao-Bin
    Wang Zhi-Quan
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 3053 - 3057
  • [3] A review of boosting methods for imbalanced data classification
    Qiujie Li
    Yaobin Mao
    Pattern Analysis and Applications, 2014, 17 : 679 - 693
  • [4] A New Improved Boosting for Imbalanced Data Classification
    Zhang, Zongtang
    Qiu, JiaXing
    Dai, Weiguo
    2019 THE 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, CONTROL AND ROBOTICS (EECR 2019), 2019, 533
  • [5] Oversampling boosting for classification of imbalanced software defect data
    Li, Guangling
    Wang, Shihai
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4149 - 4154
  • [6] Cost-sensitive boosting for classification of imbalanced data
    Sun, Yamnin
    Kamel, Mohamed S.
    Wong, Andrew K. C.
    Wang, Yang
    PATTERN RECOGNITION, 2007, 40 (12) : 3358 - 3378
  • [7] Imbalanced Data Classification Algorithm Based on Boosting and Cascade Model
    Zhang, Xiaolong
    Cheng, Chao
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2861 - 2866
  • [8] Algorithm of Partition based Network Boosting for Imbalanced Data Classification
    Gou, Shuiping
    Yang, Hui
    Jiao, Licheng
    Zhuang, Xiong
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [9] Gaussian Mixture Based Semi Supervised Boosting For Imbalanced Data Classification
    Paul, Mahit Kumar
    Pal, Biprodip
    2016 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER & TELECOMMUNICATION ENGINEERING (ICECTE), 2016,
  • [10] IMBoost: A New Weighting Factor for Boosting to Improve the Classification Performance of Imbalanced Data
    Roshan, Seyedehsan
    Tanha, Jafar
    Hallaji, Farzad
    Ghanbari, Mohammad-reza
    COMPLEXITY, 2023, 2023