MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

被引:0
|
作者
Rayhan, Farshid [1 ]
Ahmed, Sajid [1 ]
Mahbub, Asif [1 ]
Jani, Md. Rafsan [1 ]
Shatabda, Swakkhar [1 ]
Farid, Dewan Md. [1 ]
Rahman, Chowdhury Mofizur [1 ]
机构
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
来源
2017 11TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA) | 2017年
关键词
Boosting; Class imbalance; Ensemble; Binary classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance problem has been a challenging research problem in the fields of machine learning and data mining as most real life datasets are imbalanced. Several existing machine learning algorithms try to maximise the accuracy classification by correctly identifying majority class samples while ignoring the minority class. However, the concept of the minority class instances usually represents a higher interest than the majority class. Recently, several cost sensitive methods, ensemble models and sampling techniques have been used in literature in order to classify imbalance datasets. In this paper, we propose MEBoost, a new boosting algorithm for imbalanced datasets. MEBoost mixes two different weak learners with boosting to improve the performance on imbalanced datasets. MEBoost is an alternative to the existing techniques such as SMOTEBoost, RUSBoost, AdaBoost etc. The performance of MEBoost has been evaluated on 12 benchmark imbalanced datasets with state of the art ensemble methods like SMOTEBoost, RUSBoost, Easy Ensemble, EUSBoost, DataBoost. Experimental results show significant improvement over the other methods and it can be concluded that MEBoost is an effective and promising algorithm to deal with imbalance datasets.
引用
收藏
页数:6
相关论文
共 50 条
  • [11] Post-boosting of classification boundary for imbalanced data using geometric mean
    Du, Jie
    Vong, Chi-Man
    Pun, Chi-Man
    Wong, Pak-Kin
    Ip, Weng-Fai
    NEURAL NETWORKS, 2017, 96 : 101 - 114
  • [12] Boosting methods for multi-class imbalanced data classification: an experimental review
    Jafar Tanha
    Yousef Abdi
    Negin Samadi
    Nazila Razzaghi
    Mohammad Asadpour
    Journal of Big Data, 7
  • [13] Boosting methods for multi-class imbalanced data classification: an experimental review
    Tanha, Jafar
    Abdi, Yousef
    Samadi, Negin
    Razzaghi, Nazila
    Asadpour, Mohammad
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [14] Majority-to-minority resampling for boosting-based classification under imbalanced data
    Gaoshan Wang
    Jian Wang
    Kejing He
    Applied Intelligence, 2023, 53 : 4541 - 4562
  • [15] Majority-to-minority resampling for boosting-based classification under imbalanced data
    Wang, Gaoshan
    Wang, Jian
    He, Kejing
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4541 - 4562
  • [17] Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification
    Shi, Shuhao
    Qiao, Kai
    Yang, Shuai
    Wang, Linyuan
    Chen, Jian
    Yan, Bin
    FRONTIERS IN NEUROROBOTICS, 2021, 15
  • [18] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [19] Using boosting tree to learn imbalanced data
    Yang Ridong
    Zhang Shiyu
    Li Lin
    Wang Zhe
    Zhou Yi
    The Journal of China Universities of Posts and Telecommunications, 2019, 26 (02) : 43 - 51
  • [20] Online Bagging and Boosting for Imbalanced Data Streams
    Wang, Boyu
    Pineau, Joelle
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3353 - 3366