A Hybrid Machine Learning Methodology for Imbalanced Datasets

被引:0
|
作者
Lipitakis, Anastasia-Dimitra [1 ]
Kotsiantis, Sotirios [1 ]
机构
[1] Univ Patras, Dept Math, Patras, Hellas, Greece
关键词
computational intelligence; ensembles of classifiers; imbalanced data sets; supervised machine learning; DECISION TREE; CLASSIFICATION; CLASSIFIERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Machine Learning systems several imbalanced data sets exhibit skewed class distributions in which most cases are allocated to a class and far fewer cases to a smaller one. A classifier induced from an imbalanced data set has usually a low error rate for the majority class and an unacceptable error rate for the minority class. In this paper a synoptic review of the various related methodologies is given, a new ensemble methodology is introduced and an experimental study with other ensembles is presented. The proposed method that combines the power of OverBagging and Rotation Forest algorithms improves the identification of a difficult small class, while keeping the classification ability of the other class in an acceptable accuracy level.
引用
收藏
页码:252 / +
页数:6
相关论文
共 50 条
  • [1] Interpretable machine learning for imbalanced credit scoring datasets
    Chen, Yujia
    Calabrese, Raffaella
    Martin-Barragan, Belen
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 312 (01) : 357 - 372
  • [2] Machine Learning with Variational AutoEncoder for Imbalanced Datasets in Intrusion Detection
    Lin, Ying-Dar
    Liu, Zi-Qiang
    Hwang, Ren-Hung
    Nguyen, Van-Linh
    Lin, Po-Ching
    Lai, Yuan-Cheng
    IEEE Access, 2022, 10 : 15247 - 15260
  • [3] Machine Learning With Variational AutoEncoder for Imbalanced Datasets in Intrusion Detection
    Lin, Ying-Dar
    Liu, Zi-Qiang
    Hwang, Ren-Hung
    Van-Linh Nguyen
    Lin, Po-Ching
    Lai, Yuan-Cheng
    IEEE ACCESS, 2022, 10 : 15247 - 15260
  • [4] Imbalanced-learn: A Python']Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning
    Lemaitre, Guillaume
    Nogueira, Fernando
    Aridas, Christos K.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [5] Effect of Imbalanced Datasets on Security of Industrial IoT Using Machine Learning
    Zolanvari, Maede
    Teixeira, Marcio A.
    Jain, Raj
    2018 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2018, : 112 - 117
  • [6] Universum based kernelized weighted extreme learning machine for imbalanced datasets
    Raghuwanshi, Bhagat Singh
    Mangal, Akansha
    Shukla, Sanyam
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (11) : 3387 - 3408
  • [7] Universum based kernelized weighted extreme learning machine for imbalanced datasets
    Bhagat Singh Raghuwanshi
    Akansha Mangal
    Sanyam Shukla
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 3387 - 3408
  • [8] A Study on Machine Learning for Imbalanced Datasets with Answer Validation of Question Answering
    Day, Min-Yuh
    Tsai, Cheng-Chia
    PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 513 - 519
  • [9] An algorithm of robust online extreme learning machine for dynamic imbalanced datasets
    Zhang, Jing
    Feng, Lin
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (07): : 1487 - 1498
  • [10] Machine Learning for Imbalanced Datasets of Recognizing Inference in Text with Linguistic Phenomena
    Day, Min-Yuh
    Tsai, Cheng-Chia
    2015 IEEE 16TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2015, : 562 - 568