A Hybrid Approach Handling Imbalanced Datasets

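Authors
Soda, Paolo [1]
Affiliations
[1] Univ Campus Biomed Rome, Integrated Res Ctr, Med Informat & Comp Sci Lab, Rome, Italy
Keywords
STRATEGIES;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract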
Several binary classification problems exhibit imbalance in class distribution, which influences system learning. Indeed, traditional machine learning algorithms are biased towards the majority class, thus producing poor predictive accuracy over the minority one. To overcome this limitation, many approaches have been proposed to build artificially balanced training sets. Beyond their specific drawbacks, they achieve more balanced accuracies on each class at the expense of global accuracy. This paper first reviews the more recent methods for coping with unbalanced datasets and then proposes a strategy that overcomes the main drawbacks of existing approaches. It is based on an ensemble of classifiers trained on balanced subsets of the original unbalanced training set, working in conjunction with a classifier trained on the original unbalanced dataset. The performance of the method has been estimated on six public datasets, proving its effectiveness also in comparison with other approaches. The method also makes it possible to modify the system behaviour according to the operating scenario.
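
The following is a minimal sketch of the general idea described in the abstract, not the paper's actual method. The abstract does not say how the balanced subsets are built, which base classifier is used, or how the ensemble and the full-data classifier are combined, so everything here is an assumption made for illustration: the nearest-centroid classifier, the partitioning of the majority class into chunks, the `confidence` measure, and the `threshold` switching rule are all hypothetical.

```python
import random
from statistics import mean

def train_centroids(X, y):
    """Nearest-centroid model (an assumed base classifier): one mean vector per class."""
    model = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        model[label] = [mean(col) for col in zip(*rows)]
    return model

def distances(model, x):
    """Euclidean distance from x to each class centroid."""
    return {label: sum((a - b) ** 2 for a, b in zip(x, c)) ** 0.5
            for label, c in model.items()}

def predict(model, x):
    d = distances(model, x)
    return min(d, key=d.get)

def confidence(model, x):
    """Hypothetical confidence in [0, 1] for a binary model:
    relative gap between the two centroid distances."""
    near, far = sorted(distances(model, x).values())
    return (far - near) / (far + near) if far + near > 0 else 0.0

def balanced_subsets(X, y, minority, seed=0):
    """Assumed balancing scheme: pair every minority sample with successive
    chunks of the shuffled majority class (the last chunk may be smaller)."""
    rng = random.Random(seed)
    minor = [(x, t) for x, t in zip(X, y) if t == minority]
    major = [(x, t) for x, t in zip(X, y) if t != minority]
    rng.shuffle(major)
    k = len(minor)
    return [minor + major[i:i + k] for i in range(0, len(major), k)]

def train_hybrid(X, y, minority, seed=0):
    """Train one classifier on the full unbalanced set and one per balanced subset."""
    full = train_centroids(X, y)
    ensemble = []
    for subset in balanced_subsets(X, y, minority, seed):
        Xs, ys = zip(*subset)
        ensemble.append(train_centroids(Xs, ys))
    return full, ensemble

def predict_hybrid(full, ensemble, x, threshold=0.5):
    """Assumed combination rule: trust the full-data classifier when it is
    confident, otherwise fall back to a majority vote of the balanced ensemble
    (vote ties are broken arbitrarily)."""
    if confidence(full, x) >= threshold:
        return predict(full, x)
    votes = [predict(m, x) for m in ensemble]
    return max(set(votes), key=votes.count)

# Toy 1-D data: 12 majority samples (label 0), 3 minority samples (label 1).
X = [[float(i)] for i in range(12)] + [[20.0], [21.0], [22.0]]
y = [0] * 12 + [1] * 3
full, ensemble = train_hybrid(X, y, minority=1)
label = predict_hybrid(full, ensemble, [21.0])
```

In this sketch, raising `threshold` routes more samples to the balanced ensemble and lowering it routes more to the full-data classifier. That is one possible way to tune behaviour for an operating scenario; the abstract does not state how the paper achieves this.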
Pages: 209-218
Page count: 10