Self-paced ensemble and big data identification: a classification of substantial imbalance computational analysis

Cited by: 0
Authors
Bano, Shahzadi [1 ]
Zhi, Weimei [1 ]
Qiu, Baozhi [1 ]
Raza, Muhammad [2 ]
Sehito, Nabila [3 ]
Kamal, Mian Muhammad [4 ]
Aldehim, Ghadah [5 ]
Alruwais, Nuha [6 ]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, 100 Sci Ave, Zhengzhou 450001, Peoples R China
[2] Xian Technol Univ, Xian, Peoples R China
[3] Zhengzhou Univ, Sch Elect Informat Engn, 100 Sci Ave, Zhengzhou 450001, Henan, Peoples R China
[4] Southeast Univ, Sch Elect Sci & Engn, Joint Int Res Lab Informat Display & Visualizat, Nanjing 210018, Peoples R China
[5] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[6] King Saud Univ, Coll Appl Studies & Community Serv, Dept Comp Sci & Engn, POB 22459, Riyadh 11495, Saudi Arabia
Source
JOURNAL OF SUPERCOMPUTING | 2024 / Vol. 80 / Issue 07
Keywords
Self-paced ensemble; Big data; Classification; Computational; Simulation; Substantial imbalance;
DOI
10.1007/s11227-023-05828-6
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
This paper addresses the challenges of learning classifiers from large-scale, highly imbalanced datasets, which are prevalent in many real-world applications. Traditional learning algorithms often suffer from poor performance and low computational efficiency when dealing with imbalanced data. Factors such as class imbalance, noise, and class overlap make it difficult to learn effective classifiers. In this study, we propose a novel self-paced ensemble framework for classifying imbalanced data. The framework uses under-sampling to harmonize data hardness in a self-paced manner and to build a robust ensemble. Extensive experiments demonstrate promising results in handling overlapping classes and skewed distributions while maintaining computational efficiency. The self-paced ensemble method addresses the challenges of high imbalance ratios, class overlap, and noise in large-scale imbalanced classification problems. By incorporating knowledge of these challenges into our learning framework, we establish the concept of classification hardness distribution. The self-paced ensemble is a revolutionary learning paradigm for massively imbalanced classification, capable of improving the performance of existing learning algorithms on imbalanced data and providing better results for future applications.
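As a rough illustration of the idea summarized in the abstract, the following Python sketch builds an ensemble by under-sampling the majority class according to how hard each majority sample currently is to classify. It is not the authors' implementation: the toy `CentroidScorer` base learner, the binning of hardness into `n_bins` groups, the `1 / (hardness + alpha)` bin weights, and the tangent schedule for `alpha` are all assumptions chosen to keep the example small and dependent only on NumPy.

```python
import numpy as np


class CentroidScorer:
    """Toy base learner (an assumption): scores by relative distance to class centroids."""

    def fit(self, X, y):
        self.c0 = X[y == 0].mean(axis=0)
        self.c1 = X[y == 1].mean(axis=0)
        return self

    def predict_proba(self, X):
        d0 = np.linalg.norm(X - self.c0, axis=1)
        d1 = np.linalg.norm(X - self.c1, axis=1)
        return d0 / (d0 + d1 + 1e-12)  # probability-like score for class 1


def self_paced_undersample(X_maj, hardness, n_keep, alpha, n_bins, rng):
    """Draw n_keep majority samples; each hardness bin is weighted by 1 / (mean hardness + alpha)."""
    edges = np.linspace(hardness.min(), hardness.max() + 1e-12, n_bins + 1)
    bin_id = np.clip(np.digitize(hardness, edges) - 1, 0, n_bins - 1)
    counts = np.bincount(bin_id, minlength=n_bins)
    weights = np.zeros(n_bins)
    for b in range(n_bins):
        if counts[b] > 0:
            weights[b] = 1.0 / (hardness[bin_id == b].mean() + alpha)
    # Share each bin's weight equally among the samples in that bin.
    p = weights[bin_id] / counts[bin_id]
    p /= p.sum()
    idx = rng.choice(len(X_maj), size=n_keep, replace=False, p=p)
    return X_maj[idx]


def fit_self_paced_ensemble(X, y, n_estimators=10, n_bins=5, seed=0):
    """Label 1 is the minority class, label 0 the (larger) majority class."""
    rng = np.random.default_rng(seed)
    X_min, X_maj = X[y == 1], X[y == 0]
    n_min = len(X_min)
    y_train = np.concatenate([np.zeros(n_min, dtype=int), np.ones(n_min, dtype=int)])
    models = []
    for i in range(n_estimators):
        if i == 0:
            # No ensemble yet, so start with plain random under-sampling.
            X_sub = X_maj[rng.choice(len(X_maj), size=n_min, replace=False)]
        else:
            # Hardness of a majority sample: the current ensemble's error on it,
            # i.e. the mean predicted class-1 score for a sample whose label is 0.
            hardness = np.mean([m.predict_proba(X_maj) for m in models], axis=0)
            # alpha grows with i, so later rounds weight the bins more evenly.
            alpha = np.tan(np.pi / 2 * i / (n_estimators - 1))
            X_sub = self_paced_undersample(X_maj, hardness, n_min, alpha, n_bins, rng)
        X_train = np.vstack([X_sub, X_min])
        models.append(CentroidScorer().fit(X_train, y_train))
    return models


def predict(models, X):
    """Average the base learners' scores and threshold at 0.5."""
    scores = np.mean([m.predict_proba(X) for m in models], axis=0)
    return (scores >= 0.5).astype(int)
```

In practice the toy base learner would be replaced by a real classifier, and the bin count and `alpha` schedule would be tuned for the dataset at hand.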
Pages: 9848 - 9869
Page count: 22
Related papers
50 in total
  • [21] Self-Paced Convolutional Neural Network for PolSAR Images Classification
    Jiao, Changzhe
    Wang, Xinlin
    Gou, Shuiping
    Chen, Wenshuai
    Li, Debo
    Chen, Chao
    Li, Xiaofeng
    REMOTE SENSING, 2019, 11 (04)
  • [22] Adaptive Self-Paced Deep Clustering with Data Augmentation
    Guo, Xifeng
    Liu, Xinwang
    Zhu, En
    Zhu, Xinzhong
    Li, Miaomiao
    Xu, Xin
    Yin, Jianping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (09) : 1680 - 1693
  • [23] Extreme Learning Machine for Supervised Classification with Self-paced Learning
    Li, Li
    Zhao, Kaiyi
    Li, Sicong
    Sun, Ruizhi
    Cai, Saihua
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 1723 - 1744
  • [24] Self-Paced Joint Sparse Representation for the Classification of Hyperspectral Images
    Peng, Jiangtao
    Sun, Weiwei
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (02) : 1183 - 1194
  • [25] Self-paced data augmentation for training neural networks
    Takase, Tomoumi
    Karakida, Ryo
    Asoh, Hideki
    NEUROCOMPUTING, 2021, 442 : 296 - 306
  • [26] SELF-PACED EXERCISE, AFFECT, AND ADHERENCE: A MEDIATOR ANALYSIS USING EMA DATA
    Williams, David M.
    Dunsiger, Shira I.
    Miranda, Robert, Jr.
    Gwaltney, Chad
    Monti, Peter M.
    Emerson, Jessica A.
    ANNALS OF BEHAVIORAL MEDICINE, 2015, 49 : S248 - S248
  • [27] A Robust Pneumonia Classification Approach based on Self-Paced Learning
    Asare, Sarpong Kwadwo
    You, Fei
    Nartey, Obed Tettey
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 83 - 89
  • [28] Extreme Learning Machine for Supervised Classification with Self-paced Learning
    Li Li
    Kaiyi Zhao
    Sicong Li
    Ruizhi Sun
    Saihua Cai
    Neural Processing Letters, 2020, 52 : 1723 - 1744
  • [29] Multi-modal self-paced learning for image classification
    Xu, Wei
    Liu, Wei
    Huang, Xiaolin
    Yang, Jie
    Qiu, Song
    NEUROCOMPUTING, 2018, 309 : 134 - 144
  • [30] Multi-modal AD classification via self-paced latent correlation analysis
    Zhu, Qi
    Yuan, Ning
    Huang, Jiashuang
    Hao, Xiaoke
    Zhang, Daoqiang
    NEUROCOMPUTING, 2019, 355 : 143 - 154