Semi-supervised Classification Based Mixed Sampling for Imbalanced Data

Cited by: 10
Authors
Zhao, Jianhua [1 ]
Liu, Ning [2 ]
Affiliations
[1] Shangluo Univ, Coll Math & Comp Applicat, Shangluo 726000, Peoples R China
[2] Shangluo Univ, Coll Econ Management, Shangluo 726000, Peoples R China
Source
OPEN PHYSICS | 2019 / Vol. 17 / Issue 01
Keywords
semi-supervised learning; imbalanced data; over sampling; under sampling; ensemble learning; ALGORITHM; SMOTE;
DOI
10.1515/phys-2019-0103
Chinese Library Classification (CLC) number
O4 [Physics];
Discipline classification code
0702 ;
Abstract
In practical applications, there is a large amount of imbalanced data in which only a small number of samples are labeled. To improve classification performance on this kind of problem, this paper proposes a semi-supervised learning algorithm based on mixed sampling for imbalanced data classification (S2MAID), which combines semi-supervised learning, over sampling, under sampling and ensemble learning. First, an under sampling algorithm, UD-density, is provided to select samples with high information content from the majority class set for semi-supervised learning. Second, a safe supervised-learning method is used to label unlabeled samples and expand the labeled sample set. Third, an over sampling algorithm, SMOTE-density, is provided to balance the imbalanced data set. Fourth, an ensemble technique is used to generate a strong classifier. Finally, experiments are carried out on imbalanced data containing only a few labeled samples, simulating the semi-supervised learning process. The experimental results show that the proposed S2MAID achieves better classification performance.
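The over sampling step in the abstract can be illustrated with a minimal sketch. The abstract does not describe how SMOTE-density weights samples, so the code below is plain SMOTE-style interpolation, not the paper's SMOTE-density. The function name `smote_oversample`, its parameters, and the toy data are assumptions made for illustration only.

```python
import math
import random


def smote_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (plain SMOTE,
    not the paper's SMOTE-density variant)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbours of the base point, excluding itself
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: 6 majority points and 3 minority points.
majority = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.2, 0.4), (0.4, 0.2)]
minority = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1)]

# Generate enough synthetic minority points to match the majority count.
new_points = smote_oversample(minority, n_new=len(majority) - len(minority))
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

Each synthetic point lies on a line segment between two existing minority samples, so the minority class grows without duplicating points exactly. A density-aware variant would presumably change how the base point or neighbour is chosen, but that detail is not given in the abstract.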
Pages: 975-983
Number of pages: 9