Stratified Normalization LogitBoost for Two-Class Unbalanced Data Classification

被引:3
|
作者
Song, Jie [1 ]
Lu, Xiaoling [2 ,3 ]
Liu, Miao [4 ]
Wu, Xizhi [2 ,3 ]
机构
[1] Capital Univ Econ & Business, Sch Stat, Beijing 100070, Peoples R China
[2] Renmin Univ China, Ctr Appl Stat, Beijing, Peoples R China
[3] Renmin Univ China, Sch Stat, Beijing, Peoples R China
[4] Cent Univ Finance & Econ, Sch Stat, Beijing, Peoples R China
关键词
LogitBoost; Stratified normalization; Unbalanced data;
D O I
10.1080/03610918.2011.589332
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The research on unbalanced data classification is a hot topic in recent years. LogitBoost algorithm is an adaptive algorithm that can get much higher prediction precision. But in the face of unbalanced data, this algorithm could produce a large minority class prediction error. In this article, we propose an improved LogitBoost algorithm named BLogitBoost, based on a stratified normalization method which normalizes within class sampling probability first, then normalizes between classes. The experiments on simulation data and empirical data show that the new algorithm can reduce the minority class prediction error significantly.
引用
收藏
页码:1587 / 1593
页数:7
相关论文
共 50 条
  • [21] Fusing one-class and two-class classification - A case study on the detection of pepper fraud
    Alewijn, Martin
    Akridopoulou, Vasiliki
    Venderink, Tjerk
    Muller-Maatsch, Judith
    Silletti, Erika
    FOOD CONTROL, 2023, 145
  • [22] A generalized boosting algorithm and its application to two-class chemical classification problem
    He, P
    Fang, KT
    Liang, YZ
    Li, BY
    ANALYTICA CHIMICA ACTA, 2005, 543 (1-2) : 181 - 191
  • [23] Motor Imagery, Execution, and Observation Classification using Small Amount of EEG Data with Multiple Two-Class CNNs
    Igasaki, Tomohiko
    Kuramura, Yugo
    Takemoto, Junya
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 443 - 446
  • [24] PARALLEL TWO-CLASS 3D-CNN CLASSIFIERS FOR VIDEO CLASSIFICATION
    Li, Jing
    2017 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2017), 2017, : 7 - 11
  • [25] Bi-objective feature selection for discriminant analysis in two-class classification
    Pacheco, Joaquin
    Casado, Silvia
    Angel-Bello, Francisco
    Alvarez, Ada
    KNOWLEDGE-BASED SYSTEMS, 2013, 44 : 57 - 64
  • [26] The impact of feature selection on one and two-class classification performance for plant microRNAs
    Khalifa, Waleed
    Yousef, Malik
    Demirci, Muserref Duygu Sacar
    Allmer, Jens
    PEERJ, 2016, 4
  • [28] A Feature Transformation Method using Multiobjective Genetic Programming for Two-Class Classification
    Hiroyasu, Tomoyuki
    Shiraishi, Toshihide
    Yoshida, Tomoya
    Yamamoto, Utako
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 2989 - 2995
  • [29] An efficient kernel-based nonlinear regression method for two-class classification
    Xu, Y
    Yang, JY
    Lu, JF
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4442 - 4445
  • [30] Efficient Learning From Two-Class Categorical Imbalanced Healthcare Data
    Mathews, Lincy
    Seetha, Hari
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2021, 16 (01) : 81 - 100