Stratified Normalization LogitBoost for Two-Class Unbalanced Data Classification

被引:3
|
作者
Song, Jie [1 ]
Lu, Xiaoling [2 ,3 ]
Liu, Miao [4 ]
Wu, Xizhi [2 ,3 ]
机构
[1] Capital Univ Econ & Business, Sch Stat, Beijing 100070, Peoples R China
[2] Renmin Univ China, Ctr Appl Stat, Beijing, Peoples R China
[3] Renmin Univ China, Sch Stat, Beijing, Peoples R China
[4] Cent Univ Finance & Econ, Sch Stat, Beijing, Peoples R China
关键词
LogitBoost; Stratified normalization; Unbalanced data;
D O I
10.1080/03610918.2011.589332
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The research on unbalanced data classification is a hot topic in recent years. LogitBoost algorithm is an adaptive algorithm that can get much higher prediction precision. But in the face of unbalanced data, this algorithm could produce a large minority class prediction error. In this article, we propose an improved LogitBoost algorithm named BLogitBoost, based on a stratified normalization method which normalizes within class sampling probability first, then normalizes between classes. The experiments on simulation data and empirical data show that the new algorithm can reduce the minority class prediction error significantly.
引用
收藏
页码:1587 / 1593
页数:7
相关论文
共 50 条
  • [1] Two-Class Weather Classification
    Lu, Cewu
    Lin, Di
    Jia, Jiaya
    Tang, Chi-Keung
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3718 - 3725
  • [2] Two-Class Weather Classification
    Lu, Cewu
    Lin, Di
    Jia, Jiaya
    Tang, Chi-Keung
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2510 - 2524
  • [3] Linear regression and two-class classification with gene expression data
    Huang, XH
    Pan, W
    BIOINFORMATICS, 2003, 19 (16) : 2072 - 2078
  • [4] Reliable classification of two-class cancer data using evolutionary algorithms
    Deb, K
    Reddy, AR
    BIOSYSTEMS, 2003, 72 (1-2) : 111 - 129
  • [5] Using two-class classifiers for multiclass classification
    Tax, DMJ
    Duin, RPW
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 124 - 127
  • [6] Empirical study on two-class image classification
    Kumari, Smriti
    Saharia, Navanath
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 583 - 587
  • [7] A multiple linear regression model approach for two-class fNIR data classification
    S. M. Saklain Galib
    Sheikh Md. Rabiul Islam
    Md. Asadur Rahman
    Iran Journal of Computer Science, 2021, 4 (1) : 45 - 58
  • [8] Choosing k for two-class nearest neighbour classifiers with unbalanced classes
    Hand, DJ
    Vinciotti, V
    PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1555 - 1562
  • [9] Elastic-Net Prefiltering for Two-Class Classification
    Hong, Xia
    Chen, Sheng
    Harris, Chris J.
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (01) : 286 - 295
  • [10] Two-class support vector data description
    Huang, Guangxin
    Chen, Huafu
    Zhou, Zhongli
    Yin, Feng
    Guo, Ke
    PATTERN RECOGNITION, 2011, 44 (02) : 320 - 329