A lazy bagging approach to classification

被引:23
作者
Zhu, Xingquan [1 ]
Yang, Ying [2 ]
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
[2] Monash Univ, Fac Informat Technol, Melbourne, Vic 3800, Australia
基金
中国国家自然科学基金;
关键词
classification; classifier ensemble; bagging; lazy learning;
D O I
10.1016/j.patcog.2008.03.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
in this paper, we propose lazy bagging (LB), which builds bootstrap replicate bags based on the characteristics of test instances. Upon receiving a test instance X-k, LB trims bootstrap bags by taking into consideration X-k's nearest neighbors in the training data. Our hypothesis is that an unlabeled instance's nearest neighbors provide valuable information to enhance local learning and generate a classifier with refined decision boundaries emphasizing the test instance's surrounding region. In particular, by taking full advantage of X-k's nearest neighbors, classifiers are able to reduce classification bias and variance when classifying X-k. As a result, LB, which is built on these classifiers, can significantly reduce classification error, compared with the traditional bagging (TB) approach. To investigate LB's performance, we first use carefully designed synthetic data sets to gain insight into why LB works and under which conditions it can outperform TB. We then test LB against four rival algorithms on a large suite of 35 real-world benchmark data sets using a variety of statistical tests. Empirical results confirm that LB can statistically significantly outperform alternative methods in terms of reducing classification error. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2980 / 2992
页数:13
相关论文
共 29 条
[1]  
Aha D.W., 1997, ARTIF INTELL REV, V11
[2]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[3]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[4]  
BAY SD, 1998, P 15 ICML C
[5]  
BERIMAN L, 1996, 460 UCBERKELEY
[6]  
Blake C.L., 1998, UCI repository of machine learning databases
[7]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]  
Domingos P, 2000, SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), P564
[10]  
Domingos P, 2000, UNIFIED BIAS VARIANC