PREDICTIVE LEARNING VIA RULE ENSEMBLES

Cited by: 692
Authors
Friedman, Jerome H. [1]
Popescu, Bogdan E. [1]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
Keywords
Regression; classification; learning ensembles; rules; interaction effects; variable importance; machine learning; data mining
DOI
10.1214/07-AOAS148
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
General regression and classification models are constructed as linear combinations of simple rules derived from the data. Each rule consists of a conjunction of a small number of simple statements concerning the values of individual input variables. These rule ensembles are shown to produce predictive accuracy comparable to the best methods. However, their principal advantage lies in interpretation. Because of its simple form, each rule is easy to understand, as is its influence on individual predictions, selected subsets of predictions, or globally over the entire space of joint input variable values. Similarly, the degree of relevance of the respective input variables can be assessed globally, locally in different regions of the input space, or at individual prediction points. Techniques are presented for automatically identifying those variables that are involved in interactions with other variables, the strength and degree of those interactions, as well as the identities of the other variables with which they interact. Graphical representations are used to visualize both main and interaction effects.
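The model form described in the abstract, a linear combination of rules where each rule is a conjunction of simple statements about individual input variables, can be illustrated with a minimal sketch. The variable names (`age`, `income`), thresholds, coefficients, and helper functions below are hypothetical and chosen only for illustration; the sketch shows how a prediction is assembled from given rules and does not cover how the paper derives rules or fits their coefficients.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]
Condition = Callable[[Row], bool]
Rule = Callable[[Row], int]


def make_rule(conditions: List[Condition]) -> Rule:
    """Build a rule that returns 1 only if every simple statement holds, else 0."""
    def rule(x: Row) -> int:
        return int(all(cond(x) for cond in conditions))
    return rule


def predict(x: Row, intercept: float, weighted_rules: List[Tuple[float, Rule]]) -> float:
    """Linear combination: intercept plus the sum of coefficient * rule value."""
    return intercept + sum(coef * rule(x) for coef, rule in weighted_rules)


# Hypothetical rules and coefficients (assumptions, not taken from the paper).
rules: List[Tuple[float, Rule]] = [
    (2.0, make_rule([lambda x: x["age"] > 40, lambda x: x["income"] <= 50.0])),
    (-1.5, make_rule([lambda x: x["age"] <= 25])),
]

if __name__ == "__main__":
    print(predict({"age": 45.0, "income": 30.0}, 1.0, rules))
```

Because each rule contributes either zero or its coefficient, the effect of any single rule on an individual prediction can be read off directly, which is the interpretability property the abstract emphasizes.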
Pages: 916-954
Page count: 39