Experimental Investigation of the Effect of Discrete Attributes on the Precision of Classification Methods

被引:0
|
作者
Entezari-Maleki, Reza [1 ]
Iranmanesh, Seyyed Mehdi [2 ]
Minaei-Bidgoli, Behrouz [1 ]
机构
[1] IUST, Dept Comp Engn, Tehran, Iran
[2] IUST, Artificial Intelligence Dept Comp Sci, Tehran, Iran
关键词
DECISION-TREE INDUCTION; LOGISTIC-REGRESSION; NEAREST-NEIGHBOR; MACHINE;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, the precisions of the logistic regression, Naive-Bayes and linear data classification methods, with regard to the Area Under Curve (AUC) metric have been compared. The effect of parameters including size of the dataset, kind of the independent attributes, number of the discrete attributes, and their values have been investigated. From the results, it can be concluded that in datasets consisting of both discrete and continuous attributes, the AUC of the three mentioned classifiers is the same. With increasing the number of the discrete attributes, the AUC of the logistic regression is increased and the precision related to this classifier become more than the other two classifiers. Also considering impact of the discrete attributes it can be seen that with increasing the number of values in discrete attributes the AUC related to the logistic regression classifier increases and linear regressions' AUC decreases, but the AUC of the Naive-Bayes classifier remains constant. The results of this research can help data miners in selecting the more efficient classifiers based on the conditions of feature that exist in their datasets.
引用
收藏
页码:172 / +
页数:3
相关论文
共 50 条