The role of significance tests in consistent interpretation of nested partitions

Cited by: 8
Authors
Gibert, Karina [1]
Sevilla-Villanueva, Beatriz [1]
Sanchez-Marre, Miquel [1]
Affiliations
[1] Univ Politecn Catalunya BarcelonaTech, Barcelona, Spain
Keywords
Clustering; Nested partitions; Statistical tests; Sensitivity of a test; Cluster interpretation; Consistency;
DOI
10.1016/j.cam.2015.01.031
Chinese Library Classification (CLC) number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
Cluster interpretation is an important step for a proper understanding of a set of classes, independently of whether they have been automatically discovered or expert-based. An understanding of classes is crucial for the further use of classes as the basis of a decision-making process. The abundant work on cluster validity found in the literature is mainly focused on the validation of clusters from the structural point of view. However, structural validation does not ensure that the clustering is useful, since meaningfulness is the key to guaranteeing that classes can support further decisions. In previous works, special significance tests taken from the field of multivariate analysis were introduced in an interpretation methodology for automatically assessing relevant variables in particular classes. In this paper, we present the interpretation of nested partitions and the relationships between both interpretations are studied. In particular, the inconsistencies produced in interpretation when a second partition refines the first one with a higher level of granularity are studied, diagnosed, and a modification of the original methodology is provided to guarantee consistency in these cases. The relevant characteristics detected in a parent class must also be inherited in subclasses, or at least in some of them. The proposal is evaluated using a real data set on baseline health conditions and dietary habits of a sample of the general population. (C) 2015 Elsevier B.V. All rights reserved.
Pages: 623-633
Number of pages: 11