Entity connectivity vs. hierarchical levelling as a basis for data model clustering: An experimental analysis

被引:0
|
作者
Moody, DL [1 ]
机构
[1] Charles Univ Prague, Dept Software Engn, Prague, Czech Republic
[2] Monash Univ, Sch Business Syst, Melbourne, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data model clustering is the process of dividing large and complex models into: parts of manageable size, in order to improve understanding and simplify documentation and maintenance. Based on theories of human cognition,: a previous paper proposed;connectivity (defined as the number of relationships an entity participates in) as a basis for clustering data models. This paper describes a series of laboratory experiments which evaluate the validity of this metric as a basis for clustering compared to hierarchical levelling, which has been. the predominant approach used in previous research. The first two experiments investigate the relationship between the metrics and perceptions of importance, while the third experiment investigates their relationship to how people intuitively cluster entities. The results show that connectivity provides an empirically valid basis for clustering data models, which closely matches human perceptions of importance and "chunking" behaviour. No significant results were found for hierarchical level in any of the experiments. The high levels of statistical significance and effect size of the results for connectivity, together with their consistency across different domains and sample populations, suggests the possible discovery of a natural "law" governing data models.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 50 条
  • [41] A hierarchical model for compositional data analysis
    Mark J. Brewer
    João A. N. Filipe
    David A. Elston
    Lorna A. Dawson
    Robert W. Mayes
    Chris Soulsby
    Sarah M. Dunn
    Journal of Agricultural, Biological, and Environmental Statistics, 2005, 10 : 19 - 34
  • [42] Clustering gene expression data:: an experimental analysis
    Ortiz-Gama, S
    Sucar, LE
    Rodríguez, AF
    PROCEEDINGS OF THE FIFTH MEXICAN INTERNATIONAL CONFERENCE IN COMPUTER SCIENCE (ENC 2004), 2004, : 168 - 175
  • [43] A quantitative analysis of Educational Data through the Comparison between Hierarchical and Not-Hierarchical Clustering
    Battaglia, Onofrio Rosario
    Di Paola, Bendetto
    Fazio, Claudio
    EURASIA JOURNAL OF MATHEMATICS SCIENCE AND TECHNOLOGY EDUCATION, 2017, 13 (08) : 4491 - 4512
  • [44] Pediatric defibrillation: Biphasic vs. monophasic waveforms in an experimental model
    Clark, CB
    Davies, LR
    Kerber, RE
    CIRCULATION, 1999, 100 (18) : 90 - 90
  • [45] Numerical vs. experimental analyses of Mustafa Pasha Mosque model
    Lazarov, L.
    Todorov, K.
    PROTECTION OF HISTORICAL BUILDINGS - PROHITECH 09, VOL 1 AND 2, 2009, : 1165 - 1170
  • [46] EXPERIMENTAL MODEL OF OSTEOPOROSIS: COMPARISON OF OVARIECTOMY VS. BOTULINUM TOXIN A
    Atmaca, Halil
    Aydin, Adem
    Musaoglu, Resul
    OSTEOPOROSIS INTERNATIONAL, 2013, 24 : S90 - S90
  • [47] Clustering sparse binary data with hierarchical Bayesian Bernoulli mixture model
    Ye, Mao
    Zhang, Peng
    Nie, Lizhen
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 123 : 32 - 49
  • [48] A hierarchical mixture model for clustering three-way data sets
    Vermunt, Jeroen K.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (11) : 5368 - 5376
  • [49] Recent studies on the pod analysis of "a vs. a" NDI data
    Safizadeh, MS
    Forsyth, DS
    Fahr, A
    REVIEW OF PROGRESS IN QUANTITATIVE NONDESTRUCTIVE EVALUATION, VOLS 22A AND 22B, 2003, 20 : 1846 - 1853
  • [50] Comprehensive vs. comprehensible classifiers in logical analysis of data
    Alexe, Gabriela
    Alexe, Sorin
    Hammer, Peter L.
    Kogan, Alexander
    DISCRETE APPLIED MATHEMATICS, 2008, 156 (06) : 870 - 882