Entity connectivity vs. hierarchical levelling as a basis for data model clustering: An experimental analysis

被引:0
|
作者
Moody, DL [1 ]
机构
[1] Charles Univ Prague, Dept Software Engn, Prague, Czech Republic
[2] Monash Univ, Sch Business Syst, Melbourne, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data model clustering is the process of dividing large and complex models into: parts of manageable size, in order to improve understanding and simplify documentation and maintenance. Based on theories of human cognition,: a previous paper proposed;connectivity (defined as the number of relationships an entity participates in) as a basis for clustering data models. This paper describes a series of laboratory experiments which evaluate the validity of this metric as a basis for clustering compared to hierarchical levelling, which has been. the predominant approach used in previous research. The first two experiments investigate the relationship between the metrics and perceptions of importance, while the third experiment investigates their relationship to how people intuitively cluster entities. The results show that connectivity provides an empirically valid basis for clustering data models, which closely matches human perceptions of importance and "chunking" behaviour. No significant results were found for hierarchical level in any of the experiments. The high levels of statistical significance and effect size of the results for connectivity, together with their consistency across different domains and sample populations, suggests the possible discovery of a natural "law" governing data models.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 50 条
  • [31] A theoretical and experimental investigation of hetero- vs. homo-connectivity in barium silicates
    Moulton, Benjamin J. A.
    Gomes, Eduardo O.
    Cunha, Thiago R.
    Doerenkamp, Carsten
    Gracia, Lourdes
    Eckert, Hellmut
    Andres, Juan
    Pizani, Paulo S.
    AMERICAN MINERALOGIST, 2022, 107 (04) : 716 - 728
  • [32] Classification in high-dimensional spectral data: Accuracy vs. interpretability vs. model size
    Backhaus, Andreas
    Seiffert, Udo
    NEUROCOMPUTING, 2014, 131 : 15 - 22
  • [33] Presupposition Projection From 'and' vs. 'or': Experimental Data and Theoretical Implications
    Kalomoiros, Alexandros
    Schwarz, Florian
    JOURNAL OF SEMANTICS, 2024, 41 (3-4) : 331 - 372
  • [34] Data vs. information: Using clustering techniques to enhance stock returns forecasting
    Saenz, Javier Vasquez
    Quiroga, Facundo Manuel
    Bariviera, Aurelio F.
    INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2023, 88
  • [35] Experimental vs. simulation analysis of LoRa for vehicular communications
    Ortiz, Fernando M.
    de Almeida, Thales T.
    Ferreira, Ana E.
    Costa, Luis Henrique M. K.
    COMPUTER COMMUNICATIONS, 2020, 160 (160) : 299 - 310
  • [36] Piqueteros: An Experimental Analysis of Direct vs. Indirect Reciprocity
    Blanco, Mariana
    PEACE ECONOMICS PEACE SCIENCE AND PUBLIC POLICY, 2015, 21 (01) : 37 - 57
  • [37] Coal elemental (compositional) data analysis with hierarchical clustering algorithms
    Xu, Na
    Xu, Chuanpeng
    Finkelman, Robert B.
    Engle, Mark A.
    Li, Qing
    Peng, Mengmeng
    He, Lizhi
    Huang, Bin
    Yang, Yuchen
    INTERNATIONAL JOURNAL OF COAL GEOLOGY, 2022, 249
  • [38] A hierarchical model for compositional data analysis
    Brewer, MJ
    Filipe, JAN
    Elston, DA
    Dawson, LA
    Mayes, RW
    Soulsby, C
    Dunn, SM
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2005, 10 (01) : 19 - 34
  • [39] Echidna: Efficient clustering of hierarchical data for network traffic analysis
    Mahmood, Abdun Naser
    Leckie, Christopher
    Udaya, Parampalli
    NETWORKING 2006: NETWORKING TECHNOLOGIES, SERVICES, AND PROTOCOLS; PERFORMANCE OF COMPUTER AND COMMUNICATION NETWORKS; MOBILE AND WIRELESS COMMUNICATIONS SYSTEMS, 2006, 3976 : 1092 - 1098
  • [40] A multiseed non-hierarchical clustering technique for data analysis
    Chaudhuri, D.
    Chaudhuri, B.B.
    International Journal of Systems Science, 1995, 26 (02): : 375 - 385