Entity connectivity vs. hierarchical levelling as a basis for data model clustering: An experimental analysis

被引:0
|
作者
Moody, DL [1 ]
机构
[1] Charles Univ Prague, Dept Software Engn, Prague, Czech Republic
[2] Monash Univ, Sch Business Syst, Melbourne, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data model clustering is the process of dividing large and complex models into: parts of manageable size, in order to improve understanding and simplify documentation and maintenance. Based on theories of human cognition,: a previous paper proposed;connectivity (defined as the number of relationships an entity participates in) as a basis for clustering data models. This paper describes a series of laboratory experiments which evaluate the validity of this metric as a basis for clustering compared to hierarchical levelling, which has been. the predominant approach used in previous research. The first two experiments investigate the relationship between the metrics and perceptions of importance, while the third experiment investigates their relationship to how people intuitively cluster entities. The results show that connectivity provides an empirically valid basis for clustering data models, which closely matches human perceptions of importance and "chunking" behaviour. No significant results were found for hierarchical level in any of the experiments. The high levels of statistical significance and effect size of the results for connectivity, together with their consistency across different domains and sample populations, suggests the possible discovery of a natural "law" governing data models.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 50 条
  • [1] Entity connectivity vs. hierarchical levelling as a basis for data model clustering: An experimental analysis
    Moody, Daniel L.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2003, 2736 : 77 - 87
  • [2] Database clustering vs. flat file data clustering
    Ryu, TW
    Chung, H
    Chang, WY
    Hanadi, S
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 490 - 496
  • [3] ENTITY MODEL CLUSTERING - STRUCTURING A DATA MODEL BY ABSTRACTION
    FELDMAN, P
    MILLER, D
    COMPUTER JOURNAL, 1986, 29 (04): : 348 - 360
  • [4] Improve CRUD performance on hierarchical data Nested Interval model vs. nested set model
    Badders, Blake
    Olmsted, Aspen
    2017 12TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2017, : 370 - 371
  • [5] Hierarchical clustering to measure connectivity in fMRI resting-state data
    Cordes, D
    Haughton, V
    Carew, JD
    Arfanakis, K
    Maravilla, K
    MAGNETIC RESONANCE IMAGING, 2002, 20 (04) : 305 - 317
  • [6] Integrated data envelopment analysis: Linear vs. nonlinear model
    Mahdiloo, Mahdi
    Toloo, Mehdi
    Thach-Thao Duong
    Saen, Reza Farzipoor
    Tatham, Peter
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 268 (01) : 255 - 267
  • [7] Improve Query Performance On Hierarchical Data. Adjacency List Model Vs. Nested Set Model
    Gyorodi, Cornelia
    Gyorodi, Robert
    Moldovan-Duse, Romulus-Radu
    Pecherle, George
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (04) : 272 - 278
  • [8] A hierarchical co-clustering approach for entity exploration over Linked Data
    Zheng, Liang
    Qu, Yuzhong
    Qian, Xinqi
    Cheng, Gong
    KNOWLEDGE-BASED SYSTEMS, 2018, 141 : 200 - 210
  • [9] Experimental vs. computational system analysis
    DeGiorgi, VG
    Hogan, EA
    SIMULATION OF ELECTROCHEMICAL PROCESSES, 2005, 48 : 37 - 46
  • [10] Agglomerative hierarchical clustering for nonlinear data analysis
    Wattanachon, U
    Lursinsap, C
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 1420 - 1425