Study of clustering algorithm based on model data

被引:0
|
作者
Li, Kai [1 ]
Cui, Li-Juan [2 ]
机构
[1] HeBei Univ, Sch Math & Comp, Baoding 071002, Peoples R China
[2] HeBei Univ, Lib, Baoding 071002, Peoples R China
关键词
model clustering; measure space; validation of clustering; diversity;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering technique is a key tool in data mining and pattern recognition. Usually, objects for some traditional clustering algorithms are expressed in the form of vectors, which consist of some components to be described as features. However, objects in real tasks may be some models which are clustered other than data points, for example! neural networks, decision trees, support vector machines, etc. This paper studies the clustering algorithm based on model data. By defining the extended measure, clustering methods are studied for the abstract data objects. Framework of clustering algorithm for models is presented. To validate the effectiveness of models clustering algorithm, we choose the hierarchical model clustering algorithm in the experiments. Models in clustering algorithm are BP(Back Propagation) neural networks and learning method is BP algorithm. Measures are chosen as both same-fault measure and double-fault measure for pairwise of models. Distances between clusters are the single link and the complete link, respectively. By this way, we may obtain part of neural network models which are from each cluster and improve diversity of neural network models. Then, part of models is ensembled. Moreover, we also study the relations between the number of clusters in clustering analysis, the size of ensemble learning, and performance of ensemble learning by experiments. Experimental results show that performance of ensemble learning by choosing part of models using clustering of models is improved.
引用
收藏
页码:3961 / +
页数:2
相关论文
共 50 条
  • [31] A clustering algorithm based on coarsen graph model
    Chen, JB
    He, YL
    Song, HT
    CONCURRENT ENGINEERING: THE WORLDWIDE ENGINEERING GRID, PROCEEDINGS, 2004, : 285 - 289
  • [32] Automatic summarization model based on clustering algorithm
    Dai, Wenzhuo
    He, Qing
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [33] A Model Selection Algorithm For Mixture Model Clustering Of Heterogeneous Multivariate Data
    Erol, Hamza
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [34] Study on the model reduction for flexible structure based on clustering algorithm and DOFs concentration
    Li, Cheng-Tao
    Xiao, Yi-Qing
    Ou, Jin-Ping
    Jisuan Lixue Xuebao/Chinese Journal of Computational Mechanics, 2012, 29 (02): : 236 - 241
  • [35] Study on Music Emotion Recognition Based on the Machine Learning Model Clustering Algorithm
    Xia, Yu
    Xu, Fumei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [36] A novel data clustering algorithm based on modified gravitational search algorithm
    Han, XiaoHong
    Quan, Long
    Xiong, XiaoYan
    Almeter, Matt
    Xiang, Jie
    Lan, Yuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 61 : 1 - 7
  • [37] A Clustering Algorithm for Tumor Gene Data Based on Improved DPC Algorithm
    Wang W.
    Gao B.
    International Journal Bioautomation, 2022, 26 (02): : 175 - 192
  • [38] Massive Data Mining Algorithm for Web Text Based on Clustering Algorithm
    Luo, Nan-Chao
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (02) : 362 - 365
  • [39] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
    Meng, Yuyu
    Zheng, Liying
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
  • [40] A Similarity-Based Clustering Algorithm for Fuzzy Data
    Hung, Wen-Liang
    Yang, Miin-Shen
    2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,