Study of clustering algorithm based on model data

Cited by: 0
Authors
Li, Kai [1]
Cui, Li-Juan [2]
Affiliations
[1] HeBei Univ, Sch Math & Comp, Baoding 071002, Peoples R China
[2] HeBei Univ, Lib, Baoding 071002, Peoples R China
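Keywords
model clustering; measure space; validation of clustering; diversity
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract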
Clustering is a key tool in data mining and pattern recognition. Traditional clustering algorithms usually represent objects as vectors whose components describe features. In real tasks, however, the objects to be clustered may be models rather than data points, for example neural networks, decision trees, or support vector machines. This paper studies clustering algorithms based on model data. By defining an extended measure, clustering methods are studied for abstract data objects, and a framework for clustering models is presented. To validate the effectiveness of model clustering, a hierarchical model clustering algorithm is used in the experiments. The models are BP (Back Propagation) neural networks trained with the BP algorithm. The same-fault measure and the double-fault measure are used as pairwise measures between models, and the single link and the complete link are used as distances between clusters. In this way, a subset of neural network models can be drawn from each cluster, which improves the diversity of the selected models; the selected models are then combined into an ensemble. The experiments also examine the relations among the number of clusters, the ensemble size, and ensemble performance. Experimental results show that selecting a subset of models through model clustering improves ensemble performance.
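The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each trained model is represented only by its predicted labels on a shared validation set, uses the common definitions of the double-fault measure (fraction of samples both models misclassify) and the same-fault measure (fraction where both give the same wrong label), and assumes the distance 1 - measure so that models with coincident errors fall into the same cluster. The function names, the rule of picking the most accurate model per cluster, and the plurality vote are all hypothetical choices made for illustration.

```python
from itertools import combinations


def double_fault(pred_a, pred_b, truth):
    """Fraction of samples that both models misclassify."""
    both_wrong = sum(a != t and b != t for a, b, t in zip(pred_a, pred_b, truth))
    return both_wrong / len(truth)


def same_fault(pred_a, pred_b, truth):
    """Fraction of samples where both models give the same wrong label."""
    same_wrong = sum(a == b and a != t for a, b, t in zip(pred_a, pred_b, truth))
    return same_wrong / len(truth)


def cluster_models(preds, truth, n_clusters, measure=double_fault, linkage=min):
    """Agglomerative clustering of models from their predictions.

    Assumption: distance = 1 - measure, so models that fail together are close.
    linkage=min gives the single link, linkage=max gives the complete link.
    """
    n = len(preds)
    dist = {
        (i, j): 1.0 - measure(preds[i], preds[j], truth)
        for i, j in combinations(range(n), 2)
    }
    clusters = [[i] for i in range(n)]

    def cluster_distance(c1, c2):
        return linkage(dist[min(i, j), max(i, j)] for i in c1 for j in c2)

    # Repeatedly merge the two closest clusters until n_clusters remain.
    while len(clusters) > n_clusters:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cluster_distance(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


def select_representatives(clusters, preds, truth):
    """Pick the most accurate model from each cluster (an assumed rule)."""
    def accuracy(i):
        return sum(p == t for p, t in zip(preds[i], truth)) / len(truth)
    return [max(c, key=accuracy) for c in clusters]


def majority_vote(preds, members):
    """Combine the selected models by plurality voting on each sample."""
    voted = []
    for column in zip(*(preds[i] for i in members)):
        # Ties are broken in favour of the smallest label.
        voted.append(max(sorted(set(column)), key=column.count))
    return voted
```

Calling `cluster_models(preds, truth, n_clusters=k, linkage=max)` switches from the single link to the complete link, and passing `measure=same_fault` swaps the pairwise measure. Varying `k` is one way to explore the relation between the number of clusters and the ensemble size that the abstract mentions.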
Pages: 3961 / +
Page count: 2