Clustering nominal and numerical data: A new distance concept for a hybrid genetic algorithm

被引:0
|
作者
Vermeulen-Jourdan, L [1 ]
Dhaenens, C [1 ]
Talbi, EG [1 ]
机构
[1] Univ Lille 1, LIFL, F-59655 Villeneuve Dascq, France
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As intrinsic structures, like the number of clusters, is, for real data, a major issue of the clustering problem, we propose, in this paper, CHyGA (Clustering Hybrid Genetic Algorithm) an hybrid genetic algorithm for clustering. CHyGA treats the clustering problem as an optimization problem and searches for an optimal number of clusters characterized by an optimal distribution of instances into the clusters. CHyGA introduces a new representation of solutions and uses dedicated operators, such as one iteration of K--mearis as a mutation operator. In order to deal with nominal data, we propose a new definition of the cluster center concept and demonstrate its properties. Experimental results on classical benchmarks are given.
引用
收藏
页码:220 / 229
页数:10
相关论文
共 50 条
  • [1] A New Clustering Algorithm On Nominal Data Sets
    Wang, Bin
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 605 - 610
  • [2] Introduce a New Algorithm for Data Clustering by Genetic Algorithm
    Vahidi, J.
    Mirpour, Saeed
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2014, 10 (02): : 144 - 156
  • [3] SpectralCAT: Categorical spectral clustering of numerical and nominal data
    David, Gil
    Averbuch, Amir
    PATTERN RECOGNITION, 2012, 45 (01) : 416 - 433
  • [4] A new data clustering algorithm based on critical distance methodology
    Kuwil, Farag Hamed
    Shaar, Fadi
    Topcu, Ahmet Ercan
    Murtagh, Fionn
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 129 : 296 - 310
  • [5] A new hybrid imperialist competitive algorithm on data clustering
    Niknam, Taher
    Fard, Elahe Taherian
    Ehrampoosh, Shervin
    Rousta, Alireza
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (03): : 293 - 315
  • [6] A new hybrid imperialist competitive algorithm on data clustering
    TAHER NIKNAM
    ELAHE TAHERIAN FARD
    SHERVIN EHRAMPOOSH
    ALIREZA ROUSTA
    Sadhana, 2011, 36 : 293 - 315
  • [7] A dynamical clustering algorithm for multi-nominal data
    Verde, R
    de Carvalho, FDT
    Lechevallier, Y
    DATA ANALYSIS, CLASSIFICATION, AND RELATED METHODS, 2000, : 387 - 393
  • [8] A Hybrid Data Clustering Using Firefly Algorithm Based Improved Genetic Algorithm
    Maheshwar
    Kaushik, Keshav
    Arora, Vikram
    SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 249 - 256
  • [9] Hybrid Algorithm to Data Clustering
    Gil, Miguel
    Ochoa, Alberto
    Zamarron, Antonio
    Carpio, Juan
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 678 - +
  • [10] Hybrid Genetic Clustering by Using FCM and Geodesic Distance for Complex Distributed Data
    Yang, Yongsheng
    Li, Gang
    Zhu, Yongsheng
    Zhang, Youyun
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2597 - +