Clustering Mixed Numeric and Categorical Data With Cuckoo Search

Cited by: 14
Authors
Ji, Jinchao [1, 2, 3, 4]
Pang, Wei [5, 6]
Li, Zairong [7]
He, Fei [1, 2, 3, 4]
Feng, Guozhong [1, 2, 3]
Zhao, Xiaowei [1, 2, 3]
Affiliations
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun 130117, Jilin, Peoples R China
[2] Northeast Normal Univ, Inst Computat Biol, Changchun 130117, Jilin, Peoples R China
[3] Northeast Normal Univ, Key Lab Intelligent Informat Proc Jilin Univ, Changchun 130117, Jilin, Peoples R China
[4] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Jilin, Peoples R China
[5] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
[6] Shaanxi Key Lab Complex Syst Control & Intelligen, Xian 710048, Shaanxi, Peoples R China
[7] Northeast Normal Univ, Sch Media Sci, Changchun 130117, Jilin, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data clustering; cuckoo search; mixed data; numeric and categorical attributes; ALGORITHM
DOI
10.1109/ACCESS.2020.2973216
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering analysis, an important technique in data mining, aims to identify the natural groups or clusters of data objects in the attribute space. Data objects in real-world applications are commonly described by both numeric and categorical attributes. Partitional clustering algorithms designed for this type of mixed data are prone to getting trapped in local optima, whereas the cuckoo search approach is efficient at solving global optimization problems. In this research, we therefore propose CCS-K-Prototypes, a novel partitional Clustering algorithm based on Cuckoo Search and K-Prototypes, for clustering mixed numeric and categorical data. To deal with the different types of attributes, we develop a novel representation for candidate solutions and suggest two formulas that let the cuckoo search for potential solutions either around the existing solutions or in the entire attribute space. Finally, the performance of the proposed algorithm is assessed by a series of experiments on five benchmark datasets.
Pages: 30988-31003
Number of pages: 16
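The abstract does not give the objective that the search optimizes. As background only, the following is a minimal Python sketch of the dissimilarity commonly used by the classic K-Prototypes algorithm: squared Euclidean distance on numeric attributes plus a weighted count of categorical mismatches. It is not the authors' CCS-K-Prototypes method. The function names, the weight `gamma`, and the toy data are illustrative assumptions, not taken from the paper.

```python
from typing import List, Sequence, Tuple

# An object (or prototype) is a pair: (numeric attributes, categorical attributes).
MixedObject = Tuple[Sequence[float], Sequence[str]]


def mixed_dissimilarity(x: MixedObject, p: MixedObject, gamma: float = 1.0) -> float:
    """K-Prototypes-style dissimilarity (assumed form, not from the paper):
    squared Euclidean distance on the numeric part plus gamma times the
    number of mismatching categorical attributes."""
    x_num, x_cat = x
    p_num, p_cat = p
    numeric_part = sum((a - b) ** 2 for a, b in zip(x_num, p_num))
    categorical_part = sum(1 for a, b in zip(x_cat, p_cat) if a != b)
    return numeric_part + gamma * categorical_part


def clustering_cost(objects: List[MixedObject],
                    prototypes: List[MixedObject],
                    gamma: float = 1.0) -> float:
    """Total cost: each object contributes its dissimilarity to the nearest prototype."""
    return sum(
        min(mixed_dissimilarity(obj, proto, gamma) for proto in prototypes)
        for obj in objects
    )


# Hypothetical toy data: two numeric and two categorical attributes per object.
objects = [
    ((1.0, 2.0), ("red", "small")),
    ((1.5, 2.5), ("red", "large")),
    ((8.0, 9.0), ("blue", "large")),
]
prototypes = [
    ((1.0, 2.0), ("red", "small")),
    ((8.0, 9.0), ("blue", "large")),
]
cost = clustering_cost(objects, prototypes, gamma=0.5)
print(cost)
```

A cost of this kind depends on the initial prototypes, which is one reason an iterative partitional method can stop at a local optimum. A global search heuristic such as cuckoo search would evaluate many candidate prototype sets against a cost like this one.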