Clustering Mixed Numeric and Categorical Data With Cuckoo Search

Cited by: 14
Authors
Ji, Jinchao [1, 2, 3, 4]
Pang, Wei [5, 6]
Li, Zairong [7]
He, Fei [1, 2, 3, 4]
Feng, Guozhong [1, 2, 3]
Zhao, Xiaowei [1, 2, 3]
Affiliations
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun 130117, Jilin, Peoples R China
[2] Northeast Normal Univ, Inst Computat Biol, Changchun 130117, Jilin, Peoples R China
[3] Northeast Normal Univ, Key Lab Intelligent Informat Proc Jilin Univ, Changchun 130117, Jilin, Peoples R China
[4] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Jilin, Peoples R China
[5] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
[6] Shaanxi Key Lab Complex Syst Control & Intelligen, Xian 710048, Shaanxi, Peoples R China
[7] Northeast Normal Univ, Sch Media Sci, Changchun 130117, Jilin, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data clustering; cuckoo search; mixed data; numeric and categorical attributes; ALGORITHM;
DOI
10.1109/ACCESS.2020.2973216
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering analysis, an important technique in data mining, aims to identify the natural groups or clusters of data objects in the attribute space. Data objects in real-world applications are commonly described by both numeric and categorical attributes. Partitional clustering algorithms designed for this type of mixed data are prone to becoming trapped in local optima, whereas the cuckoo search approach is efficient at solving global optimization problems. Motivated by this, we propose CCS-K-Prototypes, a novel partitional Clustering algorithm based on Cuckoo Search and K-Prototypes, for clustering mixed numeric and categorical data. To deal with the different types of attributes, we develop a novel representation for candidate solutions and propose two formulas that let the cuckoo search for potential solutions either around the existing solutions or across the entire attribute space. Finally, the performance of the proposed algorithm is assessed through a series of experiments on five benchmark datasets.
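The abstract does not reproduce the paper's formulas, so the following is only a minimal sketch of the general idea, not the authors' CCS-K-Prototypes implementation. It assumes the standard K-Prototypes dissimilarity (squared Euclidean distance on numeric attributes plus a weight gamma times the number of categorical mismatches) as the fitness of a candidate set of prototypes, and a Lévy-flight step generated with Mantegna's method for exploring around an existing solution. The function names, the parameters `gamma`, `alpha`, and `p_flip`, and the rule for resampling categorical values are all hypothetical choices made for illustration.

```python
import math
import random


def mixed_distance(obj, proto, numeric_idx, categorical_idx, gamma=1.0):
    """K-Prototypes-style dissimilarity between one object and one prototype."""
    numeric_part = sum((obj[i] - proto[i]) ** 2 for i in numeric_idx)
    categorical_part = sum(1 for i in categorical_idx if obj[i] != proto[i])
    return numeric_part + gamma * categorical_part


def clustering_cost(data, prototypes, numeric_idx, categorical_idx, gamma=1.0):
    """Fitness of a candidate solution: each object contributes its distance
    to the nearest prototype."""
    return sum(
        min(mixed_distance(x, p, numeric_idx, categorical_idx, gamma)
            for p in prototypes)
        for x in data
    )


def levy_step(rng, beta=1.5):
    """One heavy-tailed step drawn with Mantegna's method."""
    sigma_u = (
        math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
        / (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))
    ) ** (1 / beta)
    u = rng.gauss(0.0, sigma_u)
    v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1 / beta)


def perturb(proto, numeric_idx, categorical_idx, domains, rng,
            alpha=0.01, p_flip=0.25):
    """Hypothetical local move: shift numeric values by a scaled Levy step and
    resample each categorical value from its domain with probability p_flip."""
    new = list(proto)
    for i in numeric_idx:
        new[i] = proto[i] + alpha * levy_step(rng)
    for i in categorical_idx:
        if rng.random() < p_flip:
            new[i] = rng.choice(domains[i])
    return new


if __name__ == "__main__":
    # Toy data: two numeric attributes followed by one categorical attribute.
    data = [(1.0, 2.0, "a"), (1.2, 1.8, "a"), (8.0, 9.0, "b")]
    prototypes = [[1.0, 2.0, "a"], [8.0, 9.0, "b"]]
    rng = random.Random(0)
    candidate = [perturb(p, [0, 1], [2], {2: ["a", "b"]}, rng) for p in prototypes]
    print(clustering_cost(data, prototypes, [0, 1], [2]))
    print(clustering_cost(data, candidate, [0, 1], [2]))
```

In a cuckoo-search loop, a perturbed candidate would replace an existing solution only if its cost is lower, and a fraction of the worst solutions would be re-initialized across the whole attribute space. How the paper encodes solutions and defines its two search formulas is not stated in the abstract.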
Pages: 30988-31003
Page count: 16
Related papers
50 in total
  • [41] Bi-level clustering of mixed categorical and numerical biomedical data
    Andreopoulos, Bill
    An, Aijun
    Wang, Xiaogang
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2006, 1 (01) : 19 - 56
  • [42] Clustering Data of Mixed Categorical and Numerical Type With Unsupervised Feature Learning
    Lam, Dao
    Wei, Mingzhen
    Wunsch, Donald
    IEEE ACCESS, 2015, 3 : 1605 - 1613
  • [43] A divisive ordering algorithm for mapping categorical data to numeric data
    Kuo, HC
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2005, 3682 : 979 - 985
  • [44] Clustering using improved cuckoo search algorithm
    Zhao, Jie
    Lei, Xiujuan
    Wu, Zhenqiang
    Tan, Ying
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8794 : 479 - 488
  • [45] Clustering using Levy Flight Cuckoo Search
    Senthilnath, J.
    Das, Vipul
    Omkar, S. N.
    Mani, V.
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS (BIC-TA 2012), VOL 2, 2013, 202 : 65 - +
  • [46] Improved Cuckoo Search Algorithm for Document Clustering
    Boushaki, Saida Ishak
    Kamel, Nadjet
    Bendjeghaba, Omar
    COMPUTER SCIENCE AND ITS APPLICATIONS, CIIA 2015, 2015, 456 : 217 - 228
  • [47] Data Clustering based on Data Transformation and Hybrid Step Size-based Cuckoo Search
    Pandey, Avinash Chandra
    Rajpoot, Dharmveer Singh
    Saraswat, Mukesh
    2018 ELEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2018, : 227 - 232
  • [48] Cuckoo Search Algorithm for Clustering Food Offers
    Chifu, Viorica R.
    Salomie, Ioan
    St Chifu, Emil
    Izabella, Balla
    Pop, Cristina Bianca
    Antal, Marcel
    2014 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2014, : 17 - 22
  • [49] Clustering using Cuckoo Search Levy Flight
    Palaiah, Aishwarya
    Prabhu, Akshata H.
    Agrawal, Reetika
    Natarajan, S.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 567 - 572
  • [50] Clustering Using Improved Cuckoo Search Algorithm
    Zhao, Jie
    Lei, Xiujuan
    Wu, Zhenqiang
    Tan, Ying
    ADVANCES IN SWARM INTELLIGENCE, PT1, 2014, 8794 : 479 - 488