Local search genetic algorithm-based possibilistic weighted fuzzy c-means for clustering mixed numerical and categorical data

被引:0
|
作者
Thi Phuong Quyen Nguyen
R. J. Kuo
Minh Duc Le
Thi Cuc Nguyen
Thi Huynh Anh Le
机构
[1] The University of Danang–University of Science and Technology,Faculty of Project Management
[2] National Taiwan University of Science and Technology,Department of Industrial Management
[3] The University of Danang–University of Science and Technology,Faculty of Transportation Mechanical Engineering
来源
关键词
Local search genetic algorithm; Mixed data; Possibilistic fuzzy ; -means; Variable neighborhood search;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering for mixed numerical and categorical attributes has attracted many researchers due to its necessity in many real-world applications. One crucial issue concerned in clustering mixed data is to select an appropriate distance metric for each attribute type. Besides, some current clustering methods are sensitive to the initial solutions and easily trap into a locally optimal solution. Thus, this study proposes a local search genetic algorithm-based possibilistic weighted fuzzy c-means (LSGA-PWFCM) for clustering mixed numerical and categorical data. The possibilistic weighted fuzzy c-means (PWFCM) is firstly proposed in which the object-cluster similarity measure is employed to calculate the distance between two mixed-attribute objects. Besides, each attribute is placed a different important role by calculating its corresponding weight in the PWFCM procedure. Thereafter, GA is used to find a set of optimal parameters and the initial clustering centroids for the PFCM algorithm. To avoid local optimal solution, local search-based variable neighborhoods are embedded in the GA procedure. The proposed LSGA-PWFCM algorithm is compared with other benchmark algorithms based on some public datasets in UCI machine learning repository to evaluate its performance. Two clustering validation indices are used, i.e., clustering accuracy and Rand index. The experimental results show that the proposed LSGA-PWFCM outperforms other algorithms on most of the tested datasets.
引用
收藏
页码:18059 / 18074
页数:15
相关论文
共 50 条
  • [1] Local search genetic algorithm-based possibilistic weighted fuzzy c-means for clustering mixed numerical and categorical data
    Thi Phuong Quyen Nguyen
    Kuo, R. J.
    Minh Duc Le
    Thi Cuc Nguyen
    Thi Huynh Anh Le
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (20): : 18059 - 18074
  • [2] An intuitionistic fuzzy possibilistic C-means clustering based on genetic algorithm
    Shang, Ronghua
    Tian, Pingping
    Wen, Ailing
    Liu, Wenzhan
    Jiao, Lieheng
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 941 - 947
  • [3] A possibilistic fuzzy c-means clustering algorithm
    Pal, NR
    Pal, K
    Keller, JM
    Bezdek, JC
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2005, 13 (04) : 517 - 530
  • [4] Possibilistic and fuzzy c-means clustering with weighted objects
    Miyamoto, Sadaaki
    Inokuchi, Ryo
    Kuroda, Youhei
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 869 - +
  • [5] A Modified Possibilistic Fuzzy c-Means Clustering Algorithm
    Qu, Fuheng
    Hu, Yating
    Xue, Yaohong
    Yang, Yong
    2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 858 - 862
  • [6] A Possibilistic Multivariate Fuzzy c-Means Clustering Algorithm
    Himmelspach, Ludmila
    Conrad, Stefan
    SCALABLE UNCERTAINTY MANAGEMENT, SUM 2016, 2016, 9858 : 338 - 344
  • [7] A Weight Possibilistic Fuzzy C-Means Clustering Algorithm
    Chen, Jiashun
    Zhang, Hao
    Pi, Dechang
    Kantardzic, Mehmed
    Yin, Qi
    Liu, Xin
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [8] A gradient ascent algorithm based on possibilistic fuzzy C-Means for clustering noisy data
    Saberi, Hossein
    Sharbati, Reza
    Farzanegan, Behzad
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [9] A hybrid kernel-based possibilistic fuzzy c-means clustering and cuckoo search algorithm
    Viet Duc Do
    Long Thanh Ngo
    Dinh Sinh Mai
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 132 - 137
  • [10] Weighted Fuzzy C-Means Clustering Based on Double Coding Genetic Algorithm
    Chen, Duo
    Cui, Du-Wu
    Wang, Chao-Xue
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 622 - 633