Local search genetic algorithm-based possibilistic weighted fuzzy c-means for clustering mixed numerical and categorical data

Cited by: 0
Authors
Thi Phuong Quyen Nguyen
R. J. Kuo
Minh Duc Le
Thi Cuc Nguyen
Thi Huynh Anh Le
Affiliations
[1] The University of Danang–University of Science and Technology, Faculty of Project Management
[2] National Taiwan University of Science and Technology, Department of Industrial Management
[3] The University of Danang–University of Science and Technology, Faculty of Transportation Mechanical Engineering
Source
Keywords
Local search genetic algorithm; Mixed data; Possibilistic fuzzy c-means; Variable neighborhood search
DOI
Not available
Abstract
Clustering data with mixed numerical and categorical attributes has attracted many researchers because many real-world applications require it. One crucial issue in clustering mixed data is selecting an appropriate distance metric for each attribute type. In addition, some current clustering methods are sensitive to the initial solution and are easily trapped in a locally optimal solution. This study therefore proposes a local search genetic algorithm-based possibilistic weighted fuzzy c-means (LSGA-PWFCM) for clustering mixed numerical and categorical data. The possibilistic weighted fuzzy c-means (PWFCM) is proposed first; it employs an object-cluster similarity measure to calculate the distance between two mixed-attribute objects. Each attribute is also assigned a different importance by calculating its corresponding weight in the PWFCM procedure. A genetic algorithm (GA) is then used to find a set of optimal parameters and the initial cluster centroids for the PWFCM algorithm. To avoid locally optimal solutions, a local search based on variable neighborhoods is embedded in the GA procedure. The proposed LSGA-PWFCM algorithm is compared with benchmark algorithms on public datasets from the UCI Machine Learning Repository. Two clustering validation indices are used: clustering accuracy and the Rand index. The experimental results show that LSGA-PWFCM outperforms the other algorithms on most of the tested datasets.
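The two validation indices named in the abstract can be illustrated with a short sketch. The abstract does not give the paper's exact definitions, so the code below assumes the common ones: the Rand index as the fraction of object pairs on which two partitions agree, and clustering accuracy as the best match rate over one-to-one mappings of cluster ids to class labels. The function names `rand_index` and `clustering_accuracy` are hypothetical, and the brute-force mapping search is only practical for a small number of clusters.

```python
from itertools import combinations, permutations


def rand_index(true_labels, pred_labels):
    """Fraction of object pairs on which the two partitions agree.

    A pair agrees when both partitions put the two objects together,
    or both put them apart.
    """
    pairs = list(combinations(range(len(true_labels)), 2))
    agree = sum(
        (true_labels[i] == true_labels[j]) == (pred_labels[i] == pred_labels[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def clustering_accuracy(true_labels, pred_labels):
    """Best accuracy over one-to-one mappings of cluster ids to class labels.

    Assumption: at most as many clusters as classes; the mappings are
    enumerated by brute force, which is fine only for a few clusters.
    """
    classes = sorted(set(true_labels))
    clusters = sorted(set(pred_labels))
    if len(clusters) > len(classes):
        raise ValueError("more clusters than classes")
    best_hits = 0
    for perm in permutations(classes, len(clusters)):
        mapping = dict(zip(clusters, perm))
        hits = sum(mapping[p] == t for t, p in zip(true_labels, pred_labels))
        best_hits = max(best_hits, hits)
    return best_hits / len(true_labels)


if __name__ == "__main__":
    # Toy labels, invented purely for illustration (not from the paper).
    truth = [0, 0, 0, 1, 1, 1]
    found = [1, 1, 0, 0, 0, 0]
    print("Rand index:", rand_index(truth, found))
    print("Accuracy:  ", clustering_accuracy(truth, found))
```

Both indices lie in [0, 1] and equal 1 when the predicted partition matches the ground truth up to a renaming of cluster ids.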
Pages: 18059-18074
Number of pages: 15
Related papers
50 in total
  • [21] Weighted possibilistic c-means clustering algorithms
    Schneider, A
    NINTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2000), VOLS 1 AND 2, 2000, : 176 - 180
  • [22] Similarity Based Fuzzy and Possibilistic c-means Algorithm
    Zhang, Chunhui
    Zhou, Yiming
    Martin, Trevor
    PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [23] Novel possibilistic fuzzy c-means clustering
    School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
    Unknown
    Tien Tzu Hsueh Pao, 2008, 10 (1996-2000):
  • [24] An Unsupervised Possibilistic C-Means Clustering Algorithm with Data Reduction
    Hu, Yating
    Qu, Fuheng
    Wen, Changji
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 29 - 33
  • [25] Generalised kernel weighted fuzzy C-means clustering algorithm with local information
    Memon, Kashif Hussain
    Lee, Dong-Ho
    FUZZY SETS AND SYSTEMS, 2018, 340 : 91 - 108
  • [26] Intuitionistic fuzzy c-means clustering algorithm based on a novel weighted proximity measure and genetic algorithm
    Hou, Wen-hui
    Wang, Yi-ting
    Wang, Jian-qiang
    Cheng, Peng-Fei
    Li, Lin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (03) : 859 - 875
  • [27] A Performance Study of Probabilistic Possibilistic Fuzzy C-Means Clustering Algorithm
    Vijaya, J.
    Syed, Hussian
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 431 - 442
  • [28] Interval-valued possibilistic fuzzy C-means clustering algorithm
    Ji, Zexuan
    Xia, Yong
    Sun, Quansen
    Cao, Guo
    FUZZY SETS AND SYSTEMS, 2014, 253 : 138 - 156
  • [29] Intuitionistic fuzzy c-means clustering algorithm based on a novel weighted proximity measure and genetic algorithm
    Wen-hui Hou
    Yi-ting Wang
    Jian-qiang Wang
    Peng-Fei Cheng
    Lin Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 859 - 875
  • [30] A weighted fuzzy c-means clustering model for fuzzy data
    D'Urso, P
    Giordani, P
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (06) : 1496 - 1523