Local search genetic algorithm-based possibilistic weighted fuzzy c-means for clustering mixed numerical and categorical data

Authors
Thi Phuong Quyen Nguyen
R. J. Kuo
Minh Duc Le
Thi Cuc Nguyen
Thi Huynh Anh Le
Affiliations
[1] The University of Danang–University of Science and Technology, Faculty of Project Management
[2] National Taiwan University of Science and Technology, Department of Industrial Management
[3] The University of Danang–University of Science and Technology, Faculty of Transportation Mechanical Engineering
Source
Keywords
Local search genetic algorithm; Mixed data; Possibilistic fuzzy c-means; Variable neighborhood search
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Clustering data with mixed numerical and categorical attributes has attracted many researchers because it is needed in many real-world applications. One crucial issue in clustering mixed data is selecting an appropriate distance metric for each attribute type. In addition, some current clustering methods are sensitive to the initial solution and are easily trapped in a local optimum. This study therefore proposes a local search genetic algorithm-based possibilistic weighted fuzzy c-means (LSGA-PWFCM) for clustering mixed numerical and categorical data. First, the possibilistic weighted fuzzy c-means (PWFCM) is proposed, in which an object-cluster similarity measure is used to calculate the distance between two mixed-attribute objects. In addition, each attribute is assigned a different level of importance by calculating its corresponding weight in the PWFCM procedure. Thereafter, a genetic algorithm (GA) is used to find a set of optimal parameters and the initial cluster centroids for the PFCM algorithm. To avoid locally optimal solutions, a local search based on variable neighborhoods is embedded in the GA procedure. The proposed LSGA-PWFCM algorithm is compared with benchmark algorithms on public datasets from the UCI Machine Learning Repository to evaluate its performance. Two clustering validation indices are used: clustering accuracy and the Rand index. The experimental results show that the proposed LSGA-PWFCM outperforms the other algorithms on most of the tested datasets.
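As an illustration of one of the two validation indices named in the abstract, the Rand index can be sketched as the fraction of object pairs on which the predicted partition and the reference partition agree (both place the pair in the same cluster, or both separate it). The function name `rand_index` and its arguments are assumptions made for this sketch; the abstract does not describe the authors' implementation.

```python
from itertools import combinations


def rand_index(labels_true, labels_pred):
    """Return the Rand index of two partitions of the same objects.

    Hypothetical helper for illustration only: it counts the pairs of
    objects on which both labelings agree and divides by the number of pairs.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("label sequences must have the same length")
    n = len(labels_true)
    if n < 2:
        raise ValueError("need at least two objects to form a pair")

    agreeing_pairs = 0
    total_pairs = 0
    for i, j in combinations(range(n), 2):
        same_in_true = labels_true[i] == labels_true[j]
        same_in_pred = labels_pred[i] == labels_pred[j]
        # A pair agrees when both partitions group it or both split it.
        if same_in_true == same_in_pred:
            agreeing_pairs += 1
        total_pairs += 1
    return agreeing_pairs / total_pairs
```

Because only pairwise co-membership is compared, the index does not depend on how clusters are numbered: `rand_index([0, 0, 1, 1], [1, 1, 0, 0])` evaluates to `1.0`. The pairwise loop is quadratic in the number of objects, which is acceptable for a sketch but not for large datasets.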
Pages: 18059-18074
Page count: 15
Related papers
50 in total
  • [31] Suppressed possibilistic c-means clustering algorithm
    Yu, Haiyan
    Fan, Jiulun
    Lan, Rong
    APPLIED SOFT COMPUTING, 2019, 80 : 845 - 872
  • [32] An optimized SVM based possibilistic fuzzy c-means clustering algorithm for tumor segmentation
    Kollem, Sreedhar
    Reddy, Katta Ramalinga
    Rao, Duggirala Srinivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (01) : 409 - 437
  • [33] Fuzzy C-means clustering algorithm based on incomplete data
    Jia, Zhiping
    Yu, Zhiqiang
    Zhang, Chenghui
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 600 - 604
  • [35] Hausdorff distance measure based interval fuzzy possibilistic c-means clustering algorithm
    Jeng, J.-T. (tsong@nfu.edu.tw), 1600, Chinese Fuzzy Systems Association (15):
  • [36] A Weighted Fuzzy c-Means Clustering Algorithm for Incomplete Big Sensor Data
    Li, Peng
    Chen, Zhikui
    Hu, Yueming
    Leng, Yonglin
    Li, Qiucen
    WIRELESS SENSOR NETWORKS (CWSN 2017), 2018, 812 : 55 - 63
  • [37] A generalization of Possibilistic Fuzzy C-Means Method for Statistical Clustering of Data
    Azzouzi S.
    El-Mekkaoui J.
    Hjouji A.
    Khalfi A.E.L.
    International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 1766 - 1780
  • [38] On tolerant fuzzy c-means clustering and tolerant possibilistic clustering
    Hamasuna, Yukihiro
    Endo, Yasunori
    Miyamoto, Sadaaki
    SOFT COMPUTING, 2010, 14 (05) : 487 - 494
  • [39] A possibilistic C-means clustering algorithm based on kernel methods
    Wu, Xiao-Hong
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 2062 - 2066
  • [40] Retinal Vessel Segmentation based on Possibilistic Fuzzy c-means Clustering Optimised with Cuckoo Search
    Emary, Eid
    Zawbaa, Hossam M.
    Hassanien, Aboul Ella
    Schaefer, Gerald
    Azar, Ahmad Taher
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1792 - 1796