A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes

被引:10
|
作者
Cao, Fuyuan [1 ]
Huang, Joshua Zhexue [2 ]
Liang, Jiye [1 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan 030006, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Categorical data; Set-valued attribute; Set-valued modes; Fuzzy k-modes; Fuzzy SV-k-modes;
D O I
10.1016/j.amc.2016.09.023
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we propose a fuzzy SV-k-modes algorithm that uses the fuzzy k-modes clustering process to cluster categorical data with set-valued attributes. In the proposed algorithm, we use Jaccard coefficient to measure the dissimilarity between two objects and represent the center of a cluster with set-valued modes. A heuristic update way of cluster prototype is developed for the fuzzy partition matrix. These extensions make the fuzzy SV-k-modes algorithm can cluster categorical data with single-valued and set-valued attributes together and the fuzzy k-modes algorithm is its special case. Experimental results on the synthetic data sets and the three real data sets from different applications have shown the efficiency and effectiveness of the fuzzy SV-k-modes algorithm. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] Genetic intuitionistic weighted fuzzy k-modes algorithm for categorical data
    Kuo, R. J.
    Thi Phuong Quyen Nguyen
    NEUROCOMPUTING, 2019, 330 : 116 - 126
  • [22] A fuzzy k-prototype clustering algorithm for mixed numeric and categorical data
    Ji, Jinchao
    Pang, Wei
    Zhou, Chunguang
    Han, Xiao
    Wang, Zhe
    KNOWLEDGE-BASED SYSTEMS, 2012, 30 : 129 - 135
  • [23] EGA-FMC: enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data
    Narasimhan, Medhini
    Balasubramanian, Balaji
    Kumar, Suryansh D.
    Patil, Nagamma
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2018, 11 (04) : 219 - 228
  • [24] Set-valued statistical algorithm for fuzzy membership based on the rough quotient set
    Fang, Hong-Wei
    Li, Chang-Hong
    Ji, Hong-Guang
    Fang, Ling-Ling
    Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2012, 34 (08): : 959 - 965
  • [25] Initialization of K-Modes Clustering for Categorical Data
    Li Tao-ying
    Chen Yan
    Jin Zhi-hong
    Li Ye
    2013 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING (ICMSE), 2013, : 107 - 112
  • [26] Fuzzy set-valued statistical inferences on a system operating data
    Guo, RK
    Love, E
    Advanced Reliability Modeling, 2004, : 165 - 172
  • [27] Multiobjective Genetic Algorithm-Based Fuzzy Clustering of Categorical Attributes
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    Bandyopadhyay, Sanghamitra
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2009, 13 (05) : 991 - 1005
  • [28] k-mw-modes: An algorithm for clustering categorical matrix-object data
    Cao, Fuyuan
    Yu, Liqin
    Huang, Joshua Zhexue
    Liang, Jiye
    APPLIED SOFT COMPUTING, 2017, 57 : 605 - 614
  • [29] An efficient k-modes algorithm for clustering categorical datasets
    Dorman, Karin S.
    Maitra, Ranjan
    STATISTICAL ANALYSIS AND DATA MINING, 2022, 15 (01) : 83 - 97
  • [30] Multiobjective clustering algorithm with fuzzy centroids for categorical data
    Zhou Z.
    Zhu S.
    Zhang D.
    1600, Science Press (53): : 2594 - 2606