A fuzzy SV-k-modes algorithm for clustering categorical data with set-valued attributes

Cited by: 10
Authors
Cao, Fuyuan [1]
Huang, Joshua Zhexue [2]
Liang, Jiye [1]
Affiliations
[1] Shanxi Univ, Sch Comp & Informat Technol, Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan 030006, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Categorical data; Set-valued attribute; Set-valued modes; Fuzzy k-modes; Fuzzy SV-k-modes;
DOI
10.1016/j.amc.2016.09.023
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
In this paper, we propose a fuzzy SV-k-modes algorithm that uses the fuzzy k-modes clustering process to cluster categorical data with set-valued attributes. In the proposed algorithm, we use the Jaccard coefficient to measure the dissimilarity between two objects and represent the center of a cluster with set-valued modes. A heuristic method for updating the cluster prototypes is developed for the fuzzy partition matrix. These extensions enable the fuzzy SV-k-modes algorithm to cluster categorical data with both single-valued and set-valued attributes, and the fuzzy k-modes algorithm is a special case of it. Experimental results on synthetic data sets and three real data sets from different applications show the efficiency and effectiveness of the fuzzy SV-k-modes algorithm. (C) 2016 Elsevier Inc. All rights reserved.
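The two ingredients named in the abstract, a Jaccard-based dissimilarity and fuzzy memberships, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the dissimilarity between two objects is the sum of per-attribute Jaccard distances, and that memberships follow the standard fuzzy k-modes update with a fuzziness exponent `alpha`; the function names and the tie-breaking rule are hypothetical. The heuristic update of the set-valued modes is not described in the abstract and is not attempted here.

```python
from typing import FrozenSet, List, Sequence

# An object is a sequence of attributes; each attribute value is a set of categories.
SetValuedObject = Sequence[FrozenSet[str]]


def jaccard_dissimilarity(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Return 1 - |a & b| / |a | b|; two empty sets are treated as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def object_dissimilarity(x: SetValuedObject, y: SetValuedObject) -> float:
    """Assumption: sum the per-attribute Jaccard dissimilarities."""
    if len(x) != len(y):
        raise ValueError("objects must have the same number of attributes")
    return sum(jaccard_dissimilarity(a, b) for a, b in zip(x, y))


def fuzzy_memberships(
    x: SetValuedObject,
    prototypes: Sequence[SetValuedObject],
    alpha: float = 1.5,
) -> List[float]:
    """Membership of x in each cluster, using the standard fuzzy k-modes rule."""
    if alpha <= 1.0:
        raise ValueError("alpha must be greater than 1")
    dists = [object_dissimilarity(x, q) for q in prototypes]
    zeros = [i for i, d in enumerate(dists) if d == 0.0]
    if zeros:
        # x coincides with a prototype: give it full membership there
        # (hypothetical tie-break: the first matching prototype wins).
        return [1.0 if i == zeros[0] else 0.0 for i in range(len(dists))]
    exponent = 1.0 / (alpha - 1.0)
    return [
        1.0 / sum((d_l / d_h) ** exponent for d_h in dists)
        for d_l in dists
    ]
```

With single-valued attributes (one-element sets), the Jaccard dissimilarity is 0 for equal values and 1 otherwise, which is the simple matching measure used by k-modes; this is consistent with the abstract's remark that fuzzy k-modes is a special case.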
Pages: 1 - 15
Page count: 15
Related papers
50 in total
  • [1] An Algorithm for Clustering Categorical Data With Set-Valued Features
    Cao, Fuyuan
    Huang, Joshua Zhexue
    Liang, Jiye
    Zhao, Xingwang
    Meng, Yinfeng
    Feng, Kai
    Qian, Yuhua
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4593 - 4606
  • [2] A fuzzy k-modes algorithm for clustering categorical data
    Huang, ZX
    Ng, MK
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1999, 7 (04) : 446 - 452
  • [3] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [4] Fuzzy K-prototypes algorithm for clustering mixed numeric and categorical valued data
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (08) : 1107 - 1119
  • [5] Algorithm for fuzzy clustering of mixed data with numeric and categorical attributes
    Ahmad, A
    Dey, L
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2005, 3816 : 561 - 572
  • [6] A Global K-modes Algorithm for Clustering Categorical Data
    Bai Tian
    Kulikowski, C. A.
    Gong Leiguang
    Yang Bin
    Huang Lan
    Zhou Chunguang
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (03) : 460 - 465
  • [7] A genetic k-modes algorithm for clustering categorical data
    Gan, GJ
    Yang, ZJ
    Wu, JH
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 195 - 202
  • [8] OSPA Barycenters for Clustering Set-Valued Data
    Baum, Marcus
    Balasingam, Balakumar
    Willett, Peter
    Hanebeck, Uwe D.
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015 : 1375 - 1381
  • [9] Fuzzy rough set model for set-valued data
    Dai, Jianhua
    Tian, Haowei
    FUZZY SETS AND SYSTEMS, 2013, 229 : 54 - 68
  • [10] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263