Categorical data clustering: A correlation-based approach for unsupervised attribute weighting

Cited by: 2
Authors
Carbonera, Joel Luis [1]
Abel, Mara [1]
Affiliations
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
Keywords
clustering; subspace clustering; categorical data; attribute weighting; data mining; K-MEANS; ALGORITHM
DOI
10.1109/ICTAI.2014.46
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Interest in attribute weighting for clustering tasks has been increasing in recent years. However, few attempts have been made to apply automated attribute weighting to categorical data clustering. Most existing approaches compute the weights from the frequency of the mode category or from the average distance of data objects to the mode of a cluster. In this paper, we adopt a different approach and investigate how to use the correlation among categorical attributes to measure their relevance in clustering tasks. As a result, we propose a correlation-based attribute weighting approach for categorical attributes.
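The contrast drawn in the abstract between mode-frequency weighting and correlation-based weighting can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say which correlation measure is used, so the sketch assumes Cramér's V between attribute pairs, and the function names `mode_frequency_weights`, `cramers_v`, and `correlation_weights` are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def mode_frequency_weights(rows):
    """Weight each attribute by the relative frequency of its mode category."""
    n = len(rows)
    n_attrs = len(rows[0])
    return [
        Counter(row[j] for row in rows).most_common(1)[0][1] / n
        for j in range(n_attrs)
    ]


def cramers_v(xs, ys):
    """Cramér's V between two categorical columns (assumed correlation measure)."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    count_x, count_y = Counter(xs), Counter(ys)
    k = min(len(count_x), len(count_y))
    if k < 2:
        return 0.0  # a constant column carries no association
    chi2 = 0.0
    for a in count_x:
        for b in count_y:
            expected = count_x[a] * count_y[b] / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))


def correlation_weights(rows):
    """Weight each attribute by its summed Cramér's V with all other
    attributes, normalized so the weights sum to 1 (illustrative choice)."""
    n_attrs = len(rows[0])
    cols = [[row[j] for row in rows] for j in range(n_attrs)]
    totals = [0.0] * n_attrs
    for i, j in combinations(range(n_attrs), 2):
        v = cramers_v(cols[i], cols[j])
        totals[i] += v
        totals[j] += v
    total = sum(totals)
    if total == 0:
        return [1.0 / n_attrs] * n_attrs  # no correlation anywhere: uniform
    return [t / total for t in totals]


# Toy data (invented for illustration): color and shape vary together,
# while the third attribute is independent of both.
rows = [
    ("red", "round", "x"),
    ("red", "round", "y"),
    ("blue", "square", "x"),
    ("blue", "square", "y"),
]
freq_w = mode_frequency_weights(rows)
corr_w = correlation_weights(rows)
```

On this toy data the mode-frequency weights are identical for all three attributes, whereas the correlation-based weights separate the two mutually correlated attributes from the independent one, which is the kind of distinction the abstract motivates.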
Pages: 259-263
Number of pages: 5