Combining multiple weak clusterings

Authors
Topchy, A [1]
Jain, AK [1]
Punch, W [1]
Affiliation
[1] Michigan State Univ, Dept Comp Sci, E Lansing, MI 48824 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
A data set can be clustered in many ways, depending on the clustering algorithm employed, the parameter settings used, and other factors. Can multiple clusterings be combined so that the final partitioning of the data provides a better clustering? The answer depends on the quality of the clusterings to be combined as well as on the properties of the fusion method. First, we introduce a unified representation for multiple clusterings and formulate the corresponding categorical clustering problem. As a result, we show that the consensus function is related to the classical intra-class variance criterion using the generalized mutual information definition. Second, we show the efficacy of combining partitions generated by weak clustering algorithms that use data projections and random data splits. A simple explanatory model is offered for the behavior of combinations of such weak clustering components. We analyze the combination accuracy as a function of the parameters controlling the power and resolution of the component partitions, as well as the learning dynamics vs. the number of clusterings involved. Finally, some empirical studies compare the effectiveness of several consensus functions.
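The idea of combining weak partitions built from data projections can be illustrated with a small sketch. It is not the authors' method: the consensus step below uses a simple co-association (pairwise co-occurrence) matrix with a 0.5 threshold, swapped in for the mutual-information-based consensus function mentioned in the abstract. All function names (`kmeans_1d`, `weak_partition`, `co_association`, `consensus`), the synthetic two-blob data, and the parameter values are assumptions made for illustration.

```python
import random

def kmeans_1d(values, k, rng, iters=20):
    """Plain Lloyd's k-means on scalars; returns one label per value."""
    centers = rng.sample(values, k)
    labels = [0] * len(values)
    for _ in range(iters):
        # Assign each value to its nearest center.
        labels = [min(range(k), key=lambda c: (v - centers[c]) ** 2) for v in values]
        # Move each non-empty center to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centers[c] = sum(members) / len(members)
    return labels

def weak_partition(points, k, rng):
    """Weak clustering: project onto a random direction, then cluster in 1-D."""
    dim = len(points[0])
    direction = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    projected = [sum(x * d for x, d in zip(p, direction)) for p in points]
    return kmeans_1d(projected, k, rng)

def co_association(partitions, n):
    """Fraction of partitions in which each pair of points shares a cluster."""
    m = len(partitions)
    return [[sum(lab[i] == lab[j] for lab in partitions) / m for j in range(n)]
            for i in range(n)]

def consensus(co, threshold=0.5):
    """Link pairs whose co-association exceeds the threshold and
    return the connected components as the combined partition."""
    n = len(co)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and co[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

rng = random.Random(0)
# Synthetic data (assumed for illustration): two well-separated 2-D blobs.
points = [(rng.gauss(0, 0.3), rng.gauss(0, 0.3)) for _ in range(20)]
points += [(rng.gauss(5, 0.3), rng.gauss(5, 0.3)) for _ in range(20)]

partitions = [weak_partition(points, 2, rng) for _ in range(50)]
co = co_association(partitions, len(points))
final = consensus(co)
```

Any single random projection may separate the two groups poorly, but pairs from the same group co-occur in most partitions, so the thresholded co-association graph tends to recover the groups. The threshold and the number of partitions are tuning choices in this sketch, not values taken from the paper.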
Pages: 331-338
Number of pages: 8