Clustering with Partition Level Side Information

被引:27
|
作者
Liu, Hongfu [1 ]
Fu, Yun [1 ,2 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
[2] Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA
关键词
Clustering; Partition level side information; K-means; Utility function; ALGORITHMS;
D O I
10.1109/ICDM.2015.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Constrained clustering uses pre-given knowledge to improve the clustering performance. Among existing literature, researchers usually focus on Must-Link and Cannot-Link pairwise constraints. However, pairwise constraints not only disobey the way we make decisions, but also suffer from the vulnerability of noisy constraints and the order of constraints. In light of this, we use partition level side information instead of pairwise constraints to guide the process of clustering. Compared with pairwise constraints, partition level side information keeps the consistency within partial structure and avoids self-contradictory and the impact of constraints order. Generally speaking, only small part of the data instances are given labels by human workers, which are used to supervise the procedure of clustering. Inspired by the success of ensemble clustering, we aim to find a clustering solution which captures the intrinsic structure from the data itself, and agrees with the partition level side information as much as possible. Then we derive the objective function and equivalently transfer it into a K-meanlike optimization problem. Extensive experiments on several real-world datasets demonstrate the effectiveness and efficiency of our method compared to pairwise constrained clustering and consensus clustering, which verifies the superiority of partition level side information to pairwise constraints. Besides, our method has high robustness to noisy side information.
引用
收藏
页码:877 / 882
页数:6
相关论文
共 50 条
  • [31] A Multi-level Graph Partition Clustering Method of Vector Residential Area Polygon
    Jin, Cheng
    An, Xiaoya
    Chen, Zhanlong
    Ma, Xiaochuan
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2021, 46 (01): : 19 - 29
  • [32] Data clustering of road transportation information system based on attribute dimension partition and MapReduce
    Zheng, Xiao-Feng, 1600, South China University of Technology (42):
  • [33] Using semantic graphs in clustering process : Enhance information level
    Brunner, JS
    Berrien, I
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 221 - 227
  • [34] CATEGORY CLUSTERING AS A FUNCTION OF LEVEL OF INFORMATION AND NUMBER OF STIMULUS PRESENTATIONS
    HUDSON, RL
    JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1968, 7 (06): : 1106 - &
  • [35] Learning from Demonstrations with High-Level Side Information
    Wen, Min
    Papusha, Ivan
    Topcu, Ufuk
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3055 - 3061
  • [36] Use of word level side information to improve speech recognition
    Vergyri, D
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1823 - 1826
  • [37] HIERARCHICAL FUSION OF COLOR AND DEPTH INFORMATION AT PARTITION LEVEL BY COOPERATIVE REGION MERGING
    Calderero, Felipe
    Marques, Ferran
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 973 - 976
  • [38] EXPLOITING SIDE INFORMATION IN DISTANCE DEPENDENT CHINESE RESTAURANT PROCESSES FOR DATA CLUSTERING
    Li, Cheng
    Phung, Dinh
    Rana, Santu
    Venkatesh, Svetha
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [39] Robust Harmonic Fuzzy Partition Local Information C-Means Clustering for Image Segmentation
    Wu, Chengmao
    Zhou, Siyu
    SYMMETRY-BASEL, 2024, 16 (10):
  • [40] Partition Selection Approach for Hierarchical Clustering Based on Clustering Ensemble
    Vega-Pons, Sandro
    Ruiz-Shulcloper, Jose
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2010, 6419 : 525 - 532