Transformation-based Probabilistic Clustering with Supervision

被引:0
|
作者
Gopal, Siddharth [1 ]
Yang, Yiming [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the common problems with clustering is that the generated clusters often do not match user expectations. This paper proposes a novel probabilistic framework that exploits supervised information in a discriminative and transferable manner to generate better clustering of unlabeled data. The supervision is provided by revealing the cluster assignments for some subset of the ground truth clusters and is used to learn a transformation of the data such that labeled instances form well-separated clusters with respect to the given clustering objective. This estimated transformation function enables us to fold the remaining unlabeled data into a space where new clusters hopefully match user expectations. While our framework is general, in this paper, we focus on its application to Gaussian and von Mises-Fisher mixture models. Extensive testing on 23 data sets across several application domains revealed substantial improvement in performance over competing methods.
引用
收藏
页码:270 / 279
页数:10
相关论文
共 50 条
  • [1] Transformation-based estimation
    Feng, Zhenghui
    Wang, Tao
    Zhu, Lixing
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 78 : 186 - 205
  • [2] A transformation-based optimiser for Haskell
    Jones, SLP
    Santos, ALM
    SCIENCE OF COMPUTER PROGRAMMING, 1998, 32 (1-3) : 3 - 47
  • [3] Space Transformation-Based Interdependency Modelling for Probabilistic Load Flow Analysis of Power Systems
    李雪
    陈豪杰
    路攀
    杜大军
    JournalofDonghuaUniversity(EnglishEdition), 2016, 33 (05) : 734 - 739
  • [4] Transformation-based spatial join
    Song, JW
    Whang, KY
    Lee, YK
    Lee, MJ
    Kim, SW
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 15 - 26
  • [5] Speaker identification using multi-step clustering algorithm with transformation-based GMM
    Xu L.
    Tang Z.
    Automatic Control and Computer Sciences, 2007, 41 (04) : 224 - 231
  • [6] Improved transformation-based quantile regression
    Geraci, Marco
    Jones, M. C.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (01): : 118 - 132
  • [7] Analyzing transformation-based simulation metamodels
    Irizarry, MDL
    Kuhl, ME
    Lada, EK
    Subramanian, S
    Wilson, JR
    PROCEEDINGS OF THE 2000 WINTER SIMULATION CONFERENCE, VOLS 1 AND 2, 2000, : 773 - 781
  • [8] The complexity of transformation-based join enumeration
    Pellenkoft, A
    Galindo-Legaria, CA
    Kersten, M
    PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, 1997, : 306 - 315
  • [9] Unscented transformation-based probabilistic optimal power flow for modeling the effect of wind power generation
    Aien, Morteza
    Fotuhi-Firuzabad, Mahmood
    Aminifar, Farrokh
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (05) : 1284 - 1301
  • [10] Transformation-based assessment for C programs
    Li, Guangqiang
    Wu, Weimin
    Sun, Yinai
    Wang, Jing
    Lai, Tianwu
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 372 - 375