Correlation Clustering Revisited: The "True" Cost of Error Minimization Problems

Cited by: 0
Authors
Ailon, Nir [1]
Liberty, Edo [2]
Affiliations
[1] Google Res, New York, NY 10030 USA
[2] Yale Univ, New Haven, CT USA
Source
Keywords
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Correlation Clustering, as defined by Bansal, Blum, and Chawla, is the problem of clustering a set of elements based on a possibly inconsistent binary similarity function between element pairs. Their setting is agnostic in the sense that a ground truth clustering is not assumed to exist, and the cost of a solution is computed against the input similarity function. This problem has been studied in theory and in practice and has subsequently been proven to be APX-hard. In this work we assume that there does exist an unknown correct clustering of the data. In this setting, we argue that it is more reasonable to measure the output clustering's accuracy against the unknown underlying true clustering. We present two main results. The first is a novel method for continuously morphing a general (non-metric) function into a pseudometric. This technique may be useful for other metric embedding and clustering problems. The second is a simple algorithm for randomly rounding a pseudometric into a clustering. Combining the two, we obtain a certificate for the possibility of getting a solution of factor strictly less than 2 for our problem. This approximation coefficient could not have been achieved by considering the agnostic version of the problem unless P = NP.
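The difference between the two cost measures contrasted in the abstract can be illustrated with a small sketch. It is not taken from the paper: the function names `agnostic_cost` and `true_cost`, the toy data, and the choice to measure distance to the true clustering by counting disagreeing pairs are all assumptions made for illustration only.

```python
from itertools import combinations

def agnostic_cost(similar, clustering):
    """Count pairs on which the clustering disagrees with the input
    similarity function (assumed form of the agnostic cost)."""
    return sum(
        1
        for u, v in combinations(sorted(clustering), 2)
        if similar[frozenset((u, v))] != (clustering[u] == clustering[v])
    )

def true_cost(truth, clustering):
    """Count pairs on which the clustering disagrees with the ground-truth
    clustering (assumed pairwise-disagreement distance)."""
    return sum(
        1
        for u, v in combinations(sorted(clustering), 2)
        if (truth[u] == truth[v]) != (clustering[u] == clustering[v])
    )

# Hypothetical toy instance: the true clustering is {a, b} and {c}.
truth = {"a": 0, "b": 0, "c": 1}

# Inconsistent input similarity: the pair (b, c) is wrongly marked similar.
similar = {
    frozenset(("a", "b")): True,
    frozenset(("a", "c")): False,
    frozenset(("b", "c")): True,
}

# A candidate output that merges everything into one cluster.
merged = {"a": 0, "b": 0, "c": 0}

print(agnostic_cost(similar, truth), true_cost(truth, truth))    # 1 0
print(agnostic_cost(similar, merged), true_cost(truth, merged))  # 1 2
```

In this toy instance both candidate clusterings have the same cost against the noisy input, yet only one of them matches the underlying clustering, which is the kind of gap the abstract's argument for measuring against the true clustering refers to.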
Pages: 24 / +
Page count: 3
Related papers
50 in total
  • [1] Chromatic Correlation Clustering, Revisited
    Xiu, Qing
    Han, Kai
    Tang, Jing
    Cui, Shuang
    Huang, He
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] Minimization of certain cost functions with error bounds
    Drachman, B
    APPLIED NUMERICAL MATHEMATICS, 1997, 23 (03) : 375 - 379
  • [3] Superpixel Generation by Agglomerative Clustering With Quadratic Error Minimization
    Dong, Xiao
    Chen, Zhonggui
    Yao, Junfeng
    Guo, Xiaohu
    COMPUTER GRAPHICS FORUM, 2019, 38 (01) : 405 - 416
  • [4] Stability of Minimization Problems and the Error Bound Condition
    Balashov, Maxim V.
    SET-VALUED AND VARIATIONAL ANALYSIS, 2022, 30 (03) : 1061 - 1076
  • [5] Stability of Minimization Problems and the Error Bound Condition
    Maxim V. Balashov
    Set-Valued and Variational Analysis, 2022, 30 : 1061 - 1076
  • [6] Short Survey on Graph Correlation Clustering with Minimization Criteria
    Il'ev, Victor
    Il'eva, Svetlana
    Kononov, Alexander
    DISCRETE OPTIMIZATION AND OPERATIONS RESEARCH, DOOR 2016, 2016, 9869 : 25 - 36
  • [8] Classification-error cost minimization strategy: dCMS
    Parikh, Devi
    Chen, Tsuhan
    2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 620 - 624
  • [9] MINIMIZATION PROBLEMS FOR SOME NONCOERCIVE COST FUNCTIONALS
    MAZUMDAR, T
    MATEMATICA APLICADA E COMPUTACIONAL, 1984, 3 (03): : 265 - 280
  • [10] COST MINIMIZATION PROBLEMS TREATED BY GEOMETRIC MEANS
    DUFFIN, RJ
    OPERATIONS RESEARCH, 1962, 10 (05) : 668 - 675