Correlation Clustering Revisited: The "True" Cost of Error Minimization Problems

Cited by: 0
Authors
Ailon, Nir [1 ]
Liberty, Edo [2 ]
Affiliations
[1] Google Res, New York, NY 10030 USA
[2] Yale Univ, New Haven, CT USA
Source
AUTOMATA, LANGUAGES AND PROGRAMMING, PT I | 2009 / Vol. 5555
Keywords
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Correlation Clustering, as defined by Bansal, Blum, and Chawla, is the problem of clustering a set of elements based on a possibly inconsistent binary similarity function between element pairs. Their setting is agnostic in the sense that a ground truth clustering is not assumed to exist, and the cost of a solution is computed against the input similarity function. This problem has been studied in theory and in practice and has subsequently been proven to be APX-Hard. In this work we assume that there does exist an unknown correct clustering of the data. In this setting, we argue that it is more reasonable to measure the output clustering's accuracy against the unknown underlying true clustering. We present two main results. The first is a novel method for continuously morphing a general (non-metric) function into a pseudometric. This technique may be useful for other metric embedding and clustering problems. The second is a simple algorithm for randomly rounding a pseudometric into a clustering. Combining the two, we obtain a certificate for the possibility of getting a solution of factor strictly less than 2 for our problem. This approximation coefficient could not have been achieved by considering the agnostic version of the problem unless P = NP.
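The distinction the abstract draws between the two cost measures can be illustrated with a minimal sketch. It assumes that cost is a count of disagreeing element pairs, which the abstract does not spell out; the function names `agnostic_cost` and `true_cost` and the toy data are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def agnostic_cost(labels, similar):
    """Pairs on which a clustering contradicts the input similarity function.

    labels:  list of cluster ids, one per element.
    similar: dict mapping a pair (i, j) with i < j to True (similar) or False.
    Assumption: cost is an unweighted count of contradicted pairs.
    """
    return sum(
        1 for (i, j), s in similar.items() if (labels[i] == labels[j]) != s
    )


def true_cost(labels, truth):
    """Pairs on which a clustering disagrees with the (normally unknown) truth."""
    return sum(
        1
        for i, j in combinations(range(len(labels)), 2)
        if (labels[i] == labels[j]) != (truth[i] == truth[j])
    )


# Toy instance: a ground-truth clustering and a similarity function derived
# from it, with one pair flipped to make the input inconsistent.
truth = [0, 0, 0, 1, 1]
similar = {
    (i, j): truth[i] == truth[j] for i, j in combinations(range(len(truth)), 2)
}
similar[(0, 3)] = True  # noise: elements 0 and 3 are reported as similar

recovered = [0, 0, 0, 1, 1]  # matches the truth exactly
merged = [0, 0, 0, 0, 0]     # puts everything in one cluster

print(agnostic_cost(recovered, similar), true_cost(recovered, truth))
print(agnostic_cost(merged, similar), true_cost(merged, truth))
```

In this toy instance the clustering that recovers the truth still pays a nonzero agnostic cost because of the flipped pair, while its cost against the true clustering is zero.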
Pages: 24 / +
Page count: 3
Related papers
50 in total
  • [41] Correlation Clustering with Same-Cluster Queries Bounded by Optimal Cost
    Saha, Barna
    Subramanian, Sanjay
    27TH ANNUAL EUROPEAN SYMPOSIUM ON ALGORITHMS (ESA 2019), 2019, 144
  • [42] Correlation clustering with same-cluster queries bounded by optimal cost
    Saha, Barna
    Subramanian, Sanjay
    Leibniz International Proceedings in Informatics, LIPIcs, 2019, 144
  • [43] Random-error minimization during cross-correlation of early-type spectra
    Verschueren, W
    David, W
    ASTRONOMY & ASTROPHYSICS SUPPLEMENT SERIES, 1999, 136 (03): : 591 - 601
  • [44] Dynamic Programming and Error Estimates for Stochastic Control Problems with Maximum Cost
    Olivier Bokanowski
    Athena Picarelli
    Hasnaa Zidani
    Applied Mathematics & Optimization, 2015, 71 : 125 - 163
  • [45] Dynamic Programming and Error Estimates for Stochastic Control Problems with Maximum Cost
    Bokanowski, Olivier
    Picarelli, Athena
    Zidani, Hasnaa
    APPLIED MATHEMATICS AND OPTIMIZATION, 2015, 71 (01): : 125 - 163
  • [46] n-MeRCI: A new Metric to Evaluate the Correlation Between Predictive Uncertainty and True Error
    Moukari, Michel
    Simon, Loic
    Picard, Sylvaine
    Jurie, Frederic
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5250 - 5255
  • [47] Variational grid adaptation based on the minimization of local truncation error: time-independent problems
    Lapenta, G
    JOURNAL OF COMPUTATIONAL PHYSICS, 2004, 193 (01) : 159 - 179
  • [48] Handling Correlated Rounding Error via Preclustering: A 1.73-approximation for Correlation Clustering
    Cohen-Addad, Vincent
    Lee, Euiwoong
    Li, Shi
    Newman, Alantha
    2023 IEEE 64TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, FOCS, 2023, : 1082 - 1104
  • [49] Improved Common Correlation Matrix Based SMI Algorithm by Channel Estimation Error Minimization with LMS Approach
    Akao, Takashi
    Taroda, Satoshi
    Maruta, Kazuki
    Ahn, Chang-Jun
    2017 20TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2017, : 63 - 67
  • [50] Measurement error and the correlation between positive and negative affect: Spearman (1904, 1907) revisited
    Mutch, C
    Tisak, J
    PSYCHOLOGICAL REPORTS, 2005, 96 (01) : 43 - 46