Correlation Clustering Revisited: The "True" Cost of Error Minimization Problems

Cited by: 0
Authors
Ailon, Nir [1]
Liberty, Edo [2]
Affiliations
[1] Google Res, New York, NY 10030 USA
[2] Yale Univ, New Haven, CT USA
Source
AUTOMATA, LANGUAGES AND PROGRAMMING, PT I | 2009 / Vol. 5555
Keywords
DOI
Not available
Chinese Library Classification
TP31 [Computer Software];
Discipline Classification Code
081202; 0835;
Abstract
Correlation Clustering, as defined by Bansal, Blum, and Chawla, is the problem of clustering a set of elements based on a possibly inconsistent binary similarity function between element pairs. Their setting is agnostic in the sense that a ground truth clustering is not assumed to exist, and the cost of a solution is computed against the input similarity function. This problem has been studied in theory and in practice and has subsequently been proven to be APX-Hard. In this work we assume that there does exist an unknown correct clustering of the data. In this setting, we argue that it is more reasonable to measure the output clustering's accuracy against the unknown underlying true clustering. We present two main results. The first is a novel method for continuously morphing a general (non-metric) function into a pseudometric. This technique may be useful for other metric embedding and clustering problems. The second is a simple algorithm for randomly rounding a pseudometric into a clustering. Combining the two, we obtain a certificate for the possibility of getting a solution of factor strictly less than 2 for our problem. This approximation coefficient could not have been achieved by considering the agnostic version of the problem unless P = NP.
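The two cost notions contrasted in the abstract can be illustrated with a minimal Python sketch. It assumes both costs are counted as the number of element pairs in disagreement, which is the usual form for correlation clustering but is not spelled out in the abstract. The function names `agnostic_cost` and `pair_disagreements` and the toy instance are hypothetical, chosen only for illustration, and the sketch does not implement the paper's morphing or rounding steps.

```python
from itertools import combinations


def pair_disagreements(labels_a, labels_b):
    """Count pairs that one clustering joins and the other separates."""
    n = len(labels_a)
    return sum(
        (labels_a[i] == labels_a[j]) != (labels_b[i] == labels_b[j])
        for i, j in combinations(range(n), 2)
    )


def agnostic_cost(labels, similar):
    """Count pairs where the clustering contradicts the input similarity.

    similar[(i, j)] is True when the input marks i and j as similar (i < j).
    """
    n = len(labels)
    return sum(
        (labels[i] == labels[j]) != similar[(i, j)]
        for i, j in combinations(range(n), 2)
    )


# Hypothetical toy instance: a ground truth with two clusters of two elements.
truth = [0, 0, 1, 1]

# Similarity function derived from the truth, with one inconsistent answer.
similar = {(i, j): truth[i] == truth[j] for i, j in combinations(range(4), 2)}
similar[(1, 2)] = True  # noisy entry: elements 1 and 2 are wrongly marked similar

# A candidate output clustering to evaluate under both cost notions.
output = [0, 0, 0, 1]

cost_vs_input = agnostic_cost(output, similar)      # agnostic setting
cost_vs_truth = pair_disagreements(output, truth)   # setting argued for in this work
print(cost_vs_input, cost_vs_truth)
```

On this toy instance the two costs differ for the same output clustering, which is the distinction the abstract draws between scoring against the input and scoring against the underlying truth.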
Pages: 24 / +
Page count: 3