Probabilistic Fair Clustering

被引:0
|
作者
Esmaeili, Seyed A. [1 ]
Brubach, Brian [2 ]
Tsepenekas, Leonidas [1 ]
Dickerson, John P. [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Wellesley Coll, Dept Comp Sci, Wellesley, MA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In clustering problems, a central decision-maker is given a complete metric graph over vertices and must provide a clustering of vertices that minimizes some objective function. In fair clustering problems, vertices are endowed with a color (e.g., membership in a group), and the requirements of a valid clustering might also include the representation of colors in the solution. Prior work in fair clustering assumes complete knowledge of group membership. In this paper, we generalize this by assuming imperfect knowledge of group membership through probabilistic assignments, and present algorithms in this more general setting with approximation ratio guarantees. We also address the problem of "metric membership", where group membership has a notion of order and distance. Experiments are conducted using our proposed algorithms as well as baselines to validate our approach, and also surface nuanced concerns when group membership is not known deterministically.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Probabilistic D-clustering
    Ben-Israel, Adi
    Iyigun, Cem
    JOURNAL OF CLASSIFICATION, 2008, 25 (01) : 5 - 26
  • [42] Clustering Large Probabilistic Graphs
    Kollios, George
    Potamias, Michalis
    Terzi, Evimaria
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (02) : 325 - 336
  • [43] Probabilistic D-Clustering
    Adi Ben-Israel
    Cem Iyigun
    Journal of Classification, 2008, 25
  • [44] PROBABILISTIC VALIDATION APPROACH FOR CLUSTERING
    HAREVEN, M
    BRAILOVSKY, VL
    PATTERN RECOGNITION LETTERS, 1995, 16 (11) : 1189 - 1196
  • [45] Fair Termination for Parameterized Probabilistic Concurrent Systems
    Lengal, Ondrej
    Lin, Anthony W.
    Majumdar, Rupak
    Rummer, Philipp
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, TACAS 2017, PT I, 2017, 10205 : 499 - 517
  • [46] Fair queuing and other probabilistic allocation methods
    Moulin, H
    Stong, R
    MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (01) : 1 - 30
  • [47] Factor probabilistic distance clustering (FPDC): a new clustering method
    Tortora, Cristina
    Summa, Mireille Gettler
    Marino, Marina
    Palumbo, Francesco
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2016, 10 (04) : 441 - 464
  • [48] Factor probabilistic distance clustering (FPDC): a new clustering method
    Cristina Tortora
    Mireille Gettler Summa
    Marina Marino
    Francesco Palumbo
    Advances in Data Analysis and Classification, 2016, 10 : 441 - 464
  • [49] Deep Clustering based Fair Outlier Detection
    Song, Hanyu
    Li, Peizhao
    Liu, Hongfu
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1481 - 1489
  • [50] Fair Clustering via Equitable Group Representations
    Abbasi, Mohsen
    Bhaskara, Aditya
    Venkatasubramanian, Suresh
    PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 504 - 514