Fair Algorithms for Hierarchical Agglomerative Clustering

Cited by: 6
Authors
Chhabra, Anshuman [1]
Mohapatra, Prasant [1]
Affiliations
[1] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
Keywords
Clustering; Hierarchical Agglomerative Clustering; Fairness in Clustering; VALIDATION; DEFAULT;
DOI
10.1109/ICMLA55696.2022.00036
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hierarchical Agglomerative Clustering (HAC) algorithms are extensively utilized in modern data science, and seek to partition the dataset into clusters while generating a hierarchical relationship between the data samples. HAC algorithms are employed in many applications, such as biology, natural language processing, and recommender systems. Thus, it is imperative to ensure that these algorithms are fair: even if the dataset contains biases against certain protected groups, the cluster outputs generated should not discriminate against samples from any of these groups. However, recent work in clustering fairness has mostly focused on center-based clustering algorithms, such as k-median and k-means clustering. In this paper, we propose fair algorithms for performing HAC that 1) enforce fairness constraints irrespective of the distance linkage criteria used, 2) generalize to any natural measures of clustering fairness for HAC, 3) work for multiple protected groups, and 4) have running times competitive with vanilla HAC. Through extensive experiments on multiple real-world UCI datasets, we show that our proposed algorithm finds fairer clusterings compared to vanilla HAC as well as the only other state-of-the-art fair HAC approach.
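To make the abstract's setting concrete, the following minimal sketch runs vanilla single-linkage HAC and then scores the result with a proportion-based fairness measure. It is not the authors' algorithm, which this record does not describe. The function names `single_linkage_hac` and `balance`, the choice of single linkage, and the particular fairness measure are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations
import math


def single_linkage_hac(points, k):
    """Vanilla HAC (illustrative): merge the two closest clusters until k remain."""
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > k:
        # Pick the pair of clusters with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda ab: linkage(clusters[ab[0]], clusters[ab[1]]),
        )
        merged = clusters[a] + clusters[b]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (a, b)]
        clusters.append(merged)
    return clusters


def balance(clusters, groups):
    """Assumed fairness measure for this sketch: the worst ratio between a
    group's share inside a cluster and its share in the whole dataset.
    1.0 means every cluster mirrors the dataset; 0.0 means some cluster
    contains no member of some group."""
    overall = Counter(groups)
    n = len(groups)
    worst = 1.0
    for members in clusters:
        counts = Counter(groups[i] for i in members)
        for g, total in overall.items():
            cluster_share = counts[g] / len(members)
            overall_share = total / n
            if cluster_share == 0:
                return 0.0
            worst = min(worst,
                        cluster_share / overall_share,
                        overall_share / cluster_share)
    return worst


points = [(0, 0), (0, 1), (5, 5), (5, 6)]
clusters = single_linkage_hac(points, k=2)
# Group labels aligned with geometry: each cluster holds a single group.
print(balance(clusters, ["a", "a", "b", "b"]))
# Group labels spread across both clusters: each cluster mirrors the dataset.
print(balance(clusters, ["a", "b", "a", "b"]))
```

In the first call the protected attribute coincides with the geometry, so distance-only merging yields clusters that each exclude one group. That is the kind of outcome a fairness-constrained HAC method is meant to avoid.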
Pages: 206-211
Number of pages: 6
Related papers
50 in total
  • [41] Semantic Clustering of Functional Requirements Using Agglomerative Hierarchical Clustering
    Salman, Hamzeh Eyal
    Hammad, Mustafa
    Seriai, Abdelhak-Djamel
    Al-Sbou, Ahed
    INFORMATION, 2018, 9 (09)
  • [42] Identifying daily water consumption patterns based on K-means Clustering, Agglomerative Hierarchical Clustering, and Spectral Clustering algorithms
    Guo, Hongyuan
    Liu, Xingpo
    Zhang, Qichen
    AQUA-WATER INFRASTRUCTURE ECOSYSTEMS AND SOCIETY, 2024, 73 (05) : 870 - 887
  • [43] Ward's Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward's Criterion?
    Murtagh, Fionn
    Legendre, Pierre
    JOURNAL OF CLASSIFICATION, 2014, 31 (03) : 274 - 295
  • [44] Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion?
    Fionn Murtagh
    Pierre Legendre
    Journal of Classification, 2014, 31 : 274 - 295
  • [45] Fair Algorithms for Clustering
    Bera, Suman K.
    Chakrabarty, Deeparnab
    Flores, Nicolas J.
    Negahbani, Maryam
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] An Evaluation of the Objective Clustering Inductive Technology Effectiveness Implemented Using Density-Based and Agglomerative Hierarchical Clustering Algorithms
    Babichev, Sergii
    Durnyak, Bohdan
    Pikh, Iryna
    Senkivskyy, Vsevolod
    LECTURE NOTES IN COMPUTATIONAL INTELLIGENCE AND DECISION MAKING, 2020, 1020 : 532 - 553
  • [47] Development of an efficient hierarchical clustering analysis using an agglomerative clustering algorithm
    Naeem, Arshia
    Rehman, Mariam
    Anjum, Maria
    Asif, Muhammad
    CURRENT SCIENCE, 2019, 117 (06) : 1045 - 1053
  • [48] Hierarchical Agglomerative Clustering of Time-Warped Series
    Kotas, Marian
    Leski, Jacek
    Moron, Tomasz
    Guzman, Jader Giraldo
    MAN-MACHINE INTERACTIONS 5, ICMMI 2017, 2018, 659 : 207 - 216
  • [49] Online Agglomerative Hierarchical Clustering of Neural Fiber Tracts
    Demir, Ali
    Mohamed, Ashraf
    Cetingul, H. Ertan
    2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013 : 85 - 88
  • [50] A Secure Distributed Framework for Agglomerative Hierarchical Clustering Construction
    Hamidi, Mona
    Sheikhalishahi, Mina
    Martinelli, Fabio
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018 : 430 - 435