Fairness-aware Data Integration

被引:1
|
作者
Mazilu, Lacramioara [1 ,2 ]
Paton, Norman W. [1 ]
Konstantinou, Nikolaos [1 ]
Fernandes, Alvaro A. A. [1 ]
机构
[1] Univ Manchester, Oxford Rd, Manchester M13 9PL, Lancs, England
[2] Peak AI Ltd, Charlotte St, Manchester M1 4ET, Lancs, England
来源
基金
英国工程与自然科学研究理事会;
关键词
Data integration; data preparation; fairness; bias; CLASSIFICATION;
D O I
10.1145/3519419
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning can be applied in applications that take decisions that impact people's lives. Such techniques have the potential to make decision making more objective, but there also is a risk that the decisions can discriminate against certain groups as a result of bias in the underlying data. Reducing bias, or promoting fairness, has been a focus of significant investigation in machine learning, for example, based on preprocessing the training data, changing the learning algorithm, or post-processing the results of the learning. However, prior to these activities, data integration discovers and integrates the data that is used for training, and data integration processes have the potential to produce data that leads to biased conclusions. In this article, we propose an approach that generates schema mappings in ways that take into account: (i) properties that are intrinsic to mapping results that may give rise to bias in analyses; and (ii) bias observed in classifiers trained on the results of different sets of mappings. The approach explores a space of different ways of integrating the data, using a Tabu search algorithm, guided by bias-aware objective functions that represent different types of bias.The resulting approach is evaluated using Adult Census and German Credit datasets to explore the extent to which and the circumstances in which the approach can increase the fairness of the results of the data integration process.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Tailoring Data Source Distributions for Fairness-aware Data Integration
    Nargesian, Fatemeh
    Asudeh, Abolfazl
    Jagadish, H., V
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2519 - 2532
  • [2] Considerations on Fairness-aware Data Mining
    Kamishima, Toshihiro
    Akaho, Shotaro
    Asoh, Hideki
    Sakuma, Jun
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 378 - 385
  • [3] Empirical analysis of fairness-aware data segmentation
    Okura, Seiji
    Mohri, Takao
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 155 - 162
  • [4] Fairness-Aware Programming
    Albarghouthi, Aws
    Vinitsky, Samuel
    FAT*'19: PROCEEDINGS OF THE 2019 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2019, : 211 - 219
  • [5] Fairness-Aware PageRank
    Tsioutsiouliklis, Sotiris
    Pitoura, Evaggelia
    Tsaparas, Panayiotis
    Kleftakis, Ilias
    Mamoulis, Nikos
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3815 - 3826
  • [6] Fairness-Aware PAC Learning from Corrupted Data
    Konstantinov, Nikola
    Lampert, Christoph H.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [7] Fairness-Aware Range Queries for Selecting Unbiased Data
    Shetiya, Suraj
    Swift, Ian P.
    Asudeh, Abolfazl
    Das, Gautam
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1423 - 1436
  • [8] On the Impossibility of Fairness-Aware Learning from Corrupted Data
    Konstantinov, Nikola
    Lampert, Christoph H.
    ALGORITHMIC FAIRNESS THROUGH THE LENS OF CAUSALITY AND ROBUSTNESS WORKSHOP, VOL 171, 2021, 171 : 59 - 72
  • [9] Fairness-Aware PAC Learning from Corrupted Data
    Konstantinov, Nikola
    Lampert, Christoph H.
    Journal of Machine Learning Research, 2022, 23 : 1 - 60
  • [10] The Independence of Fairness-aware Classifiers
    Kamishima, Toshihiro
    Akaho, Shotaro
    Asoh, Hideki
    Sakuma, Jun
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 849 - 858