Fairness-aware Data Integration

Cited by: 1
Authors
Mazilu, Lacramioara [1, 2]
Paton, Norman W. [1]
Konstantinou, Nikolaos [1]
Fernandes, Alvaro A. A. [1]
Affiliations
[1] Univ Manchester, Oxford Rd, Manchester M13 9PL, Lancs, England
[2] Peak AI Ltd, Charlotte St, Manchester M1 4ET, Lancs, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Data integration; data preparation; fairness; bias; classification
DOI
10.1145/3519419
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Machine learning can be applied in applications that make decisions that impact people's lives. Such techniques have the potential to make decision making more objective, but there is also a risk that the decisions discriminate against certain groups as a result of bias in the underlying data. Reducing bias, or promoting fairness, has been a focus of significant investigation in machine learning, for example by preprocessing the training data, changing the learning algorithm, or post-processing the results of the learning. However, prior to these activities, data integration discovers and integrates the data that is used for training, and data integration processes have the potential to produce data that leads to biased conclusions. In this article, we propose an approach that generates schema mappings in ways that take into account: (i) properties that are intrinsic to mapping results that may give rise to bias in analyses; and (ii) bias observed in classifiers trained on the results of different sets of mappings. The approach explores a space of different ways of integrating the data, using a Tabu search algorithm, guided by bias-aware objective functions that represent different types of bias. The resulting approach is evaluated using the Adult Census and German Credit datasets to explore the extent to which, and the circumstances in which, it can increase the fairness of the results of the data integration process.
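To make the search idea in the abstract concrete, the following is a minimal sketch of a Tabu search over subsets of candidate mappings. It is not the authors' implementation: the abstract does not specify the objective functions, so the sketch assumes a single intrinsic objective, the gap in positive-label rate between two protected groups in the integrated result. The candidate names (`m1` to `m4`), the toy rows, and the functions `parity_gap` and `tabu_search` are all hypothetical.

```python
from collections import deque

# Hypothetical candidate mappings: each yields rows of (protected_group, label).
CANDIDATES = {
    "m1": [("a", 1), ("a", 1), ("b", 0), ("b", 0)],
    "m2": [("a", 0), ("b", 1), ("b", 1), ("a", 0)],
    "m3": [("a", 1), ("b", 1), ("a", 0), ("b", 0)],
    "m4": [("a", 1), ("a", 1), ("a", 1), ("b", 0)],
}


def parity_gap(selection):
    """Assumed objective: absolute difference in positive-label rate
    between groups "a" and "b" over the union of the selected mappings."""
    rows = [row for name in selection for row in CANDIDATES[name]]
    rates = []
    for group in ("a", "b"):
        labels = [label for grp, label in rows if grp == group]
        if not labels:
            return 1.0  # treat a missing group as the worst case
        rates.append(sum(labels) / len(labels))
    return abs(rates[0] - rates[1])


def tabu_search(start, iterations=20, tenure=2):
    """Flip one mapping in or out per step, forbidding recently flipped ones."""
    current = frozenset(start)
    best, best_score = current, parity_gap(current)
    tabu = deque(maxlen=tenure)  # names of the most recently flipped mappings
    for _ in range(iterations):
        moves = []
        for name in sorted(CANDIDATES):
            neighbour = current ^ {name}  # toggle membership of one mapping
            if not neighbour:
                continue  # an empty integration result is not allowed
            score = parity_gap(neighbour)
            # Aspiration rule: a tabu move is allowed only if it beats the best.
            if name in tabu and score >= best_score:
                continue
            moves.append((score, name, neighbour))
        if not moves:
            break
        # Take the best admissible move, even if it is worse than the current state.
        score, name, current = min(moves, key=lambda m: (m[0], m[1]))
        tabu.append(name)
        if score < best_score:
            best, best_score = current, score
    return best, best_score


if __name__ == "__main__":
    selection, gap = tabu_search({"m1"})
    print(sorted(selection), gap)
```

Accepting the best admissible neighbour even when it is worse than the current state, while the tabu list blocks immediate reversals, is what lets this kind of search leave local optima. A classifier-based objective, as in item (ii) of the abstract, could be substituted for `parity_gap` without changing the search loop.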
Pages: 26