TREATS: Fairness-aware entity resolution over streaming data

被引:0
|
作者
Araujo, Tiago Brasileiro [1 ,2 ]
Efthymiou, Vasilis [3 ,4 ]
Christophides, Vassilis [5 ]
Pitoura, Evaggelia [6 ]
Stefanidis, Kostas [1 ]
机构
[1] Tampere Univ, Tampere, Finland
[2] Fed Inst Paraiba, Soledade, Brazil
[3] Harokopio Univ Athens, Athens, Greece
[4] FORTH ICS, Iraklion, Greece
[5] ENSEA, ETIS, Paris, France
[6] Univ Ioannina, Ioannina, Greece
关键词
Entity resolution; Streaming data; Fairness; Incremental processing; Distributed processing; Machine learning;
D O I
10.1016/j.is.2024.102506
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, the growing proliferation of information systems generates large volumes of data continuously, stemming from a variety of sources such as web platforms, social networks, and multiple devices. These data, often lacking a defined schema, require an initial process of consolidation and cleansing before analysis and knowledge extraction can occur. In this context, Entity Resolution (ER) plays a crucial role, facilitating the integration of knowledge bases and identifying similarities among entities from different sources. However, the traditional ER process is computationally expensive, and becomes more complicated in the streaming context where the data arrive continuously. Moreover, there is a lack of studies involving fairness and ER, which is related to the absence of discrimination or bias. In this sense, fairness criteria aim to mitigate the implications of data bias in ER systems, which requires more than just optimizing accuracy, as traditionally done. Considering this context, this work presents TREATS, a schema-agnostic and fairness-aware ER workflow developed for managing streaming data incrementally. The proposed fairness-aware ER framework tackles constraints across various groups of interest, presenting a resilient and equitable solution to the related challenges. Through experimental evaluation, the proposed techniques and heuristics are compared against state-of-the-art approaches over five real-world data source pairs, in which the results demonstrated significant improvements in terms of fairness, without degradation of effectiveness and efficiency measures in the streaming environment. In summary, our contributions aim to propel the ER field forward by providing a workflow that addresses both technical challenges and ethical concerns.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A survey on fairness-aware recommender systems
    Jin, Di
    Wang, Luzhi
    Zhang, He
    Zheng, Yizhen
    Ding, Weiping
    Xia, Feng
    Pan, Shirui
    INFORMATION FUSION, 2023, 100
  • [22] Towards Robust Fairness-aware Recommendation
    Yang, Hao
    Liu, Zhining
    Zhang, Zeyu
    Zhuang, Chenyi
    Chen, Xu
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 211 - 222
  • [23] FairCF: fairness-aware collaborative filtering
    Shao, Pengyang
    Wu, Le
    Chen, Lei
    Zhang, Kun
    Wang, Meng
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (12)
  • [24] FAE: A Fairness-Aware Ensemble Framework
    Iosifidis, Vasileios
    Fetahu, Besnik
    Ntoutsi, Eirini
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1375 - 1380
  • [25] Fairness-Aware Graph Filter Design
    Kose, O. Deniz
    Shen, Yanning
    Mateos, Gonzalo
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 330 - 334
  • [26] Fairness-Aware Structured Pruning in Transformers
    Zayed, Abdelrahman
    Mordido, Goncalo
    Shabanian, Samira
    Baldini, Ioana
    Chandar, Sarath
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22484 - 22492
  • [27] Predictive Policing: A Fairness-aware Approach
    Downey, Ava
    Islam, Sheikh Rabiul
    Sarker, Md Kamruzzman
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2024, 33 (03)
  • [28] FAIROD: Fairness-aware Outlier Detection
    Shekhar, Shubhranshu
    Shah, Neil
    Akoglu, Leman
    AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 210 - 220
  • [29] Fairness-Aware Unsupervised Feature Selection
    Xing, Xiaoying
    Liu, Hongfu
    Chen, Chen
    Li, Jundong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3548 - 3552
  • [30] Fairness-aware Graph Attention Networks
    Kose, O. Deniz
    Shen, Yanning
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 843 - 846