Investigating Semi-Automatic Assessment of Data Sets Fairness by Means of Fuzzy Logic

被引:1
|
作者
Gallese, Chiara [1 ]
Scantamburlo, Teresa [2 ]
Manzoni, Luca [3 ]
Nobile, Marco S. [4 ,5 ]
机构
[1] Eindhoven Univ Technol, Dept Elect Engn, Eindhoven, Netherlands
[2] Ca Foscari Univ Venice, European Ctr Living Technol, Dept Environm Sci Informat & Stat, Venice, Italy
[3] Univ Trieste, Dept Math & Geosci, Trieste, Italy
[4] Ca Foscari Univ Venice, Dept Environm Sci Informat & Stat, Venice, Italy
[5] Eindhoven Univ Technol, Dept Ind Engn & Innovat Sci, Eindhoven, Netherlands
关键词
Data Bias; Fairness; Trustworthy Artificial Intelligence; Fuzzy Logic;
D O I
10.1109/CIBCB56990.2023.10264913
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Research has shown how data sets convey social bias in AI systems, especially those based on machine learning. A biased data set is not representative of reality and might contribute to perpetuate societal biases within the model. To tackle this problem, it is important to understand how to avoid biases, errors, and unethical practices while creating the data sets. In this work we offer a preliminary framework for the semi-automated evaluation of fairness in data sets, by combining statistical information about data with qualitative consideration. We address the issue of how much (un)fairness can be included in a data set used for machine learning research, focusing on classification issues. In order to provide guidance for the use of data sets in contexts of critical decision-making, such as health decisions, we identify six fundamental features (balance, numerosity, unevenness, compliance, quality, incompleteness) that could affect model fairness. We developed a rule-based approach based on fuzzy logic that combines these characteristics into a single score and enables a semi-automatic evaluation of a data set in algorithmic fairness research.
引用
收藏
页码:106 / 115
页数:10
相关论文
共 50 条
  • [1] Semi-automatic Quality Control of Topographic Data Sets
    Helmholz, Petra
    Becker, Christian
    Breitkopf, Uwe
    Bueschenfeld, Torsten
    Busch, Andreas
    Braun, Carola
    Gruenreich, Dietmar
    Mueller, Soenke
    Ostermann, Joern
    Pahl, Martin
    Rottensteiner, Franz
    Vogt, Karsten
    Ziems, Marcel
    Heipke, Christian
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2012, 78 (09): : 959 - 972
  • [2] Semi-automatic Objects Recognition Process Based on Fuzzy Logic
    Prandi, Federico
    Brumana, Raffaella
    PERSONAL SATELLITE SERVICES, 2010, 43 : 343 - 353
  • [3] ASSESSMENT ROLLS AND THE SEMI-AUTOMATIC VERIFICATION OF QUANTITATIVE DATA
    OLSEN, M
    LEBLANC, P
    HISTOIRE SOCIALE-SOCIAL HISTORY, 1988, 21 (41): : 137 - 143
  • [4] A semi-automatic technique for selecting sets of photos
    Shioya, Hiroka
    Itoh, Takayuki
    Hagita, Mariko
    PROCEEDINGS NICOGRAPH INTERNATIONAL 2016, 2016, : 141 - 141
  • [5] Semi-automatic Video Assessment System
    Martins, Pedro
    Correia, Nuno
    PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [6] SEMI-AUTOMATIC RECORDING OF WAVELENGTH DATA
    WERNER, GK
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA, 1953, 43 (07) : 620 - 621
  • [7] A Framework for Semi-automatic Data Integration
    Ceravolo, Paolo
    Cui, Zhan
    Damiani, Ernesto
    Gusmini, Alex
    Leida, Marcello
    ENTERPRISE INFORMATION SYSTEMS-B, 2009, 19 : 46 - +
  • [8] A framework for semi-automatic data integration
    Ceravolo, Paolo
    Cui, Zhan
    Damiani, Ernesto
    Gusmini, Alex
    Leida, Marcello
    Lecture Notes in Business Information Processing, 2009, 19 : 46 - 60
  • [9] Semi-automatic training sets acquisition for handwriting recognition
    Sas, Jerzy
    Markowska-Kaczmar, Urszula
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 531 - 538
  • [10] Semi-automatic development of test program sets (TPS)
    Berk, K
    Flann, N
    Howell, C
    Wille, K
    AUTOTESTCON 2003, PROCEEDINGS: FUTURE SUSTAINMENT FOR MILITARY AND AEROSPACE, 2003, : 217 - 225