Investigating Semi-Automatic Assessment of Data Sets Fairness by Means of Fuzzy Logic

Cited by: 1
Authors
Gallese, Chiara [1]
Scantamburlo, Teresa [2]
Manzoni, Luca [3]
Nobile, Marco S. [4, 5]
Affiliations
[1] Eindhoven Univ Technol, Dept Elect Engn, Eindhoven, Netherlands
[2] Ca Foscari Univ Venice, European Ctr Living Technol, Dept Environm Sci Informat & Stat, Venice, Italy
[3] Univ Trieste, Dept Math & Geosci, Trieste, Italy
[4] Ca Foscari Univ Venice, Dept Environm Sci Informat & Stat, Venice, Italy
[5] Eindhoven Univ Technol, Dept Ind Engn & Innovat Sci, Eindhoven, Netherlands
Keywords
Data Bias; Fairness; Trustworthy Artificial Intelligence; Fuzzy Logic
DOI
10.1109/CIBCB56990.2023.10264913
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification code
081203; 0835
Abstract
Research has shown how data sets convey social bias in AI systems, especially those based on machine learning. A biased data set is not representative of reality and might contribute to perpetuating societal biases within the model. To tackle this problem, it is important to understand how to avoid biases, errors, and unethical practices when creating data sets. In this work we offer a preliminary framework for the semi-automated evaluation of fairness in data sets, combining statistical information about the data with qualitative considerations. We address the issue of how much (un)fairness can be included in a data set used for machine learning research, focusing on classification problems. To provide guidance for the use of data sets in contexts of critical decision-making, such as health decisions, we identify six fundamental features (balance, numerosity, unevenness, compliance, quality, incompleteness) that could affect model fairness. We developed a rule-based approach based on fuzzy logic that combines these characteristics into a single score and enables a semi-automatic evaluation of a data set in algorithmic fairness research.
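The abstract names the six features and the use of fuzzy rules, but it does not give the membership functions, the rule base, or the defuzzification method. The following is therefore only a minimal sketch of how such a rule-based aggregation could look, not the authors' implementation. It assumes each feature has already been normalised to [0, 1], uses linear "low"/"high" memberships, min/max for AND/OR, and a zero-order Sugeno-style weighted average; the five rules and their output values are hypothetical and chosen purely for illustration.

```python
from typing import Dict

# The six features named in the abstract, each assumed normalised to [0, 1].
FEATURES = (
    "balance", "numerosity", "unevenness",
    "compliance", "quality", "incompleteness",
)


def high(x: float) -> float:
    """Degree to which x is 'high' (assumed linear membership, clipped to [0, 1])."""
    return min(1.0, max(0.0, x))


def low(x: float) -> float:
    """Degree to which x is 'low' (complement of 'high')."""
    return 1.0 - high(x)


def fairness_score(f: Dict[str, float]) -> float:
    """Combine the six features into one score in [0, 1] (1 = most favourable).

    Hypothetical rule base; AND is min, OR is max, and the result is the
    firing-strength-weighted average of crisp rule outputs.
    """
    missing = [name for name in FEATURES if name not in f]
    if missing:
        raise ValueError(f"missing features: {missing}")

    # Each entry is (firing strength, crisp consequent).
    rules = [
        # IF balance is high AND quality is high THEN score is 1.0
        (min(high(f["balance"]), high(f["quality"])), 1.0),
        # IF compliance is low THEN score is 0.0
        (low(f["compliance"]), 0.0),
        # IF incompleteness is high OR unevenness is high THEN score is 0.2
        (max(high(f["incompleteness"]), high(f["unevenness"])), 0.2),
        # IF numerosity is low THEN score is 0.3
        (low(f["numerosity"]), 0.3),
        # IF numerosity is high AND compliance is high THEN score is 0.8
        (min(high(f["numerosity"]), high(f["compliance"])), 0.8),
    ]

    total = sum(strength for strength, _ in rules)
    if total == 0.0:
        return 0.5  # no rule fired: fall back to a neutral score
    return sum(strength * out for strength, out in rules) / total


# Two made-up data set profiles (illustrative values, not measured data).
well_formed = {"balance": 1.0, "numerosity": 1.0, "unevenness": 0.0,
               "compliance": 1.0, "quality": 1.0, "incompleteness": 0.0}
problematic = {"balance": 0.0, "numerosity": 0.0, "unevenness": 1.0,
               "compliance": 0.0, "quality": 0.0, "incompleteness": 1.0}

print(round(fairness_score(well_formed), 3))   # 0.9
print(round(fairness_score(problematic), 3))   # 0.167
```

The "semi-automatic" aspect would enter through the inputs: features such as balance or incompleteness can be computed from the data, whereas compliance and quality would presumably be scored by a human reviewer before being passed to the function.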
Pages: 106-115
Page count: 10