BIAS: A Toolbox for Benchmarking Structural Bias in the Continuous Domain

被引:13
|
作者
Vermetten, Diederick [1 ]
van Stein, Bas [1 ]
Caraffini, Fabio [2 ]
Minku, Leandro L. L. [3 ]
Kononova, Anna V. V. [1 ]
机构
[1] Leiden Univ, Leiden Inst Adv Comp Sci, NL-2311 EZ Leiden, Netherlands
[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, England
[3] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Evolutionary computation; optimization methods; statistical analysis; OF-FIT TESTS; FALSE DISCOVERY RATE; NORMALITY; POWER;
D O I
10.1109/TEVC.2022.3189848
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Benchmarking heuristic algorithms is vital to understand under which conditions and on what kind of problems certain algorithms perform well. Most benchmarks are performance based, to test algorithm performance under a wide set of conditions. There is also resource- and behavior-based benchmarks to test the resource consumption and the behavior of algorithms. In this article, we propose a novel behavior-based benchmark toolbox: BIAS (Bias in algorithms, structural). This toolbox can detect structural bias (SB) per dimension and across dimension-based on 39 statistical tests. Moreover, it predicts the type of SB using a random forest model. BIAS can be used to better understand and improve existing algorithms (removing bias) as well as to test novel algorithms for SB in an early phase of development. Experiments with a large set of generated SB scenarios show that BIAS was successful in identifying bias. In addition, we also provide the results of BIAS on 432 existing state-of-the-art optimization algorithms showing that different kinds of SB are present in these algorithms, mostly toward the center of the objective space or showing discretization behavior. The proposed toolbox is made available open-source and recommendations are provided for the sample size and hyper-parameters to be used when applying the toolbox on other algorithms.
引用
收藏
页码:1380 / 1393
页数:14
相关论文
共 50 条
  • [41] Regional bias when benchmarking services using customer satisfaction scores
    Brint, Andrew
    Fry, John
    TOTAL QUALITY MANAGEMENT & BUSINESS EXCELLENCE, 2021, 32 (3-4) : 344 - 358
  • [42] BIAS, BIAS, WHO DOESNT HAVE THE BIAS
    HARCUM, ER
    ROSEN, E
    CONTEMPORARY PSYCHOLOGY, 1995, 40 (06): : 607 - 607
  • [43] Benchmarking the Effect of Poisoning Defenses on the Security and Bias of Deep Learning Models
    Baracaldo, Nathalie
    Ahmed, Farhan
    Eykholt, Kevin
    Zhou, Yi
    Priya, Shriti
    Lee, Taesung
    Kadhe, Swanand
    Tan, Mike
    Polavaram, Sridevi
    Suggs, Sterling
    Gao, Yuyang
    Slater, David
    2023 IEEE SECURITY AND PRIVACY WORKSHOPS, SPW, 2023, : 45 - 56
  • [44] BURFEL BIAS AND MCNP BENCHMARKING BURFEL CALCULATIONS OF NRU LOOP FUEL
    Thai Sinh Nguyen
    Wang, Xiaolin
    CNL NUCLEAR REVIEW, 2020, 9 (01) : 45 - 55
  • [45] Investigating bias in docking screens with target, ligand and decoy benchmarking sets
    Huang, Niu
    Irwin, John J.
    Shoichet, Brian
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232 : 360 - 360
  • [46] D Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition
    Chouldechova, Alexandra
    Deng, Siqi
    Wang, Yongxin
    Xia, Wei
    Perona, Pietro
    COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 289 - 306
  • [47] Benchmarking Heart Rate Variability to Overcome Sex-Related Bias
    Pagani, Massimo
    Sala, Roberto
    Malacarne, Mara
    Lucini, Daniela
    SEX-SPECIFIC ANALYSIS OF CARDIOVASCULAR FUNCTION, 2018, 1065 : 191 - 205
  • [48] Fairea: A Model Behaviour Mutation Approach to Benchmarking Bias Mitigation Methods
    Hort, Max
    Zhang, Jie M.
    Sarro, Federica
    Harman, Mark
    PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 994 - 1006
  • [49] Hindsight bias is not a bias
    Hedden, Brian
    ANALYSIS, 2019, 79 (01) : 43 - 52
  • [50] Bias in naming bias
    John P. Gibson
    Nature Reviews Genetics, 2002, 3 (1) : 80 - 80