BIAS: A Toolbox for Benchmarking Structural Bias in the Continuous Domain

Cited by: 13

Authors:

Vermetten, Diederick [1]

van Stein, Bas [1]

Caraffini, Fabio [2]

Minku, Leandro L. [3]

Kononova, Anna V. [1]

Affiliations:

[1] Leiden Univ, Leiden Inst Adv Comp Sci, NL-2311 EZ Leiden, Netherlands

[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, England

[3] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England

Source:

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION | 2022 / Vol. 26 / No. 06

Keywords:

Evolutionary computation; optimization methods; statistical analysis; GOODNESS-OF-FIT TESTS; FALSE DISCOVERY RATE; NORMALITY; POWER

DOI:

10.1109/TEVC.2022.3189848

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Benchmarking heuristic algorithms is vital to understand under which conditions and on what kinds of problems certain algorithms perform well. Most benchmarks are performance based, testing algorithm performance under a wide range of conditions. There are also resource- and behavior-based benchmarks, which test the resource consumption and the behavior of algorithms. In this article, we propose a novel behavior-based benchmark toolbox: BIAS (Bias in algorithms, structural). This toolbox can detect structural bias (SB) per dimension and across dimensions based on 39 statistical tests. Moreover, it predicts the type of SB using a random forest model. BIAS can be used to better understand and improve existing algorithms (by removing bias) as well as to test novel algorithms for SB in an early phase of development. Experiments with a large set of generated SB scenarios show that BIAS was successful in identifying bias. In addition, we provide the results of BIAS on 432 existing state-of-the-art optimization algorithms, showing that different kinds of SB are present in these algorithms, mostly toward the center of the objective space or in the form of discretization behavior. The proposed toolbox is available as open source, and recommendations are provided for the sample size and hyper-parameters to use when applying the toolbox to other algorithms.
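
The per-dimension detection idea in the abstract can be illustrated with a minimal sketch. It is not the BIAS toolbox and does not use its API. It assumes a setup common in the structural-bias literature (not stated in the abstract): the optimizer is run many times on a random objective whose value ignores the input, so the final positions should be uniform in every dimension if the optimizer has no bias. A single hand-written Kolmogorov-Smirnov test stands in for the 39 tests of the toolbox, and no multiple-testing correction is applied. All function names, the planted center bias, and the parameter values (600 runs, 3 dimensions, budget 50, alpha 0.01) are hypothetical choices for illustration.

```python
import math
import random


def f0(_x, rng):
    """Random objective (assumed setup): the value ignores x entirely."""
    return rng.random()


def uniform_candidate(rng, dim):
    """Unbiased sampling step: each coordinate is uniform on [0, 1)."""
    return [rng.random() for _ in range(dim)]


def centered_candidate(rng, dim):
    """Planted center bias: each coordinate is the mean of three uniform draws."""
    return [(rng.random() + rng.random() + rng.random()) / 3.0 for _ in range(dim)]


def run_search(make_candidate, rng, dim, budget):
    """Toy optimizer: keep the candidate with the lowest f0 value."""
    best_x, best_f = None, math.inf
    for _ in range(budget):
        x = make_candidate(rng, dim)
        fx = f0(x, rng)
        if fx < best_f:
            best_x, best_f = x, fx
    return best_x


def ks_uniform_statistic(sample):
    """One-sample Kolmogorov-Smirnov statistic against U(0, 1)."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        # Compare the uniform CDF (x itself) with the empirical CDF
        # just after and just before the i-th order statistic.
        d = max(d, (i + 1) / n - x, x - i / n)
    return d


def ks_critical_value(n, alpha=0.01):
    """Asymptotic two-sided critical value: sqrt(-ln(alpha / 2) / (2 n))."""
    return math.sqrt(-math.log(alpha / 2.0) / (2.0 * n))


def per_dimension_report(make_candidate, runs=600, dim=3, budget=50, seed=1):
    """Return (KS statistic, rejected?) for each dimension of the final positions."""
    rng = random.Random(seed)
    finals = [run_search(make_candidate, rng, dim, budget) for _ in range(runs)]
    threshold = ks_critical_value(runs)
    stats = [ks_uniform_statistic([p[d] for p in finals]) for d in range(dim)]
    return [(s, s > threshold) for s in stats]


if __name__ == "__main__":
    print("uniform sampler: ", per_dimension_report(uniform_candidate))
    print("centered sampler:", per_dimension_report(centered_candidate))
```

Because the objective carries no information, any non-uniformity in the final positions comes from the search procedure itself. A real analysis would combine many tests with a correction for multiple comparisons, as the keywords of the article suggest.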

Pages: 1380-1393

Page count: 14
