Vectorization of Bias in Machine Learning Algorithms

Cited by: 1

Authors:

Bekerman, Sophie [1]

Chen, Eric [1]

Lin, Lily [2]

Montañez, George D. [1]

Affiliations:

[1] Harvey Mudd Coll, Dept Comp Sci, AMISTAD Lab, Claremont, CA 91711 USA

[2] Biola Univ, Dept Math & Comp Sci, La Mirada, CA 90639 USA

Source:

ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2 | 2022

Funding:

U.S. National Science Foundation

Keywords:

Inductive Bias; Algorithmic Bias; Vectorization; Algorithmic Search Framework;

DOI:

10.5220/0010845000003116

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We develop a method to measure and compare the inductive bias of classification algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm's bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm's probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms' inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms' pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.
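The idea of an inductive orientation vector can be illustrated with a small sketch. This is not the authors' implementation: it assumes the vector is estimated empirically by retraining a learner on bootstrap resamples and counting which labeling of a holdout set it produces. The toy learners (`one_nn`, `majority`), the helper names, the toy dataset, and the use of Euclidean distance are all illustrative assumptions.

```python
import math
import random
from collections import Counter
from itertools import product


def one_nn(train):
    """Toy learner (assumption): 1-nearest-neighbor on 1-D points."""
    def predict(x):
        return min(train, key=lambda p: abs(p[0] - x))[1]
    return predict


def majority(train):
    """Toy learner (assumption): always predict the most common training label."""
    label = Counter(y for _, y in train).most_common(1)[0][0]
    return lambda x: label


def orientation_vector(learner, data, holdout, labels, n_rounds=200, seed=0):
    """Estimate a distribution over hypotheses (labelings of the holdout set).

    Hypothetical estimator: retrain on bootstrap resamples of `data` and
    record how often each holdout labeling is produced.
    """
    rng = random.Random(seed)
    hypotheses = list(product(labels, repeat=len(holdout)))
    counts = Counter()
    for _ in range(n_rounds):
        sample = [rng.choice(data) for _ in data]  # bootstrap resample
        model = learner(sample)
        counts[tuple(model(x) for x in holdout)] += 1
    return [counts[h] / n_rounds for h in hypotheses]


def distance(u, v):
    """Euclidean distance between two orientation vectors (one possible metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Illustrative data only: (feature, label) pairs and two holdout points.
data = [(0.0, 0), (1.0, 0), (2.0, 0), (3.0, 1), (4.0, 1)]
holdout = [0.5, 3.5]
v_nn = orientation_vector(one_nn, data, holdout, labels=(0, 1))
v_maj = orientation_vector(majority, data, holdout, labels=(0, 1))
print("1-NN vs. majority distance:", distance(v_nn, v_maj))
```

Repeating the distance computation for every pair of algorithms on each dataset in a benchmark suite, then collecting the results, would give a matrix in the spirit of the BDS matrix described above; how the paper aggregates across datasets is not stated in this record.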


Pages: 354-365

Page count: 12

Related records (50 in total):

[31] Algorithms for Interpretable Machine Learning
Rudin, Cynthia
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1519 - 1519
[32] Machine learning algorithms in sepsis
Agnello, Luisa
Vidali, Matteo
Padoan, Andrea
Lucis, Riccardo
Mancini, Alessio
Guerranti, Roberto
Plebani, Mario
Ciaccio, Marcello
Carobene, Anna
CLINICA CHIMICA ACTA, 2024, 553
[33] Genetic algorithms in machine learning
Giordana, A
Neri, F
AI COMMUNICATIONS, 1996, 9 (01) : 21 - 26
[34] Machine-learning media bias
D'Alonzo, Samantha
Tegmark, Max
PLOS ONE, 2022, 17 (08)
[35] Ethical Implications Of Bias In Machine Learning
Yapo, Adrienne
Weiss, Joseph
PROCEEDINGS OF THE 51ST ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2018, : 5365 - 5372
[36] Mitigating Racial Bias in Machine Learning
Kostick-Quenet, Kristin M.
Cohen, I. Glenn
Gerke, Sara
Lo, Bernard
Antaki, James
Movahedi, Faezah
Njah, Hasna
Schoen, Lauren
Estep, Jerry E.
Blumenthal-Barby, J. S.
JOURNAL OF LAW MEDICINE & ETHICS, 2022, 50 (01) : 92 - 100
[37] A Survey on Bias and Fairness in Machine Learning
Mehrabi, Ninareh
Morstatter, Fred
Saxena, Nripsuta
Lerman, Kristina
Galstyan, Aram
ACM COMPUTING SURVEYS, 2021, 54 (06)
[38] Simplicity Bias in Overparameterized Machine Learning
Berchenko, Yakir
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11052 - 11060
[39] Bias in Machine Learning: A Literature Review
Mavrogiorgos, Konstantinos
Kiourtis, Athanasios
Mavrogiorgou, Argyro
Menychtas, Andreas
Kyriazis, Dimosthenis
APPLIED SCIENCES-BASEL, 2024, 14 (19)
[40] Managing Bias in Machine Learning Projects
Fahse, Tobias
Huber, Viktoria
van Giffen, Benjamin
INNOVATION THROUGH INFORMATION SYSTEMS, VOL II: A COLLECTION OF LATEST RESEARCH ON TECHNOLOGY ISSUES, 2021, 47 : 94 - 109
