Vectorization of Bias in Machine Learning Algorithms

被引：1

作者：

Bekerman, Sophie ^{[1
]}

Chen, Eric ^{[1
]}

Lin, Lily ^{[2
]}

Nez, George D. Monta ^{[1
]}

机构：

[1] Harvey Mudd Coll, Dept Comp Sci, AMISTAD Lab, Claremont, CA 91711 USA

[2] Biola Univ, Dept Math & Comp Sci, La Mirada, CA 90639 USA

来源：

ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2 | 2022年

基金：

美国国家科学基金会;

关键词：

Inductive Bias; Algorithmic Bias; Vectorization; Algorithmic Search Framework;

D O I：

10.5220/0010845000003116

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We develop a method to measure and compare the inductive bias of classifications algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm's bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm's probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms' inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms' pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.

引用

页码：354 / 365

页数：12

共 50 条

[21] Addressing Bias in Machine Learning Algorithms: A Pilot Study on Emotion Recognition for Intelligent Systems
Howard, Ayanna
Zhang, Cha
Horvitz, Eric
2017 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2017,
[22] GRIL: A 2-parameter Persistence Based Vectorization for Machine Learning
Xin, Cheng
Mukherjee, Soham
Samaga, Shreyas N.
Dey, Tamal K.
TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221, 2023, 221
[23] Instrument Bias Correction With Machine Learning Algorithms: Application to Field-Portable Mass Spectrometry
Loose, B.
Short, R. T.
Toler, S.
FRONTIERS IN EARTH SCIENCE, 2020, 8
[24] Research on runoff process vectorization and integration of deep learning algorithms for flood forecasting
Liu, Chengshuai
Li, Wenzhong
Hu, Caihong
Xie, Tianning
Jiang, Yunqiu
Li, Runxi
Soomro, Shan-e-hyder
Xu, Yuanhao
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 362
[25] Assessing socioeconomic bias in machine learning algorithms in health care: a case study of the HOUSES index
Juhn, Young J.
Ryu, Euijung
Wi, Chung-Il
King, Katherine S.
Malik, Momin
Romero-Brufau, Santiago
Weng, Chunhua
Sohn, Sunghwan
Sharp, Richard R.
Halamka, John D.
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1142 - 1151
[26] Combinatorial algorithms in machine learning
Shaw, Peter
2018 FIRST IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES (AI4I 2018), 2018, : 127 - 128
[27] Fair Algorithms for Machine Learning
Kearns, Michael
EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 1 - 1
[28] Machine Learning Algorithms in Astronomy
Howard, E. M.
ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 245 - 248
[29] Validation of machine learning algorithms
Burzykowski, Tomasz
Geubbelmans, Melvin
Rousseau, Axel-Jan
Valkenborg, Dirk
AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2023, 164 (02) : 295 - 297
[30] ALGORITHMS, MACHINE LEARNING, AND COLLUSION
Schwalbe, Ulrich
JOURNAL OF COMPETITION LAW & ECONOMICS, 2018, 14 (04) : 568 - 607

← 1 2 3 4 5 →