Vectorization of Bias in Machine Learning Algorithms

被引:1
|
作者
Bekerman, Sophie [1 ]
Chen, Eric [1 ]
Lin, Lily [2 ]
Nez, George D. Monta [1 ]
机构
[1] Harvey Mudd Coll, Dept Comp Sci, AMISTAD Lab, Claremont, CA 91711 USA
[2] Biola Univ, Dept Math & Comp Sci, La Mirada, CA 90639 USA
基金
美国国家科学基金会;
关键词
Inductive Bias; Algorithmic Bias; Vectorization; Algorithmic Search Framework;
D O I
10.5220/0010845000003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a method to measure and compare the inductive bias of classifications algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm's bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm's probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms' inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms' pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [31] Algorithms for Interpretable Machine Learning
    Rudin, Cynthia
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1519 - 1519
  • [32] Machine learning algorithms in sepsis
    Agnello, Luisa
    Vidali, Matteo
    Padoan, Andrea
    Lucis, Riccardo
    Mancini, Alessio
    Guerranti, Roberto
    Plebani, Mario
    Ciaccio, Marcello
    Carobene, Anna
    CLINICA CHIMICA ACTA, 2024, 553
  • [33] Genetic algorithms in machine learning
    Giordana, A
    Neri, F
    AI COMMUNICATIONS, 1996, 9 (01) : 21 - 26
  • [34] Machine-learning media bias
    D'Alonzo, Samantha
    Tegmark, Max
    PLOS ONE, 2022, 17 (08):
  • [35] Ethical Implications Of Bias In Machine Learning
    Yapo, Adrienne
    Weiss, Joseph
    PROCEEDINGS OF THE 51ST ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2018, : 5365 - 5372
  • [36] Mitigating Racial Bias in Machine Learning
    Kostick-Quenet, Kristin M.
    Cohen, I. Glenn
    Gerke, Sara
    Lo, Bernard
    Antaki, James
    Movahedi, Faezah
    Njah, Hasna
    Schoen, Lauren
    Estep, Jerry E.
    Blumenthal-Barby, J. S.
    JOURNAL OF LAW MEDICINE & ETHICS, 2022, 50 (01): : 92 - 100
  • [37] A Survey on Bias and Fairness in Machine Learning
    Mehrabi, Ninareh
    Morstatter, Fred
    Saxena, Nripsuta
    Lerman, Kristina
    Galstyan, Aram
    ACM COMPUTING SURVEYS, 2021, 54 (06)
  • [38] Simplicity Bias in Overparameterized Machine Learning
    Berchenko, Yakir
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11052 - 11060
  • [39] Bias in Machine Learning: A Literature Review
    Mavrogiorgos, Konstantinos
    Kiourtis, Athanasios
    Mavrogiorgou, Argyro
    Menychtas, Andreas
    Kyriazis, Dimosthenis
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [40] Managing Bias in Machine Learning Projects
    Fahse, Tobias
    Huber, Viktoria
    van Giffen, Benjamin
    INNOVATION THROUGH INFORMATION SYSTEMS, VOL II: A COLLECTION OF LATEST RESEARCH ON TECHNOLOGY ISSUES, 2021, 47 : 94 - 109