Vectorization of Bias in Machine Learning Algorithms

被引:1
|
作者
Bekerman, Sophie [1 ]
Chen, Eric [1 ]
Lin, Lily [2 ]
Nez, George D. Monta [1 ]
机构
[1] Harvey Mudd Coll, Dept Comp Sci, AMISTAD Lab, Claremont, CA 91711 USA
[2] Biola Univ, Dept Math & Comp Sci, La Mirada, CA 90639 USA
基金
美国国家科学基金会;
关键词
Inductive Bias; Algorithmic Bias; Vectorization; Algorithmic Search Framework;
D O I
10.5220/0010845000003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a method to measure and compare the inductive bias of classifications algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm's bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm's probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms' inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms' pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [1] Detecting racial bias in algorithms and machine learning
    Lee, Nicol Turner
    JOURNAL OF INFORMATION COMMUNICATION & ETHICS IN SOCIETY, 2018, 16 (03): : 252 - 260
  • [2] Investigating anatomical bias in clinical machine learning algorithms
    Pedersen, Jannik Skyttegaard
    Laursen, Martin Sundahl
    Vinholt, Pernille Just
    Alnor, Anne Bryde
    Savarimuthu, Thiusius Rajeeth
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1398 - 1410
  • [3] Exploring Bias and Fairness in Artificial Intelligence and Machine Learning Algorithms
    Khakurel, Utsab
    Abdelmoumin, Ghada
    Bajracharya, Aakriti
    Rawat, Danda B.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [4] Bias, Fairness and Accountability with Artificial Intelligence and Machine Learning Algorithms
    Zhou, Nengfeng
    Zhang, Zach
    Nair, Vijayan N.
    Singhal, Harsh
    Chen, Jie
    INTERNATIONAL STATISTICAL REVIEW, 2022, 90 (03) : 468 - 480
  • [5] Reply to: ‘Potential sources of dataset bias complicate investigation of underdiagnosis by machine learning algorithms’ and ‘Confounding factors need to be accounted for in assessing bias by machine learning algorithms’
    Laleh Seyyed-Kalantari
    Haoran Zhang
    Matthew B. A. McDermott
    Irene Y. Chen
    Marzyeh Ghassemi
    Nature Medicine, 2022, 28 : 1161 - 1162
  • [6] Reply to: 'Potential sources of dataset bias complicate investigation of underdiagnosis by machine learning algorithms' and 'Confounding factors need to be accounted for in assessing bias by machine learning algorithms'
    Seyyed-Kalantari, Laleh
    Zhang, Haoran
    McDermott, Matthew B. A.
    Chen, Irene Y.
    Ghassemi, Marzyeh
    NATURE MEDICINE, 2022, 28 (06) : 1161 - +
  • [7] Using Machine Learning to Improve Automatic Vectorization
    Stock, Kevin
    Pouchet, Louis-Noel
    Sadayappan, P.
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2012, 8 (04)
  • [8] VECTORIZATION OF ALGORITHMS
    SOMMER, M
    SIEMENS FORSCHUNGS-UND ENTWICKLUNGSBERICHTE-SIEMENS RESEARCH AND DEVELOPMENT REPORTS, 1986, 15 (05): : 225 - 228
  • [9] Confounding factors need to be accounted for in assessing bias by machine learning algorithms
    Mukherjee, Pritam
    Shen, Thomas C.
    Liu, Jianfei
    Mathai, Tejas
    Shafaat, Omid
    Summers, Ronald M.
    NATURE MEDICINE, 2022, 28 (06) : 1159 - +
  • [10] Evaluation of Gender Bias in Facial Recognition with Traditional Machine Learning Algorithms
    Atay, Mustafa
    Gipson, Hailey
    Gwyn, Tony
    Roy, Kaushik
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,