Vectorization of Bias in Machine Learning Algorithms

被引:1
|
作者
Bekerman, Sophie [1 ]
Chen, Eric [1 ]
Lin, Lily [2 ]
Nez, George D. Monta [1 ]
机构
[1] Harvey Mudd Coll, Dept Comp Sci, AMISTAD Lab, Claremont, CA 91711 USA
[2] Biola Univ, Dept Math & Comp Sci, La Mirada, CA 90639 USA
来源
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2 | 2022年
基金
美国国家科学基金会;
关键词
Inductive Bias; Algorithmic Bias; Vectorization; Algorithmic Search Framework;
D O I
10.5220/0010845000003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a method to measure and compare the inductive bias of classifications algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm's bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm's probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms' inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms' pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [21] Addressing Bias in Machine Learning Algorithms: A Pilot Study on Emotion Recognition for Intelligent Systems
    Howard, Ayanna
    Zhang, Cha
    Horvitz, Eric
    2017 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2017,
  • [22] GRIL: A 2-parameter Persistence Based Vectorization for Machine Learning
    Xin, Cheng
    Mukherjee, Soham
    Samaga, Shreyas N.
    Dey, Tamal K.
    TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221, 2023, 221
  • [23] Instrument Bias Correction With Machine Learning Algorithms: Application to Field-Portable Mass Spectrometry
    Loose, B.
    Short, R. T.
    Toler, S.
    FRONTIERS IN EARTH SCIENCE, 2020, 8
  • [24] Research on runoff process vectorization and integration of deep learning algorithms for flood forecasting
    Liu, Chengshuai
    Li, Wenzhong
    Hu, Caihong
    Xie, Tianning
    Jiang, Yunqiu
    Li, Runxi
    Soomro, Shan-e-hyder
    Xu, Yuanhao
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 362
  • [25] Assessing socioeconomic bias in machine learning algorithms in health care: a case study of the HOUSES index
    Juhn, Young J.
    Ryu, Euijung
    Wi, Chung-Il
    King, Katherine S.
    Malik, Momin
    Romero-Brufau, Santiago
    Weng, Chunhua
    Sohn, Sunghwan
    Sharp, Richard R.
    Halamka, John D.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1142 - 1151
  • [26] Combinatorial algorithms in machine learning
    Shaw, Peter
    2018 FIRST IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES (AI4I 2018), 2018, : 127 - 128
  • [27] Fair Algorithms for Machine Learning
    Kearns, Michael
    EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 1 - 1
  • [28] Machine Learning Algorithms in Astronomy
    Howard, E. M.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 245 - 248
  • [29] Validation of machine learning algorithms
    Burzykowski, Tomasz
    Geubbelmans, Melvin
    Rousseau, Axel-Jan
    Valkenborg, Dirk
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2023, 164 (02) : 295 - 297
  • [30] ALGORITHMS, MACHINE LEARNING, AND COLLUSION
    Schwalbe, Ulrich
    JOURNAL OF COMPETITION LAW & ECONOMICS, 2018, 14 (04) : 568 - 607