Statistical Inference for Fairness Auditing

被引:0
|
作者
Cherian, John J. [1 ]
Candes, Emmanuel J. [2 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Math & Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
algorithmic fairness; bootstrap; simultaneous inference; multiple testing; reproducing kernel Hilbert space; FALSE DISCOVERY RATE; BOOTSTRAP;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Before deploying a black-box model in high-stakes problems, it is important to evaluate the model's performance on sensitive subpopulations. For example, in a recidivism prediction task, we may wish to identify demographic groups for which our prediction model has unacceptably high false positive rates or certify that no such groups exist. In this paper, we frame this task, often referred to as "fairness auditing," in terms of multiple hypothesis testing. We show how the bootstrap can be used to simultaneously bound performance disparities over a collection of groups with statistical guarantees. Our methods can be used to flag subpopulations affected by model underperformance, and certify subpopulations for which the model performs adequately. Crucially, our audit is model-agnostic and applicable to nearly any performance metric or group fairness criterion. Our methods also accommodate extremely rich-even infinite-collections of subpopulations. Further, we generalize beyond subpopulations by showing how to assess performance over certain distribution shifts. We test the proposed methods on benchmark datasets in predictive inference and algorithmic fairness and find that our audits can provide interpretable and
引用
收藏
页数:49
相关论文
共 50 条
  • [1] AUDITING AND INFERENCE CONTROL IN STATISTICAL DATABASES
    CHIN, FY
    OZSOYOGLU, G
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1982, 8 (06) : 574 - 582
  • [2] Statistical inference on measures of lineup fairness
    Tredoux, CG
    LAW AND HUMAN BEHAVIOR, 1998, 22 (02) : 217 - 237
  • [3] Active Fairness Auditing
    Yan, Tom
    Zhang, Chicheng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness
    Kearns, Michael
    Neel, Seth
    Roth, Aaron
    Wu, Zhiwei Steven
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [5] Auditing Fairness under Unobserved Confounding
    Byun, Yewon
    Sam, Dylan
    Oberst, Michael
    Lipton, Zachary C.
    Wilder, Bryan
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [6] Causal Inference in Auditing: A Framework
    Srivastava, Rajendra P.
    Mock, Theodore J.
    Pincus, Karen V.
    Wright, Arnold M.
    AUDITING-A JOURNAL OF PRACTICE & THEORY, 2012, 31 (03): : 177 - 201
  • [7] INFERENCE AND AUDITING - THE STRINGER BOUND
    BICKEL, PJ
    INTERNATIONAL STATISTICAL REVIEW, 1992, 60 (02) : 197 - 209
  • [8] Inference auditing the XQuery of XML
    Department of Computer and Information Technology, Fudan University, Shanghai 200433, China
    Jisuanji Xuebao, 2006, 8 (1308-1317):
  • [9] AUDITING AND STATISTICAL SAMPLING
    TRUEBLOOD, RM
    JOURNAL OF ACCOUNTANCY, 1957, 103 (04): : 48 - 52
  • [10] STATISTICAL SAMPLING IN AUDITING
    SULLY, JM
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1974, 23 (01) : 71 - 80