JupyterLab in Retrograde: Contextual Notifications That Highlight Fairness and Bias Issues for Data Scientists

被引:0
|
作者
Harrison, Galen [1 ,2 ]
Bryson, Kevin [2 ]
Bamba, Ahmad Emmanuel Balla [2 ]
Dovichi, Luca [2 ]
Binion, Aleksander Herrmann [2 ]
Borem, Arthur [2 ]
Ur, Blase [2 ]
机构
[1] Univ Virginia, Charlottesville, VA 22903 USA
[2] Univ Chicago, Chicago, IL USA
基金
美国国家科学基金会;
关键词
fairness; data science; computational notebooks; Jupyter Notebook;
D O I
10.1145/3613904.3642755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current algorithmic fairness tools focus on auditing completed models, neglecting the potential downstream impacts of iterative decisions about cleaning data and training machine learning models. In response, we developed Retrograde, a JupyterLab environment extension for Python that generates real-time, contextual notifcations for data scientists about decisions they are making regarding protected classes, proxy variables, missing data, and demographic differences in model performance. Our novel framework uses automated code analysis to trace data provenance in JupyterLab, enabling these notifcations. In a between-subjects online experiment, 51 data scientists constructed loan-decision models with Retrograde providing notifcations continuously throughout the process, only at the end, or never. Retrograde's notifcations successfully nudged participants to account for missing data, avoid using protected classes as predictors, minimize demographic differences in model performance, and exhibit healthy skepticism about their models.
引用
收藏
页数:19
相关论文
empty
未找到相关数据