JupyterLab in Retrograde: Contextual Notifications That Highlight Fairness and Bias Issues for Data Scientists

被引：0

作者：

Harrison, Galen ^{[1
,2
]}

Bryson, Kevin ^{[2
]}

Bamba, Ahmad Emmanuel Balla ^{[2
]}

Dovichi, Luca ^{[2
]}

Binion, Aleksander Herrmann ^{[2
]}

Borem, Arthur ^{[2
]}

Ur, Blase ^{[2
]}

机构：

[1] Univ Virginia, Charlottesville, VA 22903 USA

[2] Univ Chicago, Chicago, IL USA

来源：

PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024) | 2024年

基金：

美国国家科学基金会;

关键词：

fairness; data science; computational notebooks; Jupyter Notebook;

D O I：

10.1145/3613904.3642755

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current algorithmic fairness tools focus on auditing completed models, neglecting the potential downstream impacts of iterative decisions about cleaning data and training machine learning models. In response, we developed Retrograde, a JupyterLab environment extension for Python that generates real-time, contextual notifcations for data scientists about decisions they are making regarding protected classes, proxy variables, missing data, and demographic differences in model performance. Our novel framework uses automated code analysis to trace data provenance in JupyterLab, enabling these notifcations. In a between-subjects online experiment, 51 data scientists constructed loan-decision models with Retrograde providing notifcations continuously throughout the process, only at the end, or never. Retrograde's notifcations successfully nudged participants to account for missing data, avoid using protected classes as predictors, minimize demographic differences in model performance, and exhibit healthy skepticism about their models.

引用

页数：19