Universal Causal Evaluation Engine: An API for empirically evaluating causal inference models

被引:0
|
作者
Lin, Alexander [1 ]
Merchant, Amil [1 ]
Sarkar, Suproteem K. [1 ]
D'Amour, Alexander [2 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] Google Res, Menlo Pk, CA USA
关键词
Causal Inference; Software Engineering; Machine Learning; MATCHING METHODS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major driver in the success of predictive machine learning has been the "common task framework," where community-wide benchmarks are shared for evaluating new algorithms. This pattern, however, is difficult to implement for causal learning tasks because the ground truth in these tasks is in general unobservable. Instead, causal inference methods are often evaluated on synthetic or semi-synthetic datasets that incorporate idiosyncratic assumptions about the underlying data-generating process. These evaluations are often proposed in conjunction with new causal inference methods-as a result, many methods are evaluated on incomparable benchmarks. To address this issue, we establish an API for generalized causal inference model assessment, with the goal of developing a platform that lets researchers deploy and evaluate new model classes in instances where treatments are explicitly known. The API uses a common interface for each of its components, and it allows for new methods and datasets to be evaluated and saved for future benchmarking.
引用
收藏
页码:50 / 58
页数:9
相关论文
共 50 条