Double machine learning with gradient boosting and its application to the Big N audit quality effect

被引:135
作者
Yang, Jui-Chung
Chuang, Hui-Ching
Kuan, Chung-Ming [3 ]
机构
[1] Natl Tsing Hua Univ, Dept Econ, Hsinchu 30013, Taiwan
[2] Yuan Ze Univ, Coll Management, Taoyuan 32003, Taiwan
[3] Natl Taiwan Univ, Dept Finance, Taipei 10617, Taiwan
关键词
Audit quality; Average treatment effect; Big N effect; Double machine learning; Gradient boosting; Performance-matched discretionary accruals; CAUSAL INFERENCE; LEAST-SQUARES; REGRESSION; SELECTION;
D O I
10.1016/j.jeconom.2020.01.018
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper, we study the double machine learning (DML) approach of Chernozhukov et al. (2018) for estimating average treatment effect and apply this approach to examine the Big N audit quality effect in the accounting literature. This approach relies on machine learning methods and is suitable when a high dimensional nuisance function with many covariates is present in the model. This approach does not suffer from the "regularization bias'' when a learning method with a proper convergence rate is used. We demonstrate by simulations that, for the DML approach, the gradient boosting method is fairly robust and to be preferred to other methods, such as regression tree, random forest, support vector regression machine, and the conventional Nadaraya-Watson nonparametric estimator. We then apply the DML approach with gradient boosting to estimate the Big N effect. We find that Big N auditors have a positive effect on audit quality and that this effect is not only statistically significant but also economically important. We further show that, in contrast to the results of propensity score matching, our estimates of said effect are quite robust to the hyper-parameters in the gradient boosting algorithm. (C) 2020 Published by Elsevier B.V.
引用
收藏
页码:268 / 283
页数:16
相关论文
共 38 条
[1]  
Angrist JD, 2009, MOSTLY HARMLESS ECONOMETRICS: AN EMPIRICISTS COMPANION, P1
[2]  
[Anonymous], 1988, Parallel distributed processing1
[3]  
[Anonymous], 2016, HIGH DIMENSTIONAL L2
[4]   Approximate residual balancing: debiased inference of average treatment effects in high dimensions [J].
Athey, Susan ;
Imbens, Guido W. ;
Wager, Stefan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) :597-623
[5]   PROGRAM EVALUATION AND CAUSAL INFERENCE WITH HIGH-DIMENSIONAL DATA [J].
Belloni, A. ;
Chernozhukov, V. ;
Fernandez-Val, I. ;
Hansen, C. .
ECONOMETRICA, 2017, 85 (01) :233-298
[6]  
Belloni A, 2013, ECON SOC MONOGR, P245
[7]   Inference on Treatment Effects after Selection among High-Dimensional ControlsaEuro [J].
Belloni, Alexandre ;
Chernozhukov, Victor ;
Hansen, Christian .
REVIEW OF ECONOMIC STUDIES, 2014, 81 (02) :608-650
[8]   Least squares after model selection in high-dimensional sparse models [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
BERNOULLI, 2013, 19 (02) :521-547
[9]  
Biau G, 2012, J MACH LEARN RES, V13, P1063
[10]   INTERVENTION ANALYSIS WITH APPLICATIONS TO ECONOMIC AND ENVIRONMENTAL PROBLEMS [J].
BOX, GEP ;
TIAO, GC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (349) :70-79