Studies in empirical software engineering are often most useful if they make causal claims because this allows practitioners to identify how they can purposefully influence (rather than only predict) outcomes of interest. Unfortunately, many non-experimental studies suffer from potential endogeneity, for example, through omitted confounding variables, which precludes claims of causality. In this conceptual tutorial, we aim to transfer the proven solution of instrumental variables and two-stage models as a means to account for endogeneity from econometrics to the field of empirical software engineering. To this end, we discuss causality and causal inference, provide a definition of endogeneity, explain its causes, and lay out the conceptual idea behind instrumental variable approaches and two-stage models. We also provide an extensive illustration with simulated data and a brief illustration with real data to demonstrate the approach, offering Stata and R code to allow researchers to replicate our analyses and apply the techniques to their own research projects. We close with concrete recommendations and a guide for researchers on how to deal with endogeneity.
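The two-stage idea summarized above can be sketched on simulated data. The following is a minimal illustration in Python with NumPy, not the Stata or R code the tutorial offers. The data-generating process, the coefficient values, and the names `simulate`, `ols`, and `two_stage_least_squares` are assumptions made for this example. An unobserved confounder `u` drives both the regressor `x` and the outcome `y`, so a naive regression of `y` on `x` is biased, while an instrument `z` that affects `y` only through `x` recovers the true effect.

```python
import numpy as np


def simulate(n=20000, beta=2.0, seed=0):
    """Simulate an endogenous regressor (all coefficients are assumed values)."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)                        # instrument: shifts x, unrelated to u
    u = rng.normal(size=n)                        # omitted confounder (unobserved)
    x = 0.8 * z + u + rng.normal(size=n)          # regressor depends on z and on u
    y = beta * x + 1.5 * u + rng.normal(size=n)   # outcome depends on x and on u
    return z, x, y


def ols(X, y):
    """Least-squares coefficients for the design matrix X."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


def two_stage_least_squares(z, x, y):
    """Return the slope on x estimated by manual two-stage least squares."""
    n = len(y)
    Z = np.column_stack([np.ones(n), z])
    x_hat = Z @ ols(Z, x)                         # stage 1: project x on the instrument
    X_hat = np.column_stack([np.ones(n), x_hat])
    return ols(X_hat, y)[1]                       # stage 2: regress y on fitted x


z, x, y = simulate()
naive = ols(np.column_stack([np.ones(len(y)), x]), y)[1]
iv = two_stage_least_squares(z, x, y)
print(f"true effect: 2.0, naive OLS: {naive:.3f}, 2SLS: {iv:.3f}")
```

In this setup the naive slope overstates the effect because `x` and the error term share the confounder `u`, whereas the two-stage estimate lands close to the true value. The manual second stage gives a valid point estimate only; its standard errors would be wrong, so a dedicated instrumental-variable routine is preferable for inference.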