Toward a Causal Interpretation from Observational Data: A New Bayesian Networks Method for Structural Models with Latent Variables

被引：33

作者：

Zheng, Zhiqiang ^{[1
]}

Pavlou, Paul A. ^{[2
]}

机构：

[1] Univ Texas Dallas, Sch Management, Richardson, TX 75083 USA

[2] Temple Univ, Fox Sch Business & Management, Philadelphia, PA 19122 USA

来源：

INFORMATION SYSTEMS RESEARCH | 2010年 / 21卷 / 02期

关键词：

causality; Bayesian networks; structural equation modeling; observational data; Bayesian graphs; PROBABILISTIC NETWORKS; STATISTICS;

D O I：

10.1287/isre.1080.0224

中图分类号：

G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];

学科分类号：

1205 ; 120501 ;

摘要：

Because a fundamental attribute of a good theory is causality, the information systems (IS) literature has strived to infer causality from empirical data, typically seeking causal interpretations from longitudinal, experimental, and panel data that include time precedence. However, such data are not always obtainable and observational (cross-sectional, nonexperimental) data are often the only data available. To infer causality from observational data that are common in empirical IS research, this study develops a new data analysis method that integrates the Bayesian networks (BN) and structural equation modeling (SEM) literatures. Similar to SEM techniques (e. g., LISREL and PLS), the proposed Bayesian networks for latent variables (BN-LV) method tests both the measurement model and the structural model. The method operates in two stages: First, it inductively identifies the most likely LVs from measurement items without prespecifying a measurement model. Second, it compares all the possible structural models among the identified LVs in an exploratory (automated) fashion and it discovers the most likely causal structure. By exploring the causal structural model that is not restricted to linear relationships, BN-LV contributes to the empirical IS literature by overcoming three SEM limitations (Lee, B., A. Barua, A. B. Whinston. 1997. Discovery and representation of causal relationships in MIS research: A methodological framework. MIS Quart. 21(1) 109-136)-lack of causality inference, restrictive model structure, and lack of nonlinearities. Moreover, BN-LV extends the BN literature by (1) overcoming the problem of latent variable identification using observed (raw) measurement items as the only inputs, and (2) enabling the use of ordinal and discrete (Likert-type) data, which are commonly used in empirical IS studies. The BN-LV method is first illustrated and tested with actual empirical data to demonstrate how it can help reconcile competing hypotheses in terms of the direction of causality in a structural model. Second, we conduct a comprehensive simulation study to demonstrate the effectiveness of BN-LV compared to existing techniques in the SEM and BN literatures. The advantages of BN-LV in terms of measurement model construction and structural model discovery are discussed.

引用

页码：365 / 391

页数：27

共 50 条

[31] Toward Unique and Unbiased Causal Effect Estimation From Data With Hidden Variables
Cheng, Debo
Li, Jiuyong
Liu, Lin
Yu, Kui
Thuc Duy Le
Liu, Jixue
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6108 - 6120
[32] A new method of regression on latent variables. Application to spectral data
Vigneau, E
Qannari, EM
Berttand, D
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 63 (01) : 7 - 14
[33] R-squared change in structural equation models with latent variables and missing data
Hayes, Timothy
BEHAVIOR RESEARCH METHODS, 2021, 53 (05) : 2127 - 2157
[34] R-squared change in structural equation models with latent variables and missing data
Timothy Hayes
Behavior Research Methods, 2021, 53 : 2127 - 2157
[35] Evaluating the Efficacy of Latent Variables in Mitigating Data Poisoning Attacks in the Context of Bayesian Networks: An Empirical Study
Alzahrani, Shahad
Alsuwat, Hatim
Alsuwat, Emad
CMES - Computer Modeling in Engineering and Sciences, 2024, 139 (02): : 1635 - 1654
[36] Evaluating the Efficacy of Latent Variables in Mitigating Data Poisoning Attacks in the Context of Bayesian Networks: An Empirical Study
Alzahrani, Shahad
Alsuwat, Hatim
Alsuwat, Emad
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (02): : 1635 - 1654
[37] A BAYESIAN METHOD FOR THE INDUCTION OF PROBABILISTIC NETWORKS FROM DATA
COOPER, GF
HERSKOVITS, E
MACHINE LEARNING, 1992, 9 (04) : 309 - 347
[38] Scaling of Union of Intersections for Inference of Granger Causal Networks from Observational Data
Balasubramanian, Mahesh
Ruiz, Trevor D.
Cook, Brandon
Prabhat
Bhattacharyya, Sharmodeep
Shrivastava, Aviral
Bouchard, Kristofer E.
2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 264 - 273
[39] Learning causal networks from data: a survey and a new algorithm for recovering possibilistic causal networks
Sanguesa, R
Cortes, U
AI COMMUNICATIONS, 1997, 10 (01) : 31 - 61
[40] A novel fault diagnosis method for Bayesian networks fusing models and data
Wang, Jinhua
Ma, Xuehua
Jie, Cao
Liu, Yunqiang
Li, Chen
NUCLEAR ENGINEERING AND DESIGN, 2024, 426

← 1 2 3 4 5 →