Toward a Causal Interpretation from Observational Data: A New Bayesian Networks Method for Structural Models with Latent Variables

被引:33
|
作者
Zheng, Zhiqiang [1 ]
Pavlou, Paul A. [2 ]
机构
[1] Univ Texas Dallas, Sch Management, Richardson, TX 75083 USA
[2] Temple Univ, Fox Sch Business & Management, Philadelphia, PA 19122 USA
关键词
causality; Bayesian networks; structural equation modeling; observational data; Bayesian graphs; PROBABILISTIC NETWORKS; STATISTICS;
D O I
10.1287/isre.1080.0224
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Because a fundamental attribute of a good theory is causality, the information systems (IS) literature has strived to infer causality from empirical data, typically seeking causal interpretations from longitudinal, experimental, and panel data that include time precedence. However, such data are not always obtainable and observational (cross-sectional, nonexperimental) data are often the only data available. To infer causality from observational data that are common in empirical IS research, this study develops a new data analysis method that integrates the Bayesian networks (BN) and structural equation modeling (SEM) literatures. Similar to SEM techniques (e. g., LISREL and PLS), the proposed Bayesian networks for latent variables (BN-LV) method tests both the measurement model and the structural model. The method operates in two stages: First, it inductively identifies the most likely LVs from measurement items without prespecifying a measurement model. Second, it compares all the possible structural models among the identified LVs in an exploratory (automated) fashion and it discovers the most likely causal structure. By exploring the causal structural model that is not restricted to linear relationships, BN-LV contributes to the empirical IS literature by overcoming three SEM limitations (Lee, B., A. Barua, A. B. Whinston. 1997. Discovery and representation of causal relationships in MIS research: A methodological framework. MIS Quart. 21(1) 109-136)-lack of causality inference, restrictive model structure, and lack of nonlinearities. Moreover, BN-LV extends the BN literature by (1) overcoming the problem of latent variable identification using observed (raw) measurement items as the only inputs, and (2) enabling the use of ordinal and discrete (Likert-type) data, which are commonly used in empirical IS studies. The BN-LV method is first illustrated and tested with actual empirical data to demonstrate how it can help reconcile competing hypotheses in terms of the direction of causality in a structural model. Second, we conduct a comprehensive simulation study to demonstrate the effectiveness of BN-LV compared to existing techniques in the SEM and BN literatures. The advantages of BN-LV in terms of measurement model construction and structural model discovery are discussed.
引用
收藏
页码:365 / 391
页数:27
相关论文
共 50 条
  • [41] Comparison of strategies for scalable causal discovery of latent variable models from mixed data
    Raghu V.K.
    Ramsey J.D.
    Morris A.
    Manatakis D.V.
    Sprites P.
    Chrysanthis P.K.
    Glymour C.
    Benos P.V.
    International Journal of Data Science and Analytics, 2018, 6 (1) : 33 - 45
  • [42] Bayesian modelling of rainfall data by using non-homogeneous hidden Markov models and latent Gaussian variables
    Heaps, Sarah E.
    Boys, Richard J.
    Farrow, Malcolm
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2015, 64 (03) : 543 - 568
  • [43] Revising regulatory networks: from expression data to linear causal models
    Bay, SD
    Shrager, J
    Pohorille, A
    Langley, P
    JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (5-6) : 289 - 297
  • [44] Spatio-temporal Bayesian network models with latent variables for revealing trophic dynamics and functional networks in fisheries ecology
    Trifonova, Neda
    Kenny, Andrew
    Maxwell, David
    Duplisea, Daniel
    Fernandes, Jose
    Tucker, Allan
    ECOLOGICAL INFORMATICS, 2015, 30 : 142 - 158
  • [45] Causal inference in survival analysis using longitudinal observational data: Sequential trials and marginal structural models
    Keogh, Ruth H.
    Gran, Jon Michael
    Seaman, Shaun R.
    Davies, Gwyneth
    Vansteelandt, Stijn
    STATISTICS IN MEDICINE, 2023, 42 (13) : 2191 - 2225
  • [46] Identification of marginal causal relationships in gene networks from observational and interventional expression data
    Monneret, Gilles
    Jaffrezic, Florence
    Rau, Andrea
    Zerjal, Tatiana
    Nuel, Gregory
    PLOS ONE, 2017, 12 (03):
  • [47] METHOD OF PROBABILISTIC INFERENCE FROM LEARNING DATA IN BAYESIAN NETWORKS
    Terent'yev, A. N.
    Biduk, P. I.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2007, 43 (03) : 391 - 396
  • [48] Learning Unknown Intervention Targets in Structural Causal Models from Heterogeneous Data
    Yang, Yuqin
    Salehkaleybar, Saber
    Kiyavash, Negar
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [49] Deriving causal Bayesian networks from human reliability analysis data: A methodology and example model
    Groth, Katrina M.
    Mosleh, Ali
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART O-JOURNAL OF RISK AND RELIABILITY, 2012, 226 (O4) : 361 - 379
  • [50] An Efficient Entropy-Based Causal Discovery Method for Linear Structural Equation Models With IID Noise Variables
    Xie, Feng
    Cai, Ruichu
    Zeng, Yan
    Gao, Jiantao
    Hao, Zhifeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (05) : 1667 - 1680