Improving Causal Bayesian Networks Using Expertise in Authoritative Medical Ontologies

Cited by: 1
Authors
Hu, Hengyi [1]
Kerschberg, Larry [1]
Affiliations
[1] George Mason Univ, Fairfax, VA 22030 USA
Source
Keywords
Patient data; data mining; data management; Bayesian networks; causal inference; causal networks; causality; healthcare data; healthcare information technology; ontology; ontology evolution; GENERALIZED ANXIETY DISORDER; PANIC DISORDER; DYSTHYMIC DISORDER; DEPRESSION; IRRITABILITY; INSOMNIA; SEQUENCE; PREVALENCE; EVOLUTION; DISCOVERY
DOI
10.1145/3604561
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Discovering causal relationships among symptoms is a topical issue in the analysis of observational patient datasets. A Causal Bayesian Network (CBN) is a popular analytical framework for causal inference. Many methods and algorithms can learn a Bayesian network, but their results depend on the complexity and thoroughness of the algorithm, and they do not consider prior expertise from authoritative sources. This article proposes a novel method that extracts prior causal knowledge contained in Authoritative Medical Ontologies (AMOs) and uses it to orient arcs in a CBN learned from observational patient data. Because AMOs are robust biomedical ontologies containing the collective knowledge of the experts who created them, utilizing the ordering information contained within them produces improved CBNs that provide additional insight into the disease domain.
To demonstrate our method, we obtained prior causal ordering information among symptoms from three AMOs: (1) the Medical Dictionary for Regulatory Activities Terminology (MedDRA), (2) the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM), and (3) the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The prior ontological knowledge from these three AMOs is then used to orient arcs in a series of CBNs learned with the Max-Min Hill-Climbing (MMHC) algorithm from the patient dataset of the National Institute of Mental Health study on Sequenced Treatment Alternatives to Relieve Depression (STAR*D). Six distinct CBNs are generated using MMHC: an unmodified baseline model using only the algorithm, three CBNs oriented with ordered-variable pairs from MedDRA, ICD-10-CM, and SNOMED CT, and two more with ordered pairs from a combination of these AMOs.
Orienting arcs with ordered-variable pairs significantly changes the structure of the network: the agreement between the Modified networks and the Baseline ranges from 50% to 90%. A modified network using ordering information from all ontologies obtained an agreement of 50% (10 out of 20 arcs exist in both the Baseline and Modified models) while maintaining comparable predictive accuracy. This indicates that the Modified CBN reflects the causal claims in the AMOs and agrees with both the AMOs and the observational STAR*D dataset. Furthermore, in a qualitative analysis of the significance of these relationships in existing epidemiological research, the Modified models discovered new, potentially causal relationships among symptoms while eliminating weaker edges.
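As a rough illustration of the two ideas in the abstract, orienting arcs with ordered-variable pairs and measuring arc agreement against a baseline, the following Python sketch may help. It is not the authors' implementation: it flips already-learned arcs rather than constraining MMHC during learning, and the function names (`orient_arcs`, `is_acyclic`, `arc_agreement`), the placeholder variables (`symptom_a` to `symptom_d`), and the toy arcs are all hypothetical. The abstract does not say which arc count serves as the denominator of the agreement figure, so the sketch assumes the baseline's.

```python
from graphlib import CycleError, TopologicalSorter

Arc = tuple[str, str]  # (cause, effect)


def orient_arcs(arcs: set[Arc], ordered_pairs: set[Arc]) -> set[Arc]:
    """Flip every arc whose direction contradicts a prior ordered pair.

    Assumption: an ordered pair (x, y) means the ontology places x before y.
    """
    oriented = set()
    for cause, effect in arcs:
        if (effect, cause) in ordered_pairs:
            oriented.add((effect, cause))  # the prior says the reverse direction
        else:
            oriented.add((cause, effect))  # no conflicting prior: keep as learned
    return oriented


def is_acyclic(arcs: set[Arc]) -> bool:
    """Check that the arcs still form a DAG, as a Bayesian network requires."""
    predecessors: dict[str, set[str]] = {}
    for cause, effect in arcs:
        predecessors.setdefault(effect, set()).add(cause)
        predecessors.setdefault(cause, set())
    try:
        TopologicalSorter(predecessors).prepare()
    except CycleError:
        return False
    return True


def arc_agreement(baseline: set[Arc], modified: set[Arc]) -> float:
    """Share of baseline arcs that appear, same direction, in the modified model."""
    return len(baseline & modified) / len(baseline)


# Toy data only: placeholder variables, not results from STAR*D or any ontology.
baseline = {
    ("symptom_b", "symptom_c"),
    ("symptom_c", "symptom_d"),
    ("symptom_a", "symptom_b"),
    ("symptom_a", "symptom_d"),
}
priors = {("symptom_c", "symptom_b"), ("symptom_d", "symptom_a")}

modified = orient_arcs(baseline, priors)
print(is_acyclic(modified))
print(arc_agreement(baseline, modified))
```

In this toy case two of the four baseline arcs are reversed, so the agreement works out to one half, computed the same way as the "10 out of 20 arcs" figure under the stated denominator assumption. A fuller treatment would pass the ordered pairs to the structure learner as constraints so that the network is relearned rather than edited after the fact.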
Pages: 32