Mining combined causes in large data sets

Cited by: 13
Authors
Ma, Saisai [1]
Li, Jiuyong [1]
Liu, Lin [1]
Thuc Duy Le [1]
Affiliations
[1] Univ S Australia, Sch Informat Technol & Math Sci, Mawson Lakes, SA 5095, Australia
Funding
Australian Research Council;
Keywords
Causal discovery; Combined causes; Local causal discovery; HITON-PC; Multi-level HITON-PC; LEARNING BAYESIAN NETWORKS; ASSOCIATION; DISCOVERY; CAUSATION; MODELS;
DOI
10.1016/j.knosys.2015.10.018
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, many methods have been developed for detecting causal relationships in observational data, and some of them have the potential to tackle large data sets. However, these methods fail to discover a combined cause, i.e., a multi-factor cause consisting of two or more component variables that individually are not causes. A straightforward approach to uncovering a combined cause is to include both individual and combined variables in the causal discovery using existing methods, but this scheme is computationally infeasible because of the huge number of combined variables. In this paper, we propose a novel approach to this practical causal discovery problem, i.e., mining combined causes in large data sets. Experiments with both synthetic and real-world data sets show that the proposed method can obtain high-quality causal discoveries with high computational efficiency. (C) 2015 Elsevier B.V. All rights reserved.
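The two points made in the abstract, that component variables may show no individual effect while their combination does, and that enumerating every combined variable grows quickly, can be illustrated with a small sketch. This is not the method proposed in the paper. The XOR toy data, the simple probability-difference measure, and the helper names `count_combined_variables` and `association` are all assumptions introduced here for illustration only.

```python
from itertools import product
from math import comb


def count_combined_variables(n_vars: int, max_size: int) -> int:
    """Count variable subsets of size 2..max_size (candidate combined variables)."""
    return sum(comb(n_vars, k) for k in range(2, max_size + 1))


def association(xs, ys):
    """Return P(y=1 | x=1) - P(y=1 | x=0) for two binary lists."""
    ones = [y for x, y in zip(xs, ys) if x == 1]
    zeros = [y for x, y in zip(xs, ys) if x == 0]
    return sum(ones) / len(ones) - sum(zeros) / len(zeros)


# Hypothetical toy data: every (a, b) pair occurs equally often and the
# target z is a XOR b, so neither a nor b alone shifts the rate of z.
rows = [(a, b, a ^ b) for a, b in product([0, 1], repeat=2)] * 25
a_col = [r[0] for r in rows]
b_col = [r[1] for r in rows]
z_col = [r[2] for r in rows]

# A combined variable built from both components (here simply a XOR b).
ab_col = [a ^ b for a, b in zip(a_col, b_col)]

print("a alone:      ", association(a_col, z_col))
print("b alone:      ", association(b_col, z_col))
print("a and b joint:", association(ab_col, z_col))

# Size of the naive search space for 100 variables, combining up to 3 at a time.
print("candidates:", count_combined_variables(100, 3))
```

In this toy setting the individual variables show no association with the target while the combined variable determines it completely, and even a modest 100 variables already yield a six-figure number of candidate pairs and triples, before counting the value combinations within each subset.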
Pages: 104-111
Number of pages: 8