The Nu Expression for Probabilistic Data Integration

被引:0
|
作者
Evgenia I. Polyakova
Andre G. Journel
机构
[1] Stanford University,Department of Geological and Environmental Sciences
来源
Mathematical Geology | 2007年 / 39卷
关键词
Data integration; Data interaction vs. dependence; Updating probabilities; Conditional independence;
D O I
暂无
中图分类号
学科分类号
摘要
The general problem of data integration is expressed as that of combining probability distributions conditioned to each individual datum or data event into a posterior probability for the unknown conditioned jointly to all data. Any such combination of information requires taking into account data interaction for the specific event being assessed. The nu expression provides an exact analytical representation of such a combination. This representation allows a clear and useful separation of the two components of any data integration algorithm: individual data information content and data interaction, the latter being different from data dependence. Any estimation workflow that fails to address data interaction is not only suboptimal, but may result in severe bias. The nu expression reduces the possibly very complex joint data interaction to a single multiplicative correction parameter ν0, difficult to evaluate but whose exact analytical expression is given; availability of such an expression provides avenues for its determination or approximation. The case ν0=1 is more comprehensive than data conditional independence; it delivers a preliminary robust approximation in presence of actual data interaction. An experiment where the exact results are known allows the results of the ν0=1 approximation to be checked against the traditional estimators based on assumption of data independence.
引用
收藏
页码:715 / 733
页数:18
相关论文
共 50 条
  • [31] Probabilistic estimation of microarray data reliability and underlying gene expression
    Sven Bilke
    Thomas Breslin
    Mikael Sigvardsson
    BMC Bioinformatics, 4
  • [32] Integration of probabilistic exposure assessment and probabilistic hazard characterization
    van der Voet, Hilko
    Slob, Wout
    van Klaveren, Jacob
    TOXICOLOGY LETTERS, 2007, 172 : S109 - S109
  • [33] Integration of probabilistic exposure assessment and probabilistic hazard characterization
    van der Voet, Hilko
    Slob, Wout
    RISK ANALYSIS, 2007, 27 (02) : 351 - 371
  • [34] Probabilistic Histograms for Probabilistic Data
    Cormode, Graham
    Deligiannakis, Antonios
    Garofalakis, Minos
    McGregor, Andrew
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01):
  • [35] Vertical integration methods for gene expression data analysis
    Wu, Mengyun
    Yi, Huangdi
    Ma, Shuangge
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [36] Knowledge Integration by Probabilistic Argumentation
    Oo, Saung Hnin Pwint
    Nguyen Duy Hung
    Theeramunkong, Thanaruk
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (08): : 1843 - 1855
  • [37] User feedback in probabilistic integration
    de Keijzer, Ander
    van Keulen, Maurice
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 377 - +
  • [38] Integration of Multi-Omics Data Using Probabilistic Graph Models and External Knowledge
    Tripp, Bridget A.
    Otu, Hasan H.
    CURRENT BIOINFORMATICS, 2022, 17 (01) : 37 - 47
  • [39] PINCAGE: probabilistic integration of cancer genomics data for perturbed gene identification and sample classification
    Switnicki, Michal P.
    Juul, Malene
    Madsen, Tobias
    Sorensen, Karina D.
    Pedersen, Jakob S.
    BIOINFORMATICS, 2016, 32 (09) : 1353 - 1365
  • [40] Integration of Meta-Multi-Omics Data Using Probabilistic Graphs and External Knowledge
    Can, Handan
    Chanumolu, Sree K.
    Nielsen, Barbara D.
    Alvarez, Sophie
    Naldrett, Michael J.
    Unlu, Guelhan
    Otu, Hasan H.
    CELLS, 2023, 12 (15)