Ranking Mutual Information Dependencies in a Summary-based Approximate Analytics Framework

被引:2
|
作者
Slezak, Dominik [1 ]
Borkowski, Janusz [2 ]
Chadzynska-Krasowska, Agnieszka [3 ]
机构
[1] Univ Warsaw, Inst Informat, Ul Banacha 2, PL-02097 Warsaw, Poland
[2] Secur On Demand, 12121 Scripps Summit Dr 320, San Diego, CA 92131 USA
[3] Polish Japanese Acad Informat Technol, Ul Koszykowa 86, PL-02008 Warsaw, Poland
关键词
Approximate Data Processing; Granulated Data Summaries; Approximate Mutual Information; ENGINE;
D O I
10.1109/HPCS.2018.00137
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We continue our research on utilizing histogram-based data summaries in approximate derivation of mutual information scores in large relational data sets. Our methodology of creating, storing and using summaries has been designed for the purpose of developing an approximate database engine that is currently deployed commercially in the area of cybersecurity data analytics. However, a similar idea of approximate data processing operations can be considered also in other fields, including machine learning whereby heuristic calculations are a component of many methods. In this paper, we focus on investigation of one possible source of inaccuracy of our previously proposed approach to approximating mutual information - that is, neglecting a kind of column domain drift during distributed summary-based computations. We illustrate it using an artificially created benchmark data set and we discuss how to cope this particular challenge in the future.
引用
收藏
页码:852 / 859
页数:8
相关论文
共 50 条
  • [31] A knowledge driven mutual information-based analytical framework for the identification of rumen metabolites
    Wang, Mengyuan
    Zheng, Huiru
    Wang, Haiying
    Dewhurst, Richard J.
    Roehe, Rainer
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 255 - 260
  • [32] Extensive framework based on novel convolutional and variational autoencoder based on maximization of mutual information for anomaly detection
    Qien Yu
    Muthu Subash Kavitha
    Takio Kurita
    Neural Computing and Applications, 2021, 33 : 13785 - 13807
  • [33] Extensive framework based on novel convolutional and variational autoencoder based on maximization of mutual information for anomaly detection
    Yu, Qien
    Kavitha, Muthusubash
    Kurita, Takio
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (20): : 13785 - 13807
  • [34] Double hierarchy hesitant fuzzy linguistic information based framework for personalized ranking of sustainable suppliers
    Raghunathan Krishankumar
    Dragan Pamucar
    Alok Pandey
    Samarjit Kar
    Kattur Soundarapandian Ravichandran
    Environmental Science and Pollution Research, 2022, 29 : 65371 - 65390
  • [35] Double hierarchy hesitant fuzzy linguistic information based framework for personalized ranking of sustainable suppliers
    Krishankumar, Raghunathan
    Pamucar, Dragan
    Pandey, Alok
    Kar, Samarjit
    Ravichandran, Kattur Soundarapandian
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2022, 29 (43) : 65371 - 65390
  • [36] A Probabilistic Topic-Based Ranking Framework for Location-Sensitive Domain Information Retrieval
    Li, Huajing
    Li, Zhisheng
    Lee, Wang-Chien
    Lee, Dik Lun
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 331 - 338
  • [37] Multimodal fusion of EEG-fNIRS: a mutual information-based hybrid classification framework
    Deligani, Roohollah Jafari
    Borgheai, Seyyed Bahram
    McLinden, John
    Shahriari, Yalda
    BIOMEDICAL OPTICS EXPRESS, 2021, 12 (03) : 1635 - 1650
  • [38] A Statistical Framework for Neuroimaging Data Analysis Based on Mutual Information Estimated via a Gaussian Copula
    Ince, Robin A. A.
    Giordano, Bruno L.
    Kayser, Christoph
    Rousselet, Guillaume A.
    Gross, Joachim
    Schyns, Philippe G.
    HUMAN BRAIN MAPPING, 2017, 38 (03) : 1541 - 1573
  • [40] Voxel selection framework based on meta-heuristic search and mutual information for brain decoding
    Hourani, Osama
    Charkari, Nasrollah Moghadam
    Jalili, Saeed
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2019, 29 (04) : 663 - 676