Ranking Mutual Information Dependencies in a Summary-based Approximate Analytics Framework

Cited by: 2
Authors
Slezak, Dominik [1]
Borkowski, Janusz [2]
Chadzynska-Krasowska, Agnieszka [3]
Affiliations
[1] Univ Warsaw, Inst Informat, Ul Banacha 2, PL-02097 Warsaw, Poland
[2] Secur On Demand, 12121 Scripps Summit Dr 320, San Diego, CA 92131 USA
[3] Polish Japanese Acad Informat Technol, Ul Koszykowa 86, PL-02008 Warsaw, Poland
Keywords
Approximate Data Processing; Granulated Data Summaries; Approximate Mutual Information; ENGINE;
DOI
10.1109/HPCS.2018.00137
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
We continue our research on using histogram-based data summaries to approximate mutual information scores in large relational data sets. Our methodology for creating, storing, and using summaries was designed to support an approximate database engine that is currently deployed commercially in the area of cybersecurity data analytics. However, a similar idea of approximate data processing can also be considered in other fields, including machine learning, where heuristic calculations are a component of many methods. In this paper, we investigate one possible source of inaccuracy in our previously proposed approach to approximating mutual information: neglecting a kind of column domain drift during distributed summary-based computations. We illustrate it using an artificially created benchmark data set and discuss how to cope with this particular challenge in the future.
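As a rough illustration of the quantity discussed in the abstract, the sketch below computes mutual information from a two-column histogram of co-occurrence counts, and then shows with made-up numbers how per-chunk scores can differ from the score over the combined data when the chunks cover different value ranges. This is not the authors' engine or algorithm; the function name `mutual_information`, the toy count tables, and the choice to average per-chunk scores are all assumptions made only for this example.

```python
import math


def mutual_information(joint_counts):
    """Mutual information (in bits) from a 2-D table of co-occurrence counts.

    Rows are histogram bins of column X, columns are bins of column Y.
    Hypothetical helper written for this example only.
    """
    total = sum(sum(row) for row in joint_counts)
    if total == 0:
        return 0.0
    row_sums = [sum(row) for row in joint_counts]
    col_sums = [sum(col) for col in zip(*joint_counts)]
    mi = 0.0
    for i, row in enumerate(joint_counts):
        for j, n in enumerate(row):
            if n > 0:  # empty cells contribute nothing
                mi += (n / total) * math.log2(n * total / (row_sums[i] * col_sums[j]))
    return mi


# Two data chunks, each summarized over its own local bins (toy numbers).
chunk_a = [[5, 0], [0, 5]]
chunk_b = [[5, 0], [0, 5]]

# Assumption: the chunks cover disjoint value ranges (a simple form of
# domain drift), so over the whole data set their bins do not overlap.
combined = [
    [5, 0, 0, 0],
    [0, 5, 0, 0],
    [0, 0, 5, 0],
    [0, 0, 0, 5],
]

per_chunk_average = (mutual_information(chunk_a) + mutual_information(chunk_b)) / 2
global_score = mutual_information(combined)
print(per_chunk_average, global_score)
```

In this toy setup, each chunk sees only two bins per column, while the combined data has four, so a score assembled from chunk-local summaries alone underestimates the dependency measured over the full domain.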
Pages: 852-859
Number of pages: 8
Related papers
50 in total
  • [21] A mutual information based federated learning framework for edge computing networks
    Chen, Naiyue
    Li, Yinglong
    Liu, Xuejun
    Zhang, Zhenjiang
    COMPUTER COMMUNICATIONS, 2021, 176 (176) : 23 - 30
  • [22] A framework for parameter optimization in mutual information (MI)-based registration algorithms
    Gopalakrishnan, Girish
    Kumar, S. V. Bharath
    Mullick, Rakesh
    Narayanan, Ajay
    Suryanarayanan, Srikanth
    MEDICAL IMAGING 2006: IMAGE PROCESSING, PTS 1-3, 2006, 6144
  • [23] Discretization and Feature Selection Based on Bias Corrected Mutual Information Considering High-Order Dependencies
    Roy, Puloma
    Sharmin, Sadia
    Ali, Amin Ahsan
    Shoyaib, Mohammad
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 830 - 842
  • [24] Key Node Ranking in Complex Networks: A Novel Entropy and Mutual Information-Based Approach
    Li, Yichuan
    Cai, Weihong
    Li, Yao
    Du, Xin
    ENTROPY, 2020, 22 (01) : 52
  • [25] Remaining Useful Life Prediction Using Ranking Mutual Information Based Monotonic Health Indicator
    Qian, Fang
    Niu, Gang
    2015 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM), 2015,
  • [26] A Mutual Information based Framework for the Analysis of Multiple-Subject fMRI data
    Accamma, I. V.
    Suma, H. N.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [27] Genetic-based approaches in ranking function discovery and optimization in information retrieval - A framework
    Fan, Weiguo
    Pathak, Praveen
    Zhou, Mi
    DECISION SUPPORT SYSTEMS, 2009, 47 (04) : 398 - 407
  • [28] Social Analytics Framework for Intelligent Information Systems Based on a Complex Adaptive Systems Approach
    Koohborfardhaghighi, Somayeh
    Altmann, Jorn
    Tserpes, Konstantinos
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 1010 - 1017
  • [29] A Framework for Mutual Information-Based MIMO Integrated Sensing and Communication Beamforming Design
    Li, Jin
    Zhou, Gui
    Gong, Tantao
    Liu, Nan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8352 - 8366
  • [30] Scalable Global Mutual Information Based Feature Selection Framework for Large Scale Datasets
    Soheili, Majid
    Haeri, Maryam Amir
    2021 IEEE 25TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE (EDOC 2021), 2021, : 41 - 50