Visual Bayesian Fusion to Navigate a Data Lake

被引:0
|
作者
Singh, Karamjit [1 ]
Paneri, Kaushal [1 ]
Pandey, Aditeya [1 ]
Gupta, Garima [1 ]
Sharma, Geetika [1 ]
Agarwal, Puneet [1 ]
Shroff, Gautam [1 ]
机构
[1] Tata Consultancy Serv Ltd, TCS Res, Gurgaon, India
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The evolution from traditional business intelligence to big data analytics has witnessed the emergence of 'Data Lakes' in which data is ingested in raw form rather than into traditional data warehouses. With the increasing availability of many more pieces of information about each entity of interest, e.g., a customer, often from diverse sources (socialmedia, mobility, internet-of-things), fusing, visualizing and deriving insights from such data pose a number of challenges: First, disparate datasets often lack a natural join key. Next, datasets may describe measures at different levels of granularity, e.g., individual vs. aggregate data, and finally, different datasets may be derived from physically distinct populations. Moreover, once data has been fused, queries are often an inefficient and inaccurate mechanism to derive insight from high-dimensional data. In this paper we describe iFuse, a data-fusion based visual analytics platform for navigating a data lake to derive insights. We rely on Bayesian graphical models to provide useful rudder with which to fuse and analyze disparate islands of data in a systematic manner. Our platform allows for rich interactive visualizations, querying and keyword-based search within and across datasets or models, as well as intuitive visual interfaces for value-imputation or model-based predictions. We illustrate the use of our platform in multiple scenarios, including two public data challenges as well as a real-life industry use-case involving the probabilistic fusion of datasets that lack a natural join-key.
引用
收藏
页码:987 / 994
页数:8
相关论文
共 50 条
  • [1] Bayesian robustification for audio visual fusion
    Movellan, J
    Mineiro, P
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 742 - 748
  • [2] Bayesian-Based Fusion of Monitoring Data and Visual Inspections in Monumental Structures
    Ierimonti, Laura
    Venanzi, Ilaria
    Cavalagli, Nicola
    Garcia-Macias, Enrique
    Ubertini, Filippo
    EUROPEAN WORKSHOP ON STRUCTURAL HEALTH MONITORING (EWSHM 2022), VOL 2, 2023, : 1066 - 1075
  • [3] Fusion Utility, Search, Index, Obtain, and Navigate (FUSION) over Enormous Data
    Blasch, Erik
    Seetharaman, Guna
    Aved, Alexander J.
    Nagy, James
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745
  • [4] Fusion Utility, Search, Index, Obtain, and Navigate (FUSION) over Enormous Data
    Blasch, Erik P.
    Seetharaman, Guna
    Aved, Alexander J.
    Nagy, James
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745
  • [5] Sensors Data Fusion to Navigate Inside Pipe Using Kalman Filter
    Siqueira, Everson
    Azzolin, Rodrigo
    Botelho, Silvia
    Oliveira, Vinicius
    2016 IEEE 21ST INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2016,
  • [6] A Bayesian approach to NDT data fusion
    Gros, XE
    Strachan, P
    Lowden, DW
    INSIGHT, 1995, 37 (05) : 363 - +
  • [7] Bayesian Data Fusion With Shared Priors
    Wu, Peng
    Imbiriba, Tales
    Elvira, Victor
    Closas, Pau
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 275 - 288
  • [8] Bayesian approaches to data fusion in metrology
    Kelly, GP
    ADVANCED MATHEMATICAL AND COMPUTATIONAL TOOLS IN METROLOGY V, 2001, 57 : 224 - 230
  • [9] Bayesian data analysis for fusion diagnostics
    Yoon, JS
    Fischer, R
    Gori, S
    Knauer, J
    JOURNAL OF THE KOREAN PHYSICAL SOCIETY, 2004, 45 (06) : 1544 - 1552
  • [10] Qualification of traffic data by Bayesian Network Data Fusion
    Junghans, Marek
    Jentschel, Hans-Joachim
    2007 PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2007, : 17 - 23