Visual Bayesian Fusion to Navigate a Data Lake

被引:0
|
作者
Singh, Karamjit [1 ]
Paneri, Kaushal [1 ]
Pandey, Aditeya [1 ]
Gupta, Garima [1 ]
Sharma, Geetika [1 ]
Agarwal, Puneet [1 ]
Shroff, Gautam [1 ]
机构
[1] Tata Consultancy Serv Ltd, TCS Res, Gurgaon, India
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The evolution from traditional business intelligence to big data analytics has witnessed the emergence of 'Data Lakes' in which data is ingested in raw form rather than into traditional data warehouses. With the increasing availability of many more pieces of information about each entity of interest, e.g., a customer, often from diverse sources (socialmedia, mobility, internet-of-things), fusing, visualizing and deriving insights from such data pose a number of challenges: First, disparate datasets often lack a natural join key. Next, datasets may describe measures at different levels of granularity, e.g., individual vs. aggregate data, and finally, different datasets may be derived from physically distinct populations. Moreover, once data has been fused, queries are often an inefficient and inaccurate mechanism to derive insight from high-dimensional data. In this paper we describe iFuse, a data-fusion based visual analytics platform for navigating a data lake to derive insights. We rely on Bayesian graphical models to provide useful rudder with which to fuse and analyze disparate islands of data in a systematic manner. Our platform allows for rich interactive visualizations, querying and keyword-based search within and across datasets or models, as well as intuitive visual interfaces for value-imputation or model-based predictions. We illustrate the use of our platform in multiple scenarios, including two public data challenges as well as a real-life industry use-case involving the probabilistic fusion of datasets that lack a natural join-key.
引用
收藏
页码:987 / 994
页数:8
相关论文
共 50 条
  • [21] Deformable Bayesian Networks for Data Clustering and Fusion
    Kampa, Kittipat
    Principe, Jose C.
    Cobb, J. Tory
    Rangarajan, Anand
    DETECTION AND SENSING OF MINES, EXPLOSIVE OBJECTS, AND OBSCURED TARGETS XVI, 2011, 8017
  • [22] Bayesian data fusion: Spatial and temporal applications
    Fasbender, Dominique
    Obsomer, Valerie
    Radoux, Julien
    Bogaert, Patrick
    Defourny, Pierre
    2007 INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTI-TEMPORAL REMOTE SENSING IMAGES, 2007, : 145 - 150
  • [23] Bayesian data fusion for adaptable image pansharpening
    Fasbender, Dominique
    Radoux, Julien
    Bogaert, Patrick
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (06): : 1847 - 1857
  • [24] Sensors Data Fusion via Bayesian Network
    Vechet, S.
    Krejsa, J.
    RECENT ADVANCES IN MECHATRONICS: 2008-2009, 2009, : 221 - 226
  • [25] Data fusion for visual tracking with particles
    Pérez, P
    Vermaak, J
    Blake, A
    PROCEEDINGS OF THE IEEE, 2004, 92 (03) : 495 - 513
  • [26] Visual localization using Bayesian decision fusion on omnidirectional sensing
    Paletta, L
    Frintrop, S
    Hertzberg, J
    SENSOR FUSION: ARCHITECTURES, ALGORITHMS AND APPLICATIONS V, 2001, 4385 : 58 - 66
  • [27] Tracker-Level Fusion for Robust Bayesian Visual Tracking
    Biresaw, Tewodros A.
    Cavallaro, Andrea
    Regazzoni, Carlo S.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (05) : 776 - 789
  • [28] Bayesian networks to classify visual field data
    Tucker, A
    Vinciotti, V
    Liu, X
    Garway-Heath, D
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2004, 45 : U782 - U782
  • [29] Data visualization and data fusion on the visual performance of illustration
    Huang Gongxiang
    Qu Huimin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8795 - 8803
  • [30] Bayesian Optimization based Dempster-Shafer fusion for brain-robot cooperation to navigate a mobile robot
    Han, Jixin
    Gao, Chen
    Cheng, Jian
    Chen, Lin
    Zhao, Jing
    2024 WRC SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION, WRC SARA, 2024, : 40 - 45