Enhancing the Interactive Visualisation of a Data Preparation Tool from in-Memory Fitting to Big Data Sets

Cited by: 1
Authors
Epelde, Gorka [1, 2]
Alvarez, Roberto [1, 2]
Beristain, Andoni [1, 2]
Arrue, Monica [1, 2]
Arangoa, Itsasne [1, 2]
Rankin, Debbie [3]
Affiliations
[1] Basque Res & Technol Alliance BRTA, Vicomtech Fdn, Mikeletegi 57, Donostia San Sebastian 20009, Spain
[2] Biodonostia Hlth Res Inst, E Hlth Grp, San Sebastian 20014, Spain
[3] Univ Ulster, Sch Comp Engn & Intelligent Syst, Derry, Londonderry, North Ireland
Funding
EU Horizon 2020;
Keywords
Big data visualisation; Data preparation; Data quality; Exploratory data analysis; Visual information cluttering; Data reduction; Asynchronous pre-processing; EXPLORATION;
DOI
10.1007/978-3-030-61146-0_22
Chinese Library Classification
F [Economics];
Discipline classification code
02;
Abstract
In order to derive reliable insights or make evidence-based decisions, the starting point is to assess and meet a minimum quality of data, either by those that publish the data (preferably) or alternatively by those that prepare data for analysis and develop specific analytics. Much of the (open) data shared by governments and different institutions, or crowdsourced, is in tabular format, and its amount and size are increasing rapidly. This paper presents the challenges faced and the solutions adopted while evolving the web-based graphical user interface (GUI) of a tabular data preparation tool from in-memory fitting to Big Data sets. Traditional standalone processing and rendering solutions are no longer usable in a Big Data context. We report on the approach adopted to asynchronously precompute the visualisations required for the tool, in addition to the applied visualisation aggregation strategies. The implementation of this approach has allowed us to overcome web browsers' client-side data handling limitations and to avoid information overload when using granular information charts from our existing in-memory data preparation tool with Big Data sets. The developed solution provides the user with an acceptable GUI interaction time.
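The aggregation idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `binned_counts`, its parameters, and the choice of equal-width binning are all assumptions made for illustration. It shows how a column of any size can be reduced, in a single streaming pass on the server side, to a fixed number of bin counts, so that the browser only ever receives `n_bins` numbers instead of every row.

```python
from typing import Iterable, List


def binned_counts(values: Iterable[float], lo: float, hi: float,
                  n_bins: int) -> List[int]:
    """Reduce a stream of numbers to n_bins equal-width bin counts.

    Hypothetical helper: the name and signature are illustrative only.
    Values outside [lo, hi] are ignored; hi itself falls in the last bin.
    """
    if n_bins <= 0:
        raise ValueError("n_bins must be positive")
    if hi <= lo:
        raise ValueError("hi must be greater than lo")

    counts = [0] * n_bins
    width = (hi - lo) / n_bins

    # Single pass over the data: memory use depends on n_bins,
    # not on how many values the column contains.
    for v in values:
        if v < lo or v > hi:
            continue
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    return counts


if __name__ == "__main__":
    # Ten values reduced to five bars for a histogram-style chart.
    print(binned_counts(range(10), lo=0, hi=10, n_bins=5))
```

In a setting like the one the abstract describes, a reduction of this kind would presumably run as a background job when a data set is loaded or modified, with the stored result served to the GUI on request; that scheduling layer is outside the scope of this sketch.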
Pages: 272-284
Number of pages: 13
Related papers
50 in total
  • [1] In-Memory Performance for Big Data
    Graefe, Goetz
    Volos, Haris
    Kimura, Hideaki
    Kuno, Harumi
    Tucek, Joseph
    Lillibridge, Mark
    Veitch, Alistair
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (01): 37-48
  • [2] Interactive software tool for data visualisation
    Kolodnytsky, M
    Kovalchuk, A
    IDAACS'2001: PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATION, 2001: 107-110
  • [3] A Mobile Tool for Interactive Visualisation of Genomics Data
    Quang Vinh Nguyen
    Lau, Chng Wei
    Qu, Zhonglin
    Simoff, Simeon
    Huang, Mao Lin
    Catchpoole, Daniel R.
    2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018: 688-697
  • [4] In-Memory Big Data Management and Processing: A Survey
    Zhang, Hao
    Chen, Gang
    Ooi, Beng Chin
    Tan, Kian-Lee
    Zhang, Meihui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (07): 1920-1948
  • [5] Simba: Spatial In-Memory Big Data Analysis
    Xie, Dong
    Li, Feifei
    Yao, Bin
    Li, Gefei
    Chen, Zhongpu
    Zhou, Liang
    Guo, Minyi
    24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016
  • [6] Fast and Efficient In-Memory Big Data Processing
    Malik, Babur Hayat
    Maryam, Maliha
    Khalid, Myda
    Khlaid, Javaria
    Rehman, Naj Am Ur
    Sajjad, Syeda Iqra
    Islam, Tanveer
    Butt, Umair Ahmed
    Raza, Ali
    Nasr, M. Saad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05): 517-524
  • [7] Distributed In-Memory Analytics for Big Temporal Data
    Yao, Bin
    Zhang, Wei
    Wang, Zhi-Jie
    Chen, Zhongpu
    Shang, Shuo
    Zheng, Kai
    Guo, Minyi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827: 549-565
  • [8] ViDaX: An Interactive Semantic Data Visualisation and Exploration Tool
    Dumas, Bruno
    Broche, Tim
    Hoste, Lode
    Signer, Beat
    PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012: 757-760
  • [9] From visualisation to data mining with large data sets
    Adelmann, A
    Ryne, RD
    Shalf, JM
    Siegerist, C
    2005 IEEE PARTICLE ACCELERATOR CONFERENCE (PAC), VOLS 1-4, 2005: 542-544
  • [10] Online Data Deduplication for In-Memory Big-Data Analytic Systems
    Sun, Yushi
    Zeng, Catherine Y.
    Chung, Jaeyoon
    Huang, Zhe
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017