Enhancing the Interactive Visualisation of a Data Preparation Tool from in-Memory Fitting to Big Data Sets

被引:1
|
作者
Epelde, Gorka [1 ,2 ]
Alvarez, Roberto [1 ,2 ]
Beristain, Andoni [1 ,2 ]
Arrue, Monica [1 ,2 ]
Arangoa, Itsasne [1 ,2 ]
Rankin, Debbie [3 ]
机构
[1] Basque Res & Technol Alliance BRTA, Vicomtech Fdn, Mikeletegi 57, Donostia San Sebastian 20009, Spain
[2] Biodonostia Hlth Res Inst, E Hlth Grp, San Sebastian 20014, Spain
[3] Univ Ulsan, Sch Comp Engn & Intelligent Syst, Derry, Londonderry, North Ireland
基金
欧盟地平线“2020”;
关键词
Big data visualisation; Data preparation; Data quality; Exploratory data analysis; Visual information cluttering; Data reduction; Asynchronous pre-processing; EXPLORATION;
D O I
10.1007/978-3-030-61146-0_22
中图分类号
F [经济];
学科分类号
02 ;
摘要
In order to derive reliable insights or make evidence-based decisions, the starting point is to assess and meet a minimum quality of data, either by those that publish the data (preferably) or alternatively by those that prepare data for analysis and develop specific analytics. Much of the (open) data shared by governments and different institutions, or crowdsourced, is in tabular format, and the amount and size of it is increasing rapidly. This paper presents the challenges faced and the solutions adopted while evolving the web-based graphical user interface (GUI) of a tabular data preparation tool from in-memory fitting to Big Data sets. Traditional standalone processing and rendering solutions are no longer usable in a Big Data context. We report on the approach adopted to asynchronously precompute the visualisations required for the tool, in addition to the applied visualisation aggregation strategies. The implementation of this approach has allowed us to overcome web-browsers' client-side data handling limitations and to avoid information overloadwhen using granular information charts from our existing in-memory data preparation tool with Big Data sets. The developed solution provides the user with an acceptable GUI interaction time.
引用
收藏
页码:272 / 284
页数:13
相关论文
共 50 条
  • [21] Timo: In-Memory Temporal Query Processing for Big Temporal Data
    Zheng, Xiao
    Liu, Hou-kai
    Wei, Lin-na
    Wu, Xuan-gou
    Zhang, Zhen
    2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 121 - 126
  • [22] Timo: In-memory temporal query processing for big temporal data
    Zheng, Xiao
    Liu, Houkai
    Wang, Xiujun
    Wu, Xuangou
    Yu, Feng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (13):
  • [23] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Chen, Zhiguang
    Lu, Yutong
    Xiao, Nong
    Liu, Fang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 335 - 354
  • [24] Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications
    Abrahamse, Robin
    Hadnagy, Akos
    Al-Ars, Zaid
    Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2022, 2022, : 1228 - 1234
  • [25] Enhancing in-memory efficiency for MapReduce-based data processing
    Veiga, Jorge
    Exposito, Roberto R.
    Taboada, Guillermo L.
    Tourino, Juan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 323 - 338
  • [26] Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications
    Abrahamse, Robin
    Hadnagy, Akos
    Al-Ars, Zaid
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 1228 - 1234
  • [27] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Zhiguang Chen
    Yutong Lu
    Nong Xiao
    Fang Liu
    Knowledge and Information Systems, 2014, 41 : 335 - 354
  • [28] Leveraging In-Memory Technology for Interactive Analyses of Point-of-Sales Data
    Schwalb, David
    Faust, Martin
    Krueger, Jens
    Plattner, Hasso
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2014, : 97 - 102
  • [29] Representing a Model for the Anonymization of Big Data Stream Using In-Memory Processing
    Shamsinejad E.
    Banirostam T.
    Pedram M.M.
    Rahmani A.M.
    Annals of Data Science, 2025, 12 (1) : 223 - 252
  • [30] Big data availability: Selective partial checkpointing for in-memory database queries
    Playfair, Daniel
    Trehan, Amitabh
    McLarnon, Barry
    Nikolopoulos, Dimitrios S.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2785 - 2794