Simba: Spatial In-Memory Big Data Analysis

被引:10
|
作者
Xie, Dong [1 ]
Li, Feifei [1 ]
Yao, Bin [2 ]
Li, Gefei [2 ]
Chen, Zhongpu [2 ]
Zhou, Liang [2 ]
Guo, Minyi [2 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] Shanghai Jiao Tong Univ, Shanghai 200030, Peoples R China
基金
美国国家科学基金会;
关键词
Simba; Spatial data anlaysis; Big data; Distributed system;
D O I
10.1145/2996913.2996935
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present the Simba ( Spatial In-Memory Big data Analytics) system, which offers scalable and efficient in-memory spatial query processing and analytics for big spatial data. Simba natively extends the Spark SQL engine to support rich spatial queries and analytics through both SQL and DataFrame API. It enables the construction of indexes over RDDs inside the engine in order to work with big spatial data and complex spatial operations. Simba also comes with an effective query optimizer, which leverages its indexes and novel spatial-aware optimizations, to achieve both low latency and high throughput in big spatial data analysis. This demonstration proposal describes key ideas in the design of Simba, and presents a demonstration plan.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] In-Memory Representations for Mining Big Graphs
    Goyal, Shruti
    Bindu, P. V.
    Thilagana, P. Santhi
    2016 SECOND IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2016, : 163 - 168
  • [42] Optimizing Performance and Computing Resource Management of in-memory Big Data Analytics with Disaggregated Persistent Memory
    Chen, Shouwei
    Wang, Wensheng
    Wu, Xueyang
    Fan, Zhen
    Huang, Kunwu
    Zhuang, Peiyu
    Li, Yue
    Rodero, Ivan
    Parashar, Manish
    Weng, Dennis
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 21 - 30
  • [43] A perspective on applications of in-memory and associative approaches supporting cultural big data analytics
    Chianese A.
    Piccialli F.
    International Journal of Computational Science and Engineering, 2018, 16 (03): : 219 - 233
  • [44] A perspective on applications of in-memory and associative approaches supporting cultural big data analytics
    Chianese, Angelo
    Piccialli, Francesco
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2018, 16 (03) : 219 - 233
  • [45] Quantifying the Performance Impact of Large Pages on In-Memory Big-Data Workloads
    Park, Jinsu
    Han, Myeonggyun
    Baek, Woongki
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2016, : 209 - 218
  • [46] TrajS']jSpark: A Scalable and Efficient In-Memory Management System for Big Trajectory Data
    Zhang, Zhigang
    Jin, Cheqing
    Mao, Jiali
    Yang, Xiaolin
    Zhou, Aoying
    WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 11 - 26
  • [47] In-memory big data analytics under space constraints using dynamic programming
    Gai, Keke
    Qiu, Meikang
    Liu, Meiqin
    Xiong, Zenggang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 83 : 219 - 227
  • [48] SwapQt: Cloud-based in-memory indexing of dynamic spatial data
    Jadallah, Hiba
    Al Aghbari, Zaher
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 106 : 360 - 373
  • [49] Enhancing the Interactive Visualisation of a Data Preparation Tool from in-Memory Fitting to Big Data Sets
    Epelde, Gorka
    Alvarez, Roberto
    Beristain, Andoni
    Arrue, Monica
    Arangoa, Itsasne
    Rankin, Debbie
    BUSINESS INFORMATION SYSTEMS WORKSHOPS (BIS 2020), 2020, 394 : 272 - 284
  • [50] Data Prefetching and Eviction Mechanisms of In-Memory Storage Systems Based on Scheduling for Big Data Processing
    Chen, Chien-Hung
    Hsia, Ting-Yuan
    Huang, Yennun
    Kuo, Sy-Yen
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (08) : 1738 - 1752