Simba: Spatial In-Memory Big Data Analysis

被引:10
|
作者
Xie, Dong [1 ]
Li, Feifei [1 ]
Yao, Bin [2 ]
Li, Gefei [2 ]
Chen, Zhongpu [2 ]
Zhou, Liang [2 ]
Guo, Minyi [2 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] Shanghai Jiao Tong Univ, Shanghai 200030, Peoples R China
基金
美国国家科学基金会;
关键词
Simba; Spatial data anlaysis; Big data; Distributed system;
D O I
10.1145/2996913.2996935
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present the Simba ( Spatial In-Memory Big data Analytics) system, which offers scalable and efficient in-memory spatial query processing and analytics for big spatial data. Simba natively extends the Spark SQL engine to support rich spatial queries and analytics through both SQL and DataFrame API. It enables the construction of indexes over RDDs inside the engine in order to work with big spatial data and complex spatial operations. Simba also comes with an effective query optimizer, which leverages its indexes and novel spatial-aware optimizations, to achieve both low latency and high throughput in big spatial data analysis. This demonstration proposal describes key ideas in the design of Simba, and presents a demonstration plan.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Simba: Efficient In-Memory Spatial Analytics
    Xie, Dong
    Li, Feifei
    Yao, Bin
    Li, Gefei
    Zhou, Liang
    Guo, Minyi
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1071 - 1085
  • [2] LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data
    Tang, Mingjie
    Yu, Yongyang
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    Aref, Walid G.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1565 - 1568
  • [3] In-Memory Performance for Big Data
    Graefe, Goetz
    Volos, Haris
    Kimura, Hideaki
    Kuno, Harumi
    Tucek, Joseph
    Lillibridge, Mark
    Veitch, Alistair
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (01): : 37 - 48
  • [4] SparkNN: A distributed in-memory data partitioning for KNN queries on big spatial data
    Al Aghbari Z.
    Ismail T.
    Kamel I.
    Data Science Journal, 2020, 19 (01) : 1 - 14
  • [5] Work in Progress - In-Memory Analysis for Healthcare Big Data
    Mian, Muaz
    Teredesai, Ankur
    Hazel, David
    Pokuri, Sreenivasulu
    Uppala, Krishna
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 778 - +
  • [6] Skia: Scalable and Efficient In-Memory Analytics for Big Spatial-Textual Data
    Xu, Yang
    Yao, Bin
    Wang, Zhi-Jie
    Gao, Xiaofeng
    Xie, Jiong
    Guo, Minyi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (12) : 2467 - 2480
  • [7] In-Memory Big Data Management and Processing: A Survey
    Zhang, Hao
    Chen, Gang
    Ooi, Beng Chin
    Tan, Kian-Lee
    Zhang, Meihui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (07) : 1920 - 1948
  • [8] Distributed In-Memory Analytics for Big Temporal Data
    Yao, Bin
    Zhang, Wei
    Wang, Zhi-Jie
    Chen, Zhongpu
    Shang, Shuo
    Zheng, Kai
    Guo, Minyi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 549 - 565
  • [9] Fast and Efficient In-Memory Big Data Processing
    Malik, Babur Hayat
    Maryam, Maliha
    Khalid, Myda
    Khlaid, Javaria
    Rehman, Naj Am Ur
    Sajjad, Syeda Iqra
    Islam, Tanveer
    Butt, Umair Ahmed
    Raza, Ali
    Nasr, M. Saad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 517 - 524
  • [10] SIMBA: A Skyrmionic In-Memory Binary Neural Network Accelerator
    Miriyala, Venkata Pavan Kumar
    Vishwanath, Kale Rahul
    Fong, Xuanyao
    IEEE TRANSACTIONS ON MAGNETICS, 2020, 56 (11)