Analytic Queries over Geospatial Time-Series Data Using Distributed Hash Tables

Cited by: 17
Authors
Malensek, Matthew [1]
Pallickara, Sangmi [1]
Pallickara, Shrideep [1]
Affiliations
[1] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
Funding
National Science Foundation (USA);
Keywords
Exploratory analytics; predictive analytics; multidimensional data; distributed hash tables;
DOI
10.1109/TKDE.2016.2520475
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As remote sensing equipment and networked observational devices continue to proliferate, their corresponding data volumes have surpassed the storage and processing capabilities of commodity computing hardware. This trend has led to the development of distributed storage frameworks that incrementally scale out by assimilating resources as necessary. While challenging in its own right, storing and managing voluminous datasets is only the precursor to a broader field of research: extracting insights, relationships, and models from the underlying datasets. The focus of this study is twofold: exploratory and predictive analytics over voluminous, multidimensional datasets in a distributed environment. Both of these types of analysis represent a higher-level abstraction over standard query semantics; rather than indexing every discrete value for subsequent retrieval, our framework autonomously learns the relationships and interactions between dimensions in the dataset and makes the information readily available to users. This functionality includes statistical synopses, correlation analysis, hypothesis testing, probabilistic structures, and predictive models that not only enable the discovery of nuanced relationships between dimensions, but also allow future events and trends to be predicted. The algorithms presented in this work were evaluated empirically on a real-world geospatial time-series dataset in a production environment, and are broadly applicable across other storage frameworks.
Pages: 1408-1422
Page count: 15
Related papers
50 in total
  • [1] Evaluating Geospatial Geometry and Proximity Queries Using Distributed Hash Tables
    Malensek, Matthew
    Pallickara, Sangmi
    Pallickara, Shrideep
    COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (04) : 53 - 61
  • [2] Enhanced distributed hash tables for complex queries
    Garg, Pankaj
    Kumar, Amit
    Saran, Huzur
    2006 1ST INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS SOFTWARE & MIDDLEWARE, VOLS 1 AND 2, 2006, : 514 - +
  • [3] Implementing range queries with a decentralized balanced tree over distributed hash tables
    Lopes, Nuno
    Baquero, Carlos
    NETWORK-BASED INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4658 : 197 - +
  • [4] Efficient processing of continuous join queries using distributed hash tables
    Palma, Wenceslao
    Akbarinia, Reza
    Pacitti, Esther
    Valduriez, Patrick
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 632 - 641
  • [5] Bringing efficient advanced queries to distributed Hash Tables
    Bauer, D
    Hurley, P
    Pletka, R
    Waldvogel, M
    LCN 2004: 29TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2004, : 6 - 14
  • [6] Data Distribution Algorithm using Time based Weighted Distributed Hash Tables
    Lang, Rongling
    Deng, Zhiqun
    GCC 2008: SEVENTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2008, : 210 - +
  • [7] Processing top-k queries in distributed hash tables
    Akbarinia, Reza
    Pacitti, Esther
    Valduriez, Patrick
    EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 489 - +
  • [8] An Efficient Computation of Skyline Queries Using Hash Tables
    Choi, Jong Hyeok
    Lee, Jong Yun
    Shin, HyunSoon
    Nasridinov, Aziz
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2348 - 2353
  • [9] Distributed Programming over Time-series Graphs
    Simmhan, Yogesh
    Choudhury, Neel
    Wickramaarachchi, Charith
    Kumbhare, Alok
    Frincu, Marc
    Raghavendra, Cauligi
    Prasanna, Viktor
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 809 - 818
  • [10] Atomic data access in distributed hash tables
    Lynch, N
    Malkhi, D
    Ratajczak, D
    PEER-TO-PEER SYSTEMS, 2002, 2429 : 295 - 305