Analytic Queries over Geospatial Time-Series Data Using Distributed Hash Tables

被引:17
|
作者
Malensek, Matthew [1 ]
Pallickara, Sangmi [1 ]
Pallickara, Shrideep [1 ]
机构
[1] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
基金
美国国家科学基金会;
关键词
Exploratory analytics; predictive analytics; multidimensional data; distributed hash tables;
D O I
10.1109/TKDE.2016.2520475
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As remote sensing equipment and networked observational devices continue to proliferate, their corresponding data volumes have surpassed the storage and processing capabilities of commodity computing hardware. This trend has led to the development of distributed storage frameworks that incrementally scale out by assimilating resources as necessary. While challenging in its own right, storing and managing voluminous datasets is only the precursor to a broader field of research: extracting insights, relationships, and models from the underlying datasets. The focus of this study is twofold: exploratory and predictive analytics over voluminous, multidimensional datasets in a distributed environment. Both of these types of analysis represent a higher-level abstraction over standard query semantics; rather than indexing every discrete value for subsequent retrieval, our framework autonomously learns the relationships and interactions between dimensions in the dataset and makes the information readily available to users. This functionality includes statistical synopses, correlation analysis, hypothesis testing, probabilistic structures, and predictive models that not only enable the discovery of nuanced relationships between dimensions, but also allow future events and trends to be predicted. The algorithms presented in this work were evaluated empirically on a real-world geospatial time-series dataset in a production environment, and are broadly applicable across other storage frameworks.
引用
收藏
页码:1408 / 1422
页数:15
相关论文
共 50 条
  • [41] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [42] Multi-scale histograms for answering queries over time series data
    Chen, L
    Özsu, MT
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 838 - 838
  • [43] Durable Queries over Historical Time Series
    Wang, Hao
    Cai, Yilun
    Yang, Yin
    Zhang, Shiming
    Mamoulis, Nikos
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (03) : 595 - 607
  • [44] Time-series analysis of open data for studying urban heat island phenomenon: a geospatial approach
    Priyanka Rao
    Abhishek Singh
    Kamal Pandey
    Spatial Information Research, 2021, 29 : 907 - 918
  • [45] RACED: Routing in PAyment Channel NEtworks Using Distributed Hash Tables
    Kolachala, Kartick
    Ababneh, Mohammed
    Vishwanathan, Roopa
    PROCEEDINGS OF THE 19TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ACM ASIACCS 2024, 2024, : 1895 - 1910
  • [46] Time-series analysis of open data for studying urban heat island phenomenon: a geospatial approach
    Rao, Priyanka
    Singh, Abhishek
    Pandey, Kamal
    SPATIAL INFORMATION RESEARCH, 2021, 29 (06) : 907 - 918
  • [47] Integration of Shareable Containers with Distributed Hash Tables for Storage of Structured and Dynamic Data
    Kuehn, Eva
    Mordinyi, Richard
    Goiss, Hannu-D.
    Moser, Thomas
    Bessler, Sandford
    Tomic, Slobodanka
    CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009, : 866 - +
  • [48] FedQOGD: Federated Quantized Online Gradient Descent with Distributed Time-Series Data
    Park, Jonghwan
    Kwon, Dohyeok
    Hong, Songnam
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 536 - 541
  • [49] Cost-Effective Bad Synchrophasor Data Detection Based on Unsupervised Time-Series Data Analytic
    Zhu, Lipeng
    Hill, David J.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (03) : 2027 - 2039
  • [50] Ratio Threshold Queries over Distributed Data Sources
    Gupta, Rajeev
    Ramamritham, Krithi
    Mohania, Mukesh
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 581 - 584