Analytic Queries over Geospatial Time-Series Data Using Distributed Hash Tables

Cited by: 17
Authors
Malensek, Matthew [1]
Pallickara, Sangmi [1]
Pallickara, Shrideep [1]
Affiliations
[1] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
Funding
National Science Foundation (USA)
Keywords
Exploratory analytics; predictive analytics; multidimensional data; distributed hash tables
DOI
10.1109/TKDE.2016.2520475
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As remote sensing equipment and networked observational devices continue to proliferate, their corresponding data volumes have surpassed the storage and processing capabilities of commodity computing hardware. This trend has led to the development of distributed storage frameworks that incrementally scale out by assimilating resources as necessary. While challenging in its own right, storing and managing voluminous datasets is only the precursor to a broader field of research: extracting insights, relationships, and models from the underlying datasets. The focus of this study is twofold: exploratory and predictive analytics over voluminous, multidimensional datasets in a distributed environment. Both of these types of analysis represent a higher-level abstraction over standard query semantics; rather than indexing every discrete value for subsequent retrieval, our framework autonomously learns the relationships and interactions between dimensions in the dataset and makes the information readily available to users. This functionality includes statistical synopses, correlation analysis, hypothesis testing, probabilistic structures, and predictive models that not only enable the discovery of nuanced relationships between dimensions, but also allow future events and trends to be predicted. The algorithms presented in this work were evaluated empirically on a real-world geospatial time-series dataset in a production environment, and are broadly applicable across other storage frameworks.
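The abstract mentions statistical synopses and correlation analysis maintained across a distributed environment. As a rough illustration only, and not the authors' implementation, the sketch below keeps a running summary of two dimensions on each node and merges the partial summaries to obtain a Pearson correlation without revisiting the raw observations. The class name `PairSynopsis` and its methods are hypothetical; the update rule is the standard Welford-style online algorithm, and the merge step is the usual pairwise combination of partial sums.

```python
import math
from dataclasses import dataclass


@dataclass
class PairSynopsis:
    """Running summary of two dimensions (x, y). Hypothetical, for illustration."""
    n: int = 0
    mean_x: float = 0.0
    mean_y: float = 0.0
    m2_x: float = 0.0  # sum of squared deviations of x from its mean
    m2_y: float = 0.0  # sum of squared deviations of y from its mean
    c_xy: float = 0.0  # sum of co-deviations of x and y

    def update(self, x: float, y: float) -> None:
        """Fold one observation into the summary in O(1) time and space."""
        self.n += 1
        dx = x - self.mean_x  # deviation from the old mean of x
        dy = y - self.mean_y  # deviation from the old mean of y
        self.mean_x += dx / self.n
        self.mean_y += dy / self.n
        self.m2_x += dx * (x - self.mean_x)
        self.m2_y += dy * (y - self.mean_y)
        self.c_xy += dx * (y - self.mean_y)

    def merge(self, other: "PairSynopsis") -> "PairSynopsis":
        """Combine two partial summaries, e.g. from two storage nodes."""
        n = self.n + other.n
        if n == 0:
            return PairSynopsis()
        dx = other.mean_x - self.mean_x
        dy = other.mean_y - self.mean_y
        w = self.n * other.n / n
        return PairSynopsis(
            n=n,
            mean_x=self.mean_x + dx * other.n / n,
            mean_y=self.mean_y + dy * other.n / n,
            m2_x=self.m2_x + other.m2_x + dx * dx * w,
            m2_y=self.m2_y + other.m2_y + dy * dy * w,
            c_xy=self.c_xy + other.c_xy + dx * dy * w,
        )

    def correlation(self) -> float:
        """Pearson correlation; NaN if either dimension has zero variance."""
        denom = math.sqrt(self.m2_x * self.m2_y)
        return self.c_xy / denom if denom > 0 else float("nan")
```

In this sketch each node would call `update` as observations arrive, and a query would `merge` the per-node summaries before calling `correlation`, so only a handful of numbers per node cross the network.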
Pages: 1408-1422
Number of pages: 15