When Good-Enough is Enough: Complex Queries at Fixed Cost

被引：4

作者：

Mickulicz, Nathan D. ^{[1
]}

Martins, Rolando ^{[1
]}

Narasimhan, Priya ^{[1
]}

Gandhi, Rajecv ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA

来源：

2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015) | 2015年

关键词：

D O I：

10.1109/BigDataService.2015.24

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Collections of time-series data appear in a wide variety of contexts. To gain insight into the underlying phenomenon (that the data represents), one must analyze the time-series data. Analysis can quickly become challenging for very large data (similar to terabytes or more) sets, and it may be infeasible to scan the entire data-set on each query due to time limits or resource constraints. To avoid this problem, one might pre-compute partial results by scanning the data-set (usually as the data arrives). However, for complex queries, where the value of a new data record depends on all of the data previously seen, this might be infeasible because incorporating a large amount of historical data into a query requires a large amount of storage. We present an approach to performing complex queries over very large data-sets in a manner that is (i) practical, meaning that a query does not require a scan of the entire data-set, and (ii) fixed-cost, meaning that the amount of storage required only depends on the time-range spanned by the entire data-set (and not the size of the data-set itself). We evaluate our approach with three different data-sets: (i) a 4-year commercial analytics data-set from a production content-delivery platform with over 15 million mobile users, (ii) an 18-year data-set from the Linux-kernel commit-history, and (iii) an 8-day data-set from Common Crawl HTTP logs. Our evaluation demonstrates the feasibility and practicality of our approach for a diverse set of complex queries on a diverse set of very large data-sets.

引用

页码：89 / 98

页数：10

共 50 条

[21] God, gays and good-enough enemies
Cynthia Burack
Psychoanalysis, Culture & Society, 2009, 14 (1) : 41 - 48
[22] Good-enough software process in Nokia
Känsälä, K
PRODUCT FOCUSED SOFTWARE PROCESS IMPROVEMENT, 2004, 3009 : 424 - 430
[23] Good-enough representations in language comprehension
Ferreira, F
Bailey, KGD
Ferraro, V
CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2002, 11 (01) : 11 - 15
[24] Good-enough reading: Momentum and accuracy in the reading of complex fiction
Mackey, M
RESEARCH IN THE TEACHING OF ENGLISH, 1997, 31 (04) : 428 - 458
[25] THE GOOD-ENOUGH FATHER OF WHATEVER SEX
SAMUELS, A
FEMINISM & PSYCHOLOGY, 1995, 5 (04) : 511 - 530
[26] Representation and Practical Accomplishment in the Laboratory: When is an Animal Model Good-enough?
Lewis, Jamie
Atkinson, Paul
Harrington, Jean
Featherstone, Katie
SOCIOLOGY-THE JOURNAL OF THE BRITISH SOCIOLOGICAL ASSOCIATION, 2013, 47 (04): : 776 - 792
[27] "Good-enough" and "not good-enough" parenting of persons under long-term follow-up psychiatrical observation
Rusakovskaya, O.
Kostjuk, G.
Golubev, S.
Drykina, L.
Galkina, A.
Andrianova, S.
Nyrkova, A.
EUROPEAN PSYCHIATRY, 2020, 63 : S190 - S191
[28] When true enough is not good enough
Nature Medicine, 2013, 19 (1) : 1 - 1
[29] When true enough is not good enough
不详
NATURE MEDICINE, 2013, 19 (01) : 1 - 1
[30] The making of an analyst: from 'ideal' to 'good-enough'
Kelly, Tom
JOURNAL OF ANALYTICAL PSYCHOLOGY, 2007, 52 (02) : 157 - 169

← 1 2 3 4 5 →