When Good-Enough is Enough: Complex Queries at Fixed Cost

Cited by: 4
Authors
Mickulicz, Nathan D. [1 ]
Martins, Rolando [1 ]
Narasimhan, Priya [1 ]
Gandhi, Rajeev [1 ]
Affiliation
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
DOI
10.1109/BigDataService.2015.24
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Collections of time-series data appear in a wide variety of contexts. To gain insight into the underlying phenomenon that the data represents, one must analyze the time-series data. Analysis can quickly become challenging for very large data-sets (on the order of terabytes or more), and it may be infeasible to scan the entire data-set on each query due to time limits or resource constraints. To avoid this problem, one might pre-compute partial results by scanning the data-set, usually as the data arrives. However, for complex queries, where the value of a new data record depends on all of the data previously seen, this might be infeasible because incorporating a large amount of historical data into a query requires a large amount of storage. We present an approach to performing complex queries over very large data-sets in a manner that is (i) practical, meaning that a query does not require a scan of the entire data-set, and (ii) fixed-cost, meaning that the amount of storage required depends only on the time-range spanned by the entire data-set (and not on the size of the data-set itself). We evaluate our approach with three different data-sets: (i) a 4-year commercial analytics data-set from a production content-delivery platform with over 15 million mobile users, (ii) an 18-year data-set from the Linux-kernel commit-history, and (iii) an 8-day data-set from Common Crawl HTTP logs. Our evaluation demonstrates the feasibility and practicality of our approach for a diverse set of complex queries on a diverse set of very large data-sets.
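The fixed-cost idea in the abstract (storage that grows with the time-range covered, not with the number of records) can be illustrated with a generic time-bucketed pre-aggregation. The sketch below is an assumption for illustration only, not the authors' method: the class name `BucketedAggregate`, the bucket width, and the count/sum aggregate are all hypothetical choices, and it handles only simple additive queries rather than the complex queries the paper targets.

```python
import math


class BucketedAggregate:
    """Hypothetical sketch: one (count, total) pair per fixed-width time bucket.

    Storage is proportional to the number of buckets spanned by the data,
    i.e. to the time-range, regardless of how many records arrive.
    """

    def __init__(self, bucket_seconds):
        self.bucket_seconds = bucket_seconds
        self.buckets = {}  # bucket index -> [count, total]

    def add(self, timestamp, value):
        # Fold the record into its bucket as it arrives; the raw record
        # itself is not retained.
        index = int(timestamp // self.bucket_seconds)
        entry = self.buckets.setdefault(index, [0, 0.0])
        entry[0] += 1
        entry[1] += value

    def query(self, start, end):
        # Answer from the pre-computed buckets without scanning raw data.
        # The result is exact only when [start, end) is aligned to bucket
        # boundaries; otherwise whole overlapping buckets are included.
        first = int(start // self.bucket_seconds)
        stop = math.ceil(end / self.bucket_seconds)
        count, total = 0, 0.0
        for index in range(first, stop):
            entry = self.buckets.get(index)
            if entry is not None:
                count += entry[0]
                total += entry[1]
        return count, total


agg = BucketedAggregate(bucket_seconds=86400)  # one bucket per day
for i in range(1000):
    agg.add(timestamp=i * 100, value=1.0)
first_day = agg.query(0, 86400)
both_days = agg.query(0, 172800)
```

Here 1000 records collapse into two daily buckets, so adding more records within the same two days would not increase storage.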
Pages: 89-98
Page count: 10