FishStore: Fast Ingestion and Indexing of Raw Data

被引:2
|
作者
Chandramouli, Badrish [1 ]
Xie, Dong [2 ]
Li, Yinan [1 ]
Kossmann, Donald [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Univ Utah, Salt Lake City, UT 84112 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 12期
关键词
D O I
10.14778/3352063.3352100
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The last decade has witnessed a huge increase in data being ingested into the cloud from a variety of data sources. The ingested data takes various forms such as JSON, CSV, and binary formats. Traditionally, data is either ingested into storage in raw form, indexed ad-hoc using range indices, or cooked into analytics-friendly columnar formats. None of these solutions is able to handle modern requirements on storage: making the data available immediately for ad-hoc and streaming queries while ingesting at extremely high throughputs. We demonstrate FishStore, our open-source concurrent latch-free storage layer for data with flexible schema. FishStore builds on recent advances in parsing and indexing techniques, and is based on multi-chain hash indexing of dynamically registered predicated subsets of data. We find predicated subset hashing to be a powerful primitive that supports a broad range of queries on ingested data and admits a higher performance (by up to an order of magnitude) implementation than current alternatives.
引用
收藏
页码:1922 / 1925
页数:4
相关论文
共 50 条
  • [31] AN OUTBREAK OF GASTROENTERITIS ASSOCIATED WITH INGESTION OF RAW CLAMS
    RATZAN, KR
    BRYAN, JA
    KRACKOW, J
    MEYER, G
    LARSON, CD
    JOURNAL OF INFECTIOUS DISEASES, 1969, 120 (02): : 265 - &
  • [32] On indexing evidential data
    Bahri, Nassim
    Tobji, Mohamed Anis Bach
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 106 : 63 - 87
  • [33] FAST REVERSIBLE CODING AND INDEXING OF COLORATIONS
    DURRE, K
    COMPUTING, 1976, 16 (03) : 271 - 279
  • [34] A Fast PQ Hash Code Indexing
    Shan, Jingsong
    Zhang, Yongjun
    Jiang, Mingxin
    Jin, Chunhua
    Zhang, Zhengwei
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 395 - 402
  • [35] A new and fast method of image indexing
    Lina-Huang
    Zhijing-Liu
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 448 - 451
  • [36] Indexing fuzzy data
    Helmer, S
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2120 - 2125
  • [37] Indexing Evidential Data
    Jammali, Anouar
    Tobji, Mohamed Anis Bach
    Martin, Arnaud
    Ben Yaghlane, Boutheina
    2014 SECOND WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2014, : 196 - 201
  • [38] NUMERICAL DATA INDEXING
    MURDOCK, JW
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1980, 20 (03): : 132 - 136
  • [39] A spatiotemporal data and indexing
    Kim, JS
    Kim, DH
    Ryu, KH
    IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 110 - 113
  • [40] NUMERICAL DATA INDEXING
    MURDOCK, JW
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1979, (SEP): : 25 - 25