CINTIA: a Distributed, Low-Latency Index for Big Interval Data

Cited by: 0
Authors
Mavlyutov, Ruslan [1]
Cudre-Mauroux, Philippe [1]
Affiliation
[1] Univ Fribourg, eXascale Infolab, CH-1700 Fribourg, Switzerland
Keywords
Interval Data; Low-Latency; Scalability; Distributed Data Management
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Intervals have become prominent in data management because they are the main data structure for representing a number of key data types, such as temporal or genomic data. Yet no existing solution stores big interval data compactly and queries it efficiently. In this paper we introduce CINTIA (the Checkpoint INTerval Index Array), an efficient data structure for storing and querying interval data that achieves high memory locality and outperforms state-of-the-art solutions. We also propose a low-latency Big Data system that implements CINTIA on top of a popular distributed file system and efficiently manages large interval data on clusters of commodity machines. Our system can easily be scaled out and was designed to accommodate large delays between the various components of a distributed infrastructure. We experimentally evaluate the performance of our approach on several datasets and show that it outperforms current solutions by several orders of magnitude in distributed settings.
Pages: 619 - 628
Number of pages: 10
Related papers
50 in total
  • [1] Managing Big Interval Data with CINTIA: The Checkpoint INTerval Array
    Mavlyutov, Ruslan
    Cudre-Mauroux, Philippe
    IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (02) : 285 - 298
  • [2] Towards Low-Latency Big Data Infrastructure at Sangfor
    Chen, Fei
    Yan, Zhengzheng
    Gu, Liang
    EMERGING INFORMATION SECURITY AND APPLICATIONS, EISA 2022, 2022, 1641 : 37 - 54
  • [3] Fragola: Low-Latency Transactions in Distributed Data Stores
    Gottesman, Yonatan
    Bergman, Aran
    Bortnikov, Edward
    Hillel, Eshcar
    Keidar, Idit
    Shacham, Ohad
    PROCEEDINGS OF THE 2017 SYMPOSIUM ON CLOUD COMPUTING (SOCC '17), 2017 : 642 - 642
  • [4] Low-Latency Distributed Applications in Finance
    Brook, Andrew
    COMMUNICATIONS OF THE ACM, 2015, 58 (07) : 42 - 50
  • [5] Distributed Low-Latency Data Aggregation Scheduling in Wireless Sensor Networks
    Bagaa, Miloud
    Younis, Mohamed
    Djenouri, Djamel
    Derhab, Abdelouahid
    Badache, Nadjib
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2015, 11 (03)
  • [6] Low-latency MLLM Inference with Spatiotemporal Heterogeneous Distributed Multimodal Data
    Xu, Xiangrui
    Liu, Sicong
    Yu, Zhiwen
    Wang, Lehao
    Gu, Bin
    2024 IEEE COUPLING OF SENSING & COMPUTING IN AIOT SYSTEMS, CSCAIOT 2024, 2024 : 19 - 20
  • [7] Carousel: Low-Latency Transaction Processing for Globally-Distributed Data
    Yan, Xinan
    Yang, Linguan
    Zhang, Hongbo
    Lin, Xiayue Charles
    Wong, Bernard
    Salem, Kenneth
    Brecht, Tim
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018 : 231 - 243
  • [8] A Path Computing Scheme for Low-Latency Requirement of Medical Big Data Task
    Zhang X.
    Ren Z.
    Hu J.
    Zhang Y.
    Zhang H.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2020, 54 (02) : 119 - 126
  • [9] Low-Latency Partition Tolerant Distributed Ledger
    Gorczyca, Andrew T.
    Decker, Audrey M.
    DISRUPTIVE TECHNOLOGIES IN INFORMATION SCIENCES, 2018, 10652
  • [10] Distributed low-latency rendering for mobile AR
    Pasman, W
    Jansen, FW
    IEEE AND ACM INTERNATIONAL SYMPOSIUM ON AUGMENTED REALITY, PROCEEDINGS, 2001 : 107 - 113