Banian: A Cross-Platform Interactive Query System for Structured Big Data

被引:1
|
作者
Tao Xu [1 ]
Dongsheng Wang [2 ]
Guodong Liu [3 ]
机构
[1] the Department of Computer Science and Technology, Tsinghua University
[2] the Department of Computer Science and Technology and Tsinghua National Laboratory for Information Science and Technology, Tsinghua University
[3] Tsinghua National Laboratory for Information Science and Technology, Tsinghua University
关键词
big data; interactive query; relational database; HDFS; cross platform;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
The rapid growth of structured data has presented new technological challenges in the research fields of big data and relational database. In this paper, we present an efficient system for managing and analyzing PB level structured data called Banian. Banian overcomes the storage structure limitation of relational database and effectively integrates interactive query with large-scale storage management. It provides a uniform query interface for cross-platform datasets and thus shows favorable compatibility and scalability. Banian’s system architecture mainly includes three layers:(1) a storage layer using HDFS for the distributed storage of massive data;(2) a scheduling and execution layer employing the splitting and scheduling technology of parallel database; and(3)an application layer providing a cross-platform query interface and supporting standard SQL. We evaluate Banian using PB level Internet data and the TPC-H benchmark. The results show that when compared with Hive, Banian improves the query performance to a maximum of 30 times and achieves better scalability and concurrency.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [1] Banian: A Cross-Platform Interactive Query System for Structured Big Data
    Xu, Tao
    Wang, Dongsheng
    Liu, Guodong
    TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (01) : 62 - 71
  • [2] SPEAR-Board: Cross-Platform Interactive Spatio-Temporal Big Data Analytics
    Baig, Furqan
    Nalluri, Pradeep
    Kong, Jun
    Wang, Fusheng
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 740 - 743
  • [3] A Cross-platform Metaverse Data Management System
    Chen, Bohan
    Song, Chengxin
    Lin, Boyu
    Xu, Xin
    Tang, Ruoyan
    Lin, Yunxuan
    Yao, Yuan
    Timoney, Joseph
    Bi, Ting
    2022 IEEE INTERNATIONAL CONFERENCE ON METROLOGY FOR EXTENDED REALITY, ARTIFICIAL INTELLIGENCE AND NEURAL ENGINEERING (METROXRAINE), 2022, : 145 - 150
  • [4] CROSS-PLATFORM AVIATION ANALYTICS USING BIG-DATA METHODS
    Larsen, Tulinda
    2013 INTEGRATED COMMUNICATIONS, NAVIGATION AND SURVEILLANCE CONFERENCE (ICNS), 2013,
  • [5] Twister2 Cross-platform resource scheduler for big data
    Uyar, Ahmet
    Gunduz, Gurhan
    Kamburugamuve, Supun
    Wickramasinghe, Pulasthi
    Widanage, Chathura
    Govindarajan, Kannan
    Perera, Niranda
    Abeykoon, Vibhatha
    Akkas, Selahattin
    Fox, Geoffrey
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (09):
  • [6] Big Data Through Cross-Platform Interest-Based Interactivity
    Zelenkauskaite, Asta
    Simoes, Bruno
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 191 - +
  • [7] INforE: Interactive Cross-platform Analytics for Everyone
    Giatrakos, Nikos
    Arnu, David
    Bitsakis, Theodoros
    Deligiannakis, Antonios
    Garofalakis, Minos
    Klinkenberg, Ralf
    Konidaris, Aris
    Kontaxakis, Antonis
    Kotidis, Yannis
    Samoladas, Vasilis
    Simitsis, Alkis
    Stamatakis, George
    Temme, Fabian
    Torok, Mate
    Yaqub, Edwin
    Montagud, Arnau
    de Leon, Miguel Ponce
    Arndt, Holger
    Burkard, Stefan
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3389 - 3392
  • [8] ML-based Cross-Platform Query Optimization
    Kaoudi, Zoi
    Quiane-Ruiz, Jorge-Arnulfo
    Contreras-Rojas, Bertty
    Pardo-Meza, Rodrigo
    Troudi, Anis
    Chawla, Sanjay
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1489 - 1500
  • [9] Visual Omics Explorer (VOE): a cross-platform portal for interactive data visualization
    Kim, Baekdoo
    Ali, Thahmina
    Hosmer, Samuel
    Krampis, Konstantinos
    BIOINFORMATICS, 2016, 32 (13) : 2050 - 2052
  • [10] Remediation, convergence, and big data: Conceptual limits of cross-platform social media
    Zelenkauskaite, Asta
    CONVERGENCE-THE INTERNATIONAL JOURNAL OF RESEARCH INTO NEW MEDIA TECHNOLOGIES, 2017, 23 (05): : 512 - 527