ByteGraph: A High-Performance Distributed Graph Database in ByteDance

被引:5
|
作者
Li, Changji [1 ,2 ]
Chen, Hongzhi [2 ]
Zhang, Shuai [2 ]
Hu, Yingqian [2 ]
Chen, Chao [2 ]
Zhang, Zhenjie [2 ]
Li, Meng [2 ]
Li, Xiangchen [2 ]
Han, Dongqing [2 ]
Chen, Xiaohui [2 ]
Wang, Xudong [2 ]
Zhu, Huiming [2 ]
Fu, Xuwei [2 ]
Wu, Tingwei [2 ]
Tan, Hongfei [2 ]
Ding, Hengtian [2 ]
Liu, Mengjin [2 ]
Wang, Kangcheng [2 ]
Ye, Ting [2 ]
Li, Lei [2 ]
Li, Xin [2 ]
Wang, Yu [2 ]
Zheng, Chenguang [1 ,2 ]
Yang, Hao [2 ]
Cheng, James [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] ByteDance Inc, Beijing, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2022年 / 15卷 / 12期
关键词
D O I
10.14778/3554821.3554824
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most products at ByteDance, e.g., TikTok, Douyin, and Toutiao, naturally generate massive amounts of graph data. To efficiently store, query and update massive graph data is challenging for the broad range of products at ByteDance with various performance requirements. We categorize graph workloads at ByteDance into three types: online analytical, transaction, and serving processing, where each workload has its own characteristics. Existing graph databases have different performance bottlenecks in handling these workloads and none can efficiently handle the scale of graphs at ByteDance. We developed ByteGraph to process these graph workloads with high throughput, low latency and high scalability. There are several key designs in ByteGraph that make it efficient for processing our workloads, including edge-trees to store adjacency lists for high parallelism and low memory usage, adaptive optimizations on thread pools and indexes, and geographic replications to achieve fault tolerance and availability. ByteGraph has been in production use for several years and its performance has shown to be robust for processing a wide range of graph workloads at ByteDance.
引用
收藏
页码:3306 / 3318
页数:13
相关论文
共 50 条
  • [21] High-performance Graph Analytics on Manycore Processors
    Slota, George M.
    Rajamanickam, Sivasankaran
    Madduri, Kamesh
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 17 - 27
  • [22] Fraud detection in the distributed graph database
    Srivastava, Sakshi
    Singh, Anil Kumar
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (01): : 515 - 537
  • [23] Fraud detection in the distributed graph database
    Sakshi Srivastava
    Anil Kumar Singh
    Cluster Computing, 2023, 26 : 515 - 537
  • [24] CDSBen: Benchmarking the Performance of Storage Services in Cloud-native Database System at ByteDance
    Zhang, Jiashu
    Jiang, Wen
    Tang, Bo
    Ma, Haoxiang
    Cao, Lixun
    Jiang, Zhongbin
    Nie, Yuanyuan
    Wang, Fan
    Zhang, Lei
    Liang, Yuming
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (12): : 3584 - 3596
  • [25] High-Performance Graph Storage and Mutation for Graph Processing and Streaming: A Review
    Firmli, Soukaina
    Chiadmi, Dalila
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2025, 16 (01) : 19 - 30
  • [26] A DATABASE CACHE FOR HIGH-PERFORMANCE AND FAST RESTART IN DATABASE-SYSTEMS
    ELHARDT, K
    BAYER, R
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 1984, 9 (04): : 503 - 525
  • [27] Log(Graph): A Near-Optimal High-Performance Graph Representation
    Besta, Maciej
    Stanojevic, Dimitri
    Zivic, Tijana
    Singh, Jagpreet
    Hoerold, Maurice
    Hoefler, Torsten
    27TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2018), 2018,
  • [28] HIGH-PERFORMANCE DISTRIBUTED COMPUTING - PROMISES AND CHALLENGES
    HARIRI, S
    VARMA, A
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1993, 5 (04): : 233 - 238
  • [29] The BORG distributed architecture for high-performance computing
    Mou, ZG
    Duong, L
    Donuhue, D
    Ku, HC
    APPLICATIONS OF HIGH-PERFORMANCE COMPUTING IN ENGINEERING VI, 2000, 6 : 399 - 408
  • [30] Topic 11: Distributed and high-performance multimedia
    Coulson, Geoff
    Kosch, Harald
    Kao, Odej
    Seinstra, Frank
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2006, 4128 LNCS