Turbine: Facebook's Service Management Platform for Stream Processing

被引:23
|
作者
Mei, Yuan [1 ]
Cheng, Luwei [1 ]
Talwar, Vanish [1 ]
Levin, Michael Y. [1 ]
Jacques-Silva, Gabriela [1 ]
Simha, Nikhil [1 ]
Banerjee, Anirban [1 ]
Smith, Brian [1 ]
Williamson, Tim [1 ]
Yilmaz, Serhat [1 ]
Chen, Weitao [1 ]
Chen, Guoqiang Jerry [1 ]
机构
[1] Facebook Inc, Menlo Pk, CA 94025 USA
关键词
Stream Processing; Cluster Management;
D O I
10.1109/ICDE48307.2020.00141
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The demand for stream processing at Facebook has grown as services increasingly rely on real-time signals to speed up decisions and actions. Emerging real-time applications require strict Service Level Objectives (SLOs) with low downtime and processing lag even in the presence of failures and load variability. Addressing this challenge at Facebook scale led to the development of Turbine, a management platform designed to bridge the gap between the capabilities of the existing generalpurpose cluster management frameworks and Facebook's stream processing requirements. Specifically, Turbine features a fast and scalable task scheduler; an efficient predictive auto scaler; and an application update mechanism that provides fault-tolerance, atomicity, consistency, isolation and durability. Turbine has been in production for over three years, and one of the core technologies that enabled a booming growth of stream processing at Facebook. It is currently deployed on clusters spanning tens of thousands of machines, managing several thousands of streaming pipelines processing terabytes of data per second in real time. Our production experience has validated Turbine's effectiveness: its task scheduler evenly balances workload fluctuation across clusters; its auto scaler effectively and predictively handles unplanned load spikes; and the application update mechanism consistently and efficiently completes high scale updates within minutes. This paper describes the Turbine architecture, discusses the design choices behind it, and shares several case studies demonstrating Turbine capabilities in production.
引用
收藏
页码:1591 / 1602
页数:12
相关论文
共 50 条
  • [1] Universal Platform for Composite Data Stream Processing Services Management
    Stelmach, Pawel
    Schauer, Patryk
    Kokot, Adam
    Demkiewicz, Maciej
    NEW RESULTS IN DEPENDABILITY AND COMPUTER SYSTEMS, 2013, 224 : 399 - 407
  • [2] Service Management in the Edge Cloud for Stream Processing of IoT Data
    Moussa, Hachem
    Yen, I-Ling
    Bastani, Farokh
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 91 - 98
  • [3] Audio/Video Stream binding in a pan-European Service Management Platform
    Sellin, E
    Loosemore, P
    Rana, S
    Dittrich, J
    Wegdam, M
    INTELLIGENCE IN SERVICES AND NETWORKS: PAVING THE WAY FOR AN OPEN SERVICE MARKET, 1999, 1597 : 357 - 372
  • [4] CNN Model on Stream Processing Platform
    Furedi, Laszlo
    Szolgay, Peter
    2009 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOLS 1 AND 2, 2009, : 843 - 846
  • [5] Decentralized management of bi-modal network resources in a distributed stream processing platform
    Asaduzzaman, Shah
    Maheswaran, Muthucumaru
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (06) : 774 - 787
  • [6] A Distributed Computing Platform for Task Stream Processing
    Xing Weiyan
    Huang Wenqing
    Liu Dong
    Deng Youyi
    INFORMATION COMPUTING AND APPLICATIONS, ICICA 2013, PT I, 2013, 391 : 110 - +
  • [7] Complex event processing oriented integrated network management service computing platform
    Deng, Bo
    Ding, Kun
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2008, 38 (SUPPL. 1): : 308 - 311
  • [8] Domesticating Facebook: The Shift From Compulsive Connection to Personal Service Platform
    Sujon, Zoetanya
    Viney, Leslie
    Toker-Turnalar, Elif
    SOCIAL MEDIA + SOCIETY, 2018, 4 (04):
  • [9] Performing Platform Governance: Facebook and the Stage Management of Data Relations
    Huang, Karen
    Krafft, P. M.
    SCIENCE AND ENGINEERING ETHICS, 2024, 30 (02)
  • [10] Performing Platform Governance: Facebook and the Stage Management of Data Relations
    Karen Huang
    P. M. Krafft
    Science and Engineering Ethics, 30